Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0197 |
Symbol | |
ID | 5537658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 236232 |
End bp | 238049 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640892360 |
Product | extracellular solute-binding protein |
Protein accession | YP_001430348 |
Protein GI | 156740219 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.386701 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAGC AACGACTCAG AAGTTTCGTG CAGCGTGTTC TGATCCTGAC GGTCGCCTGC ATCGCACTGG CGGGATGCAC GATTGAGGGA GGCGTCCCGA CGCCGACGCC TGAACCAACC CTCCCGGCGA GCGCTCCCAT TCCCACCGGT GCAGTGGTTG CCGAACGCAT CGCCGAGCGC AACGACACAT GGATGATCGG GGCGCTTGAT CTTCCAGCGG ATCTGTACCC ATACCCGCAG TCCGCCGCCA CCCGCCGCGC ATCGGCGACG ATCACCGAGT TGCTCTTCCC GTCGCCAATC CTACCCTATA ACTACGGTTA TACCGCAACA GGAGTGCTCG AACGCATCCC CACACTCGAA AATGGCGATG CGGAAATGCG CAAGGTCGAT GTCTATCTCG ATGCCACCGG CGCGATAACC ACTACGGTCA CCGATGTCGT CACCCAGGTC GATCAACTGG TGATCACCTT CCGCTGGAAC CCGCGGCTGC GCTGGTCCGA CGGCACGCCG GTTACAGCGG ACGACTCGGT GTTCGCCTAT GAATTGGCGA AAGCCGCGCC GCCAGGCGAC GCGGCAGCCG AATTGCTGGC AAAAACCGCT ACATACGAGA AGATCGATGA TCATACCACG CGCGCGGTGC TCCGCCCCGA TTATGTGGGG GCGGCATATT TTGTGAGTTA CTGGACGCCG CTGCCACGCC ATCTGCTCCA GGGCGTCGAT CCGGCGCGGG TGCGCGAGAG CGCATTTGCT CGTCAACCGG TCGGGTATGG TCCCTATATG CTGGTCGAAC GCACTGCCAC TGAACTGCGC TTCGAGCGCA ACCCGCATTA TTTCGGTCCG ACGCCAGCGG CGTCGCGGCT GGTCGTGCGC GTGTTTCCCG ATCTCGACCT GCTGCGCGCC AATCTGCTCA ACGGCAATCT CGATCTGGGC ATTGCTGATC GGATCTCGAC CGCCCCGCTG ATCCGCTTCG ACACCGACGC CGCCGAAGGC GCCGTACAGG TCTTCACCGT CTCCAGTCCA GTTTGGGAAC ATATTCTGTT CAATCTGGAT GTTCCGGCAC TCCAGGATAT TCGGGTGCGG CGCGCGCTGG CGTATGGCAC AAACCGGCAG GCGATGGTCG ATGCGCTCTT TGGTGGACGA ACGCCGGTTC TGGATGGTTG GGTGGTGCCG GAGCACCCCC TTGCCGCGCC GCCCGATCAG GTGACCCGCT ACCCGTATAA TCCCGATCAG GCACGGCAGT TGCTCGATGA AGCCGGATAT ACCGACCCCG ATGGCGATGG CATCCGTGCG TCGCCCGATG GCGCCACGCT AACGCTGCAA CTGCTGACGA CGCAAGGGAG CGACGTGCGG CGCGCAATTG CCCGCCGTTT TCAAGCAGAT ATGCGCGCAA TCGGCGTCGC AATCGATATT AACGAAGCGT CGCCCGACGA GGTGTTCGAC TCAGATGGAC CCCTCTACCT CCGACAGTTC GATCTTGCGC TCTTCGGGTG GATCGCTGGA CCAGAGCCGG GCGGGTTGCA ACTCTGGAGT TGCGCCGCCG TTCCCGCCGA GAGCAACAAC TATCGTGGCG AGAACTTTGC CGGCTGGTGT TTCCGCGACG CGGATCGCGC GGTACGCACC GCCGACACGA CCCTCGACCC CGCCGAACGG GCTGAGGCGT ACCTGCGTCA GCAGCAACTG TGGACGCAGG AACTCCCGGC GATTCCTCTG TTTCAACGCC TGAGCATCGT GGTGGCAGCA CCCGATGTGC GTGGGCTTGC CCCCGATCCC CTCGCGCCGG TGACATGGAA TGTGGCGGCG TGGAAAAGGG AAAAGTAA
|
Protein sequence | MTQQRLRSFV QRVLILTVAC IALAGCTIEG GVPTPTPEPT LPASAPIPTG AVVAERIAER NDTWMIGALD LPADLYPYPQ SAATRRASAT ITELLFPSPI LPYNYGYTAT GVLERIPTLE NGDAEMRKVD VYLDATGAIT TTVTDVVTQV DQLVITFRWN PRLRWSDGTP VTADDSVFAY ELAKAAPPGD AAAELLAKTA TYEKIDDHTT RAVLRPDYVG AAYFVSYWTP LPRHLLQGVD PARVRESAFA RQPVGYGPYM LVERTATELR FERNPHYFGP TPAASRLVVR VFPDLDLLRA NLLNGNLDLG IADRISTAPL IRFDTDAAEG AVQVFTVSSP VWEHILFNLD VPALQDIRVR RALAYGTNRQ AMVDALFGGR TPVLDGWVVP EHPLAAPPDQ VTRYPYNPDQ ARQLLDEAGY TDPDGDGIRA SPDGATLTLQ LLTTQGSDVR RAIARRFQAD MRAIGVAIDI NEASPDEVFD SDGPLYLRQF DLALFGWIAG PEPGGLQLWS CAAVPAESNN YRGENFAGWC FRDADRAVRT ADTTLDPAER AEAYLRQQQL WTQELPAIPL FQRLSIVVAA PDVRGLAPDP LAPVTWNVAA WKREK
|
| |