Gene Information |
Locus tag | TM1040_3112 |
Symbol | |
ID | 4075559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 83029 |
End bp | 84546 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 638004614 |
Product | extracellular solute-binding protein |
Protein accession | YP_611348 |
Protein GI | 99078090 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
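The coordinates and lengths in the Gene Information table agree with each other, and this can be checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(83029, 84546)
print(length, protein_length(length))  # prints: 1518 505
```

Under these assumptions the computed values match the listed gene length (1518 bp) and protein length (505 aa).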
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.817588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTCA GATCCATGCT GCTTGCCTCA GCGCTGGCGC TTTCCTCCGC GCTTCCCAGT TTTGCCGACA AGGCGAATGA CACGCTGGTC GCAGCTTTCA ACAAGGAAGT TCAGACGCTT GACGGTCTCT ACTCAACGTC GCGCGAGAAC CTGATCCTGT CCTATCTGAC CTCGGATCAG CTGGTCGAGC TGAACCTCGA CACCGGAGAG TATGAAGGGG CACTAGCGGA AAGCTATACT TGGGTCGATG ATCGCACCAT CGATTTCACA CTCCGCGAAG GCCTGACATT CCATGACGGT TCGCCTGTAA TGGTCGAAGA CATCGTCTAT TCCTTTGACT GGATCGCCAA TGCGGACTCC AAGACCAAAC GCGGCGCCTT TATCCGTGGT TGGTTCGAGA GCGCTGTCGC GATTGATGAT CGCACAGTGC GCGTCACCGC CAAACAACCC TATCCTTTGA TGCTGCGTGA CATCGCCGTC TTCGTTCTCA CTCGCAAGGC AGGCAGCTAT GGCGATGGCA ACCCTGATGC GCTGACGCAG AACTTCGTCG GCACCGGCCC TTACAAGATT TCTGAATTTG CCATGGGCGC CGGTGTGCAG CTGGAGCGCT ATGATGGATA CTACACTGGC GGACCCAAGG CGGCCGGTTC GATTGAGAAA ATCGTCCTGC GCCCGATTCC CGACTGGGGC ACCGTGACGG CGGAATTGCT GTCGGGGGGC GTAAACTGGT CTTTCAACGT GCCTGACGAT ACGGCCAAAG ATCTGGGCGG ATTGCCCATG GTGGATCATG TATCCGGCGT GTCCACGCGC GTTGCCTTCC TGGTGCTGGA CGCCGCAGGT GTCAGCGATG CGGAAGGCCC AATGACCAAC AAGCTGGTGC GTCAGGCGCT CAACCATGCG GTAAACCGGA AAGAAATCGT TGAATTTCTC GTCGGCGGTT CGGGCCGCGT TGTTCACTCG ACCTGTAACG CGGGCATGTT CGGCTGTGAT GTCGAGATCA CGGAATACGA TTATGACCCC GAAAAGGCCA AGGCTTTGCT GGCTGAAGCG GGCTACCCGG ACGGCTTTGA GTTCGACCTG ACCGCCTATC GCGAACGCCC CATCATGGAA GCAGTTGCCG CGGATCTGGC CGAAATCGGC GTGATCGCGA ATATCAACTT CGTAAAGCTT TCCGCGTTGT CCAAATCCCG CGCCGAAGGT CAGCTTGAAG CATTCCAGAA CGCCTGGGGC TTTTATGCGA CGCCGGATCT GGGCGCGATT TCCAACTACT ATGTCGAAGG ATCCAACCGC AACCTACACC AAGACGCAGA GGTTCAAGGT TGGTTCAAGG CTGCGCTGGA AACTGTCGAT CAAGACGAGC GCGCAGATCT CTATGCGAAG GCCCTGCAGA AGATCGCCGA TGAGGCTTAC CTGCTTCCGA TCTTCCAGTA TTCGCAAAAC TACGTGAAGA GCGTGGATGT GAATTTTGCG GCACCGGCCG ACGGCCTGCC ACGGCTCAAT GAGCTGAGCT GGAAGTAA
|
Protein sequence | MSVRSMLLAS ALALSSALPS FADKANDTLV AAFNKEVQTL DGLYSTSREN LILSYLTSDQ LVELNLDTGE YEGALAESYT WVDDRTIDFT LREGLTFHDG SPVMVEDIVY SFDWIANADS KTKRGAFIRG WFESAVAIDD RTVRVTAKQP YPLMLRDIAV FVLTRKAGSY GDGNPDALTQ NFVGTGPYKI SEFAMGAGVQ LERYDGYYTG GPKAAGSIEK IVLRPIPDWG TVTAELLSGG VNWSFNVPDD TAKDLGGLPM VDHVSGVSTR VAFLVLDAAG VSDAEGPMTN KLVRQALNHA VNRKEIVEFL VGGSGRVVHS TCNAGMFGCD VEITEYDYDP EKAKALLAEA GYPDGFEFDL TAYRERPIME AVAADLAEIG VIANINFVKL SALSKSRAEG QLEAFQNAWG FYATPDLGAI SNYYVEGSNR NLHQDAEVQG WFKAALETVD QDERADLYAK ALQKIADEAY LLPIFQYSQN YVKSVDVNFA APADGLPRLN ELSWK
|
| |
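The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation routine. This is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence listed above.
print(translate("ATGTCCGTCA GATCCATGCT G"))  # prints: MSVRSML
```

Passing the full gene sequence to `translate` should yield the listed protein sequence once the grouping spaces in the latter are removed.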