Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0038 |
Symbol | |
ID | 4076305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 39490 |
End bp | 41088 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005325 |
Product | extracellular solute-binding protein |
Protein accession | YP_612033 |
Protein GI | 99079879 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.216937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.341252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTT TTTCCAAACT AATCGGCTCT GCTGCACTTG GGCTCGCGCT GGGTGTCACC GCCCTGCCGG CGGTCGCCGC GACACCAGAC AATATGCTGG TCATCGCCAA CCGTATCGAC GACATCACCA CGCTGGATCC GGCGCAAAGC TTTGAATTTG CGGGCGCGGA TGTGATCCGC AACATGTACG GCAAGCTGGT GAATTTCGAC CCCTCCAACC TTGAGGCAGG CTATCAGCCC GATCTGGCCG AAAGCTGGAC TGTCTCGGAA GATGGCAAAA CCATCACCTT CACCATGCGC GAGGGCGTCA AGTTCCACTC TGGCAACCCG GTTCGGGCGG AAGATGCGGC CTTCTCACTG CGCCGCGTGA TCAAGCTCAA CAAAACGCCG TCGTTCATCC TGACACAGTT TGGGTTCACT CCGGAAAACG TCGATGAGAT GATCACCGTC GATGGCAACA CCGTGTCGAT CACGACCGAC AAACGCTATG CGACCTCCTT TGTGCTGAAC TGCCTGACAG CGACCTTGGG CGCAATCGTC GACGAAAAAC TGGTGATGGA GCACGACAAG GACGGCGACC TCGGCAACGA GTGGCTGATG ACCAACTCCG CCGGCTCCGG CCCCTACAAG CTGGCCTCTT GGAAGCCCAA GGAAAGCGTC ACGCTGACCG CAAACCCAGA CTACTATGAA GGCGCCCCGG CGATGCAGCG CGTGATCGTG CGTCACGTTC AGGAAAGTGC GACCCAGCGC CTGATGCTGG AGCGCGGCGA TATTGACGTG GCTCGTGACC TGACCCCTGC GGATGTGGAC GGCCTGGCAG GTATCGACGG CGTCGAAGTG CAGCGCGAGA TGCGCGGTCG CCTGATGTAT GTGTCCTTCA ACCAAAAGCA TCCTGAGCTC TCCAAGCCGC AGGTCCGTCA GGCGCTGAAG TATCTGATCG ACTATGACGG CATGGAAAAC AGCTTCCTCA AGAACTGGTA TGTGAAGCAC CAGAACTTCC TGCCAAAGAC CTATCTGGGT GCCGTGGATG AAAACCCCTT CTCGCTCAAT ATCGAGAAAG CCAAGGAATT GCTGGCAGAG GCCGGTGTGC CCGAAGGGCT GGAGCTGACG GTTGGTGTAC GCGAGGCGCA AGAGCGTCTG GAGATCGCGC AATCGCTGCA AAACACCTTT TCGCAGGCCG GCATCAAGCT GAACCTCGAG GTCGGCACCG GCAAGCAGGT TCTGGGCAAG TACCGCGCGC GGGAACTGGA CATCTACCTT GGCGCCTGGG GTCCGGATTA TCCCGATCCT CAGACCAACG CGGGCACATT TGCCTATAAC CCCGACAACT CTGACGCGGC CAATGCAACG GGCCTTCTGG CGTGGCGCAA CGGTTGGGAC ACTGCAGGGC TGACCGACAA AGTCGCCGCA GCCGTGGTCG AAGGCGATAC CGTGAAACGT GCAGACATGT ATCACGAGAT CCAGGCCGAG TTCCGCGATA CCGCGCCTTT TGCCGTGATG TTCCAGCTGG TTGAACAGGC TGGCATCAGC GAAAAGGTCG AGGGTCTGAA CCTTGGTGGC GCGATCACCG CAGCCGCTTA CTGGGACGTG ACCAAGTAA
|
Protein sequence | MNAFSKLIGS AALGLALGVT ALPAVAATPD NMLVIANRID DITTLDPAQS FEFAGADVIR NMYGKLVNFD PSNLEAGYQP DLAESWTVSE DGKTITFTMR EGVKFHSGNP VRAEDAAFSL RRVIKLNKTP SFILTQFGFT PENVDEMITV DGNTVSITTD KRYATSFVLN CLTATLGAIV DEKLVMEHDK DGDLGNEWLM TNSAGSGPYK LASWKPKESV TLTANPDYYE GAPAMQRVIV RHVQESATQR LMLERGDIDV ARDLTPADVD GLAGIDGVEV QREMRGRLMY VSFNQKHPEL SKPQVRQALK YLIDYDGMEN SFLKNWYVKH QNFLPKTYLG AVDENPFSLN IEKAKELLAE AGVPEGLELT VGVREAQERL EIAQSLQNTF SQAGIKLNLE VGTGKQVLGK YRARELDIYL GAWGPDYPDP QTNAGTFAYN PDNSDAANAT GLLAWRNGWD TAGLTDKVAA AVVEGDTVKR ADMYHEIQAE FRDTAPFAVM FQLVEQAGIS EKVEGLNLGG AITAAAYWDV TK
|
| |