Gene TM1040_3683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3683 
Symbol 
ID4075652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp740642 
End bp742561 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content60% 
IMG OID638005203 
Productextracellular solute-binding protein 
Protein accessionYP_611912 
Protein GI99078654 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.378617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.495229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAACG CCCGTCCCCT GCTTTCCTTC ACCAGTGCCC TTGCCTTTCT GCTGCTGCCG 
CTTGCGACGG TTGCTCAACC CACCGTGCAA GAAAGCGCCT TCTGGAGCCA TGAGGTCAGC
GCAGGCGATC TGCCGCCCGC CGCTGAACGC ATCCCCGCAG AGCCGCTGGT GGTGAACCTC
GCCGCCAAGG GGCGCAGTCC GGGCATTCCC GGCGGCACGC TCAATACGAT GGTCACGCGC
TCCAAGGACA TCCGCCAGAT GGTTGTCTAC GGCTATGCGC GCCTTGTGGG ATACAACGAG
AACTACGAGC TCGTGCCCGA TGTGCTACAA AGCTTTGAAA GCGAGGGCGA TCGCAAATTC
ACCCTGCACC TCCGCGAGGG GCATAAATGG TCCTCTGGCG ATCCTTTCAC CTCGGCGGAT
TTTGAATACT GGTGGACACA CATTGCCAAC AACCGCGAGC TCAATCCCAC CGGCCCGCCG
GATTTTGTGC GCGTCGAGGG CGAGTTGCCG GAGGTCACTT TTCCGGATGA CACCACTGTG
GTTTTTGAGT GGAGCACCCC CAACCCCAGC TTCCTTCAGG TGCTGGCGCA GGCGGTGCCA
CCCTTCATCT ACCGTCCCTC CGCCTATCTG AAACAGTTCC ACAAGGACTT TGCCGACCCC
GAAGCCTTGG AAGAGGAAAT CGACTACGCC CGCGTCAAGA GCTGGGCCGC GCTGCACAAC
AAGCGCGACA ATATGGACAA GTTCGACAAT CCTGACCTGC CGACGCTACA GCCTTGGATC
AACGCCACCG CAGGCAAGAA GATCCGCCAT CAATTTGTGC GCAATCCGTA CTATCATCGC
ATCGATGAGA ACGGCGTGCA GCTCCCTTAT ATCGACACGG TCGAAATGGA GATTGTGTCG
GGCGGGTTGG TCGCGGCAAA ATCCAACGCG GGCGAGGCCG ACCTGCAGGC ACGTGGGCTT
GATTTCAGAG ACATTCCGAT CTTGCGCAAA GGCGAGGCAA ATGGTGACTA TCGCACCGAG
CTATGGTCCT CGGGCACTGC CTCGCAGATT GCGATTTATC CGAACCTGAA CGCGGCGGAT
GAAGTCTGGC GCGCGACCCT GCGCGATGTG CGTGTGCGCC GCGCCCTTTC GGTGGCGATC
AACCGAGCAG CCATCAATAA ATCGCTCTAT TTCAAGCTGG CAAAGCCCGG CGCGATGACG
GTTCTGGAGA AAAGCCCCTT CTTTGAGCAA GAGTTGCGGG ACGCGTGGGC CCAGTATGAT
CCCGATCTCG CCAACACGCT GCTGGATGAG GCAGGCCTCA CGGAACGTGA CGGCTACGGC
ATCCGCCGCC TGCCCGACGG GCGCCCGATG GAATTGGTGG TGGAAACCGC AGGCGAGCGG
CAAGAGGTAG AAAATGCGCT GCAGATCATC ACAGATGACT GGCGCGATGT GGGTGTAAAG
CTGGTGATGC GCCCGCTCGA TCGCGACATC CTGCGCAATC GTGTGTTCTC GGGCACCACC
ATGGCCTCGG TCTGGTACGG CTGGGACAAT GGCCTCCCAC AGAGCTACAC CTCCCCGGCC
TATCTTGCGC CCACGGATCA GGTGTTTTTG TCCTGGCCCA AATGGGGTCA GTATTATCAA
ACCAGCGGCG CGGTGGGCGA AGCGCCAGAT ATGGCACCGG CGCAGCGTCT GATGGAGCTT
CTGGACGACT GGAACAAGGC ACCAGATGCC AACAAACGGG CCGAAGCCTG GCATGAGATG
CTTGAAATTC ACGCCGAAAA TGTCTTTGCA ATTGGTCTGG TGGCAGCCGC GCCGCAGCCC
GTTGTGGTCT CAAACCGTCT GCGCAATGTG CCCAAGACGG CGATCTGGGC CTGGGATCCC
GGCGCACATT TTGGCGTGCA CCGGATGGAT GAGTTCTACT TTGAGGATGG CGAAGGCTGA
 
Protein sequence
MSNARPLLSF TSALAFLLLP LATVAQPTVQ ESAFWSHEVS AGDLPPAAER IPAEPLVVNL 
AAKGRSPGIP GGTLNTMVTR SKDIRQMVVY GYARLVGYNE NYELVPDVLQ SFESEGDRKF
TLHLREGHKW SSGDPFTSAD FEYWWTHIAN NRELNPTGPP DFVRVEGELP EVTFPDDTTV
VFEWSTPNPS FLQVLAQAVP PFIYRPSAYL KQFHKDFADP EALEEEIDYA RVKSWAALHN
KRDNMDKFDN PDLPTLQPWI NATAGKKIRH QFVRNPYYHR IDENGVQLPY IDTVEMEIVS
GGLVAAKSNA GEADLQARGL DFRDIPILRK GEANGDYRTE LWSSGTASQI AIYPNLNAAD
EVWRATLRDV RVRRALSVAI NRAAINKSLY FKLAKPGAMT VLEKSPFFEQ ELRDAWAQYD
PDLANTLLDE AGLTERDGYG IRRLPDGRPM ELVVETAGER QEVENALQII TDDWRDVGVK
LVMRPLDRDI LRNRVFSGTT MASVWYGWDN GLPQSYTSPA YLAPTDQVFL SWPKWGQYYQ
TSGAVGEAPD MAPAQRLMEL LDDWNKAPDA NKRAEAWHEM LEIHAENVFA IGLVAAAPQP
VVVSNRLRNV PKTAIWAWDP GAHFGVHRMD EFYFEDGEG