Gene TM1040_3312 details


Gene Information

Locus tag: TM1040_3312
Symbol:
ID: 4075717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 321168
End bp: 322187
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 58%
IMG OID: 638004820
Product: putative periplasmic solute-binding protein
Protein accession: YP_611546
Protein GI: 99078288
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
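
The coordinate and length fields above can be checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the values listed in this record. The function names are illustrative and are not part of any database API.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length_bp(321168, 322187)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1020 339
```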


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00706911
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATA TCGCTCTTAC GGTTCTGGCT GCAACCGCTC TTGCGGCACC GCTGACCGCG 
TCTTTTGCAT CTTCGGCGCA GGCCGAAGAC GCCATCTGCT ACAATTGCCC CCCGCAATGG
GCGGATTGGG CGTCCATGCT GGAGGCCATC GAGACCGAGA TCGGTGTCAG CCTTCCGCAT
GACAACAAGA ACTCCGGCCA GACATTTGCA CAGCTTGTGG CTGAAAAAGA CAGCCCCGTG
GCAGATGTGG CCTATTACGG TGTGACCACC GGCATCAAGG CGGGCAAGGA AGGTCTGGTC
GAGGCGTACA AGCCCGCAGG TTTTGACGAG ATCCCGGAGG GGCTCAAAGA CCCCGAAGGC
AAGTGGTTCG CAGTGCATTA CGGCACCATC GGGTTCTTTG TGAATGTGGA CGCCCTTGGC
GGCGCGCCCG TCCCGCAGTG CTTTGCAGAC CTGAAAAAGC CTGCCTATCA GGGAATGGTG
GGTTATCTGG ATCCCTCGTC GGCCTTTGTC GGATATGCCG GGGCCGTCGC CGTCAACCTT
TCCTTTGGGG GCGATCTGCA AGACTTTGAC CCCGCAATCG AGTATTTTTC CGAGCTGGCA
GAGAACGCAC CGATCGTGCC CAAGCAGACG TCTTATGCGC GGGTCGTATC GGGAGAGATC
CCGATCCTGT TTGATTACGA CTTCAACGCC TATCGCGCGA AATACGAAGA AGACGGAAAT
TTTGAATTTG TCCTGCCCTG CGAGGGGTCG GTGCGTGTAC CCTATGTCAT GAGCCTCGTG
GGCAATGCGC CTCACGGCGA GACCGGCAAG AAGGTTCTGG ATTTCATTCT CTCTGACAAA
GGGCAGGCGA TCTGGACCAA CGCCTATCTG CAGCCCGCGC GCCCGGTTGA GCTGCCTGCT
GAGGTGGCGG AGAAATTCCT GCCCGCCAGC GATTATGCCC GTGCACAGGC TGTGAACTAT
GCAGAGATGG AAAAGGCGCA GGCCGGTTTT GGCGAACGCT ACCTGAACGA AGTCAAATAA
 
Protein sequence
MKHIALTVLA ATALAAPLTA SFASSAQAED AICYNCPPQW ADWASMLEAI ETEIGVSLPH 
DNKNSGQTFA QLVAEKDSPV ADVAYYGVTT GIKAGKEGLV EAYKPAGFDE IPEGLKDPEG
KWFAVHYGTI GFFVNVDALG GAPVPQCFAD LKKPAYQGMV GYLDPSSAFV GYAGAVAVNL
SFGGDLQDFD PAIEYFSELA ENAPIVPKQT SYARVVSGEI PILFDYDFNA YRAKYEEDGN
FEFVLPCEGS VRVPYVMSLV GNAPHGETGK KVLDFILSDK GQAIWTNAYL QPARPVELPA
EVAEKFLPAS DYARAQAVNY AEMEKAQAGF GERYLNEVK
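
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below, in plain Python with no external libraries, shows one way to strip the grouping, compute a GC fraction, and translate a coding sequence. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAACATA TCGCTCTTAC GGTTCTGGCT GCAACCGCTC TTGCGGCACC GCTGACCGCG"
print(translate(first_line))  # MKHIALTVLAATALAAPLTA
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on a record like this one.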