Gene TM1040_3307 details

Gene Information

Locus tag: TM1040_3307
Symbol:
ID: 4075711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 315194
End bp: 316543
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 59%
IMG OID: 638004815
Product: extracellular solute-binding protein
Protein accession: YP_611541
Protein GI: 99078283
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
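
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the page does not state the convention) and that the unspliced CDS ends in a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 315194
END_BP = 316543
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP))            # 1350
print(expected_protein_length(GENE_LENGTH_BP))  # 449
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.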


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0444358
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.775669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAG TATTGATGTC AGCCGCAGCG ATCACCGCGT TGATGGCAGG AACGGCGAGC 
GCCCAGGATC TGATTTTTCC GGTCGGCGAA GGCGCCTTCA ACTGGGACAG CTACGCAGAG
CTGGAGAAGA TCGACCTGAA CGGTGAGCAG GTCACCGTGT TTGGTCCATG GCTCGGGCCT
GACCAAGAGG TTGTTGAAAA CGTACTGGCC TACTTCGCAG CCGCGACCGG TGCGGATGTG
CGCTATGCAG GCTCCGACAG CTTTGAGCAG CAGATCGTGG TCGATGCCGA GGCAGGCTCT
GCCCCCAATG TTGCTGTTTT CCCGCAGCCC GGTCTGGTGT CTGATATGGC CAAGCGAGGC
TTCATCACGC CGCTTGGTGA AGAAACCGCC GACTGGGTGC GCGACAACTA CGCCGCGGGT
CAGTCCTGGG TGGATCTCGG AACCTATCCG GGCGCAGATG GCAATGACGG GCTCTTTGGT
CTGTTCTACA AGGTCGATGT GAAGTCTCTG GTTTGGTATA ACCCGGAAAA CTTTGAGGAT
TTCGGATATG AAACTCCGCA GTCCATGGAA GAGCTGAAGG CGCTGACCGA GCAGATGGTG
GCCGATGGCA ACACTCCATG GTGCATCGGC CTGGGATCCG GTGGCGCGAC TGGCTGGCCT
GCGACCGACT GGGTCGAGGA CATGATGCTG CGCACGCAGG AACCCGCAGT CTACGACAAA
TGGGTCTCCA ATGAGCTGAA GTTCGATGAT CCTGCCGTCA TCGGTGCGAT TGAGGAATTC
GGTTGGTTCG CCAAGAACGA TGACTTCGTT TCTGGTGGTG CTGGTGCCGT GGCGTCTACC
GACTTCCGCG ATAGCCCCAA AGGTCTCTTT GCCAGCCCGC CGCAGTGCAT GATGCACCGT
CAGGCGTCCT TCATTCCGGC CTTCTTCCCA GAAGGCACCG AAATGGGTCT GGATGCTGAT
TTCTTCTACT TCCCTGCCTA CGAAGGCAAA GAACTTGGCA ATCCGGTACT GGGCGCGGGC
ACCATCTGGT CGATCACCAA TGACAGCCCC GGTGCTCAGG CGCTGATGGA GTTCCTGAAG
GCGCCGATCG CTCATGAAGT CTGGATGGCG CAGCAAGGGT TCCTGACCCC GCTGAAGAGC
GTCAACACCG ACCTCTATGC CACCGACACG CTGAAGAAGA TGGGCGAGAT TCTTCTCTCT
GCAGATACCT TCCGCTTTGA TGCATCCGAT CTGATGCCGG GTGGCGTGGG CGCCGGGTCG
TTCTGGACCG GCATGGTGGA TTACGCAGGT GGCAAACCTG CCGAAGAGGT TGCAACCGAG
ATCCAGTCCT CCTGGGATGC GCTCAAGTAA
 
Protein sequence
MKRVLMSAAA ITALMAGTAS AQDLIFPVGE GAFNWDSYAE LEKIDLNGEQ VTVFGPWLGP 
DQEVVENVLA YFAAATGADV RYAGSDSFEQ QIVVDAEAGS APNVAVFPQP GLVSDMAKRG
FITPLGEETA DWVRDNYAAG QSWVDLGTYP GADGNDGLFG LFYKVDVKSL VWYNPENFED
FGYETPQSME ELKALTEQMV ADGNTPWCIG LGSGGATGWP ATDWVEDMML RTQEPAVYDK
WVSNELKFDD PAVIGAIEEF GWFAKNDDFV SGGAGAVAST DFRDSPKGLF ASPPQCMMHR
QASFIPAFFP EGTEMGLDAD FFYFPAYEGK ELGNPVLGAG TIWSITNDSP GAQALMEFLK
APIAHEVWMA QQGFLTPLKS VNTDLYATDT LKKMGEILLS ADTFRFDASD LMPGGVGAGS
FWTGMVDYAG GKPAEEVATE IQSSWDALK
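
The page does not say how the protein sequence or the GC content were derived. The following is a minimal sketch of one way to reproduce them from the gene sequence, assuming plain codon-by-codon translation from the first base. It uses the standard codon assignments, which translation table 11 shares (the two tables differ in which codons may act as start codons). The helper names `clean`, `gc_fraction`, and `translate` are illustrative. Only the first and last lines of the gene sequence are embedded here. The full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino acid assignments
# as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons with non-ACGT characters (for example N).
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
FIRST_BLOCKS = "ATGAAACGAG TATTGATGTC AGCCGCAGCG ATCACCGCGT TGATGGCAGG AACGGCGAGC"
LAST_BLOCKS = "ATCCAGTCCT CCTGGGATGC GCTCAAGTAA"

print(translate(FIRST_BLOCKS))  # MKRVLMSAAAITALMAGTAS
print(translate(LAST_BLOCKS))   # IQSSWDALK
```

The two translated fragments match the start and end of the protein sequence listed above. The last line begins at base 1321, so it is in frame with the start codon. Calling `gc_fraction` on the full gene sequence gives a value to compare with the reported GC content.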