Gene TM1040_3866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3866 
Symbol 
ID4074929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp118357 
End bp119772 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content57% 
IMG OID638004523 
Productfructuronate reductase 
Protein accessionYP_611258 
Protein GI99077999 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0246] Mannitol-1-phosphate/altronate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.154848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATT CTCTTATCCC TTGTTCTTAC GAACAAACCC TGCTCAAGCC GCGTATTCTG 
CATATCGGCT TTGGTGCTTT TGCCCGTGCC CACCCTATGG TCTATCTTCA CCATGGGCTT
GTGGCCGAGG GTGGCGATTG GGGTGTGGTC GCTGCGCGAC TGAATTCCGG AGTGGATGCG
TTGGACAGCT TGGACGCAGT CCAGGGGCGA TATCACATCG CCGAAGCCGA TGGTGACACT
ATAACACTAC GAGAAATCGG CCTCTTGTGT GGCACCTGTC ACCCCGCCCG CGACGGGGTA
GATGCTATCC CCGCGTTAAT CGCGTCTCCA GATATGTCTG TGATCTTGTT GACCATTACC
GAAAAAGGGT ACTGTACAAA GGATGGCCAG CTCGACCTGA CGCAAGCCGC TATACAGGCT
GAGTTGGACG GCGGGTTGCC GACCACTGCC ATCGGTGTGT TGGTGTCAGG CTTGGAGCGC
CGGCGCGCGG CAGATCTTGG CGGAATCACG ATCTTGTCTT GTGACAATCA GCCGGATAAC
GGCGCGCTCA CTCGCGCCGC TGTGCTGGGG TTTGCTGAGG AATTGGATCT AAATCTTGCG
GAATGGATTA GAACCCATGT TCGGTTCCCC TCGTCGATGG TGGACCGGAT CGTGCCTGCC
ATGACCGATG ACAGTCATAC AGCGGTCGCA TCCGCACTTG GCCGAGATGA CCCCAATGCG
GTTTTGTGTG AACCCTTCAG ACAGTGGGTG ATTGAAGATG ACTTTGCCAA TGAGCGCCCC
CCCTTTGCAG AGGGTGGTGC TATGTTGGTT GCAGACGTAC AGCCGTTTGA GGAAATGAAA
CTAAGACTGC TCAATGGTGC GCATACCACT TTGGCTTGGC TGGGTCAGTT GCTGGGATAT
CAAACAGTGG CCGACTGCAT GGCTGACAAG GAGCTGCGTG CTTTAATCCG CCACCTAATG
TTGGCTGAGC AGGCCGCAAC ACTGCGTCCA CTCGAGGGTA TCGATCTCGC AGCCTATGCG
GATGAGTTAT TAAAACGGTT TGAAAACACC CGGCTCCGGC ATCGACTAGA CCAGATCGCC
AGCGACAGCA GCCAGAAGAT GCCGCAACGC CTGTTCGCTC CGATAGCTAT TAACCTCGAA
GCCAAACGCG AGTGGTCGGT TTCAGCCTTG GCGGTGGCTG CTTGGATCAA AGGGTTGGGT
AGCCTTCCTC CTGTTCCGGA TCCTCGACAG GATGAGTTGC GTCGTGCCGC TCTTTGCAAT
GACCCAGTCG CGGCGGTTCT GTCGCTACCC TCCTTGGTTC CAGACGCGCT GCGTCCGCTA
GCAGAGTTCC AAGCCGCTAT TAGCGTGGCC TTTGAGCGAT TGCAGGGCGG GGCAAAGGCG
ACCGTGACAA CCACCGCGAA GGAACTACGC AGATGA
 
Protein sequence
MTNSLIPCSY EQTLLKPRIL HIGFGAFARA HPMVYLHHGL VAEGGDWGVV AARLNSGVDA 
LDSLDAVQGR YHIAEADGDT ITLREIGLLC GTCHPARDGV DAIPALIASP DMSVILLTIT
EKGYCTKDGQ LDLTQAAIQA ELDGGLPTTA IGVLVSGLER RRAADLGGIT ILSCDNQPDN
GALTRAAVLG FAEELDLNLA EWIRTHVRFP SSMVDRIVPA MTDDSHTAVA SALGRDDPNA
VLCEPFRQWV IEDDFANERP PFAEGGAMLV ADVQPFEEMK LRLLNGAHTT LAWLGQLLGY
QTVADCMADK ELRALIRHLM LAEQAATLRP LEGIDLAAYA DELLKRFENT RLRHRLDQIA
SDSSQKMPQR LFAPIAINLE AKREWSVSAL AVAAWIKGLG SLPPVPDPRQ DELRRAALCN
DPVAAVLSLP SLVPDALRPL AEFQAAISVA FERLQGGAKA TVTTTAKELR R