Gene TM1040_3713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3713 
Symbol 
ID4075420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp771666 
End bp773009 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content64% 
IMG OID638005233 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_611942 
Protein GI99078684 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0945015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGACTG CTACGGCGAA GGATGTCGAC CGCGCGGTGG TCTCGGCCCG GCGCGCTTTT 
GACGACGGTC GCTGGTCAGG GCTGGCCCCG GCGGCGCGCA AAAAGGTTCT GCACCGGATT
GCCGACAAGA TCGAGGCCGA GGCGCTGGCG CTCACGGTGC TTGGCGTGCG TGACAATGGC
ACCGAGTTCA ATATGGCGTT GAAGGCCGAG GCCGGATCTG CTGCAGGGAC TTTCCGCTAC
TACGCCGAGG CGCTGGACAA AGTCGCAGGC GAGGTCGCGC CCACCGCGCC GGACGTTCTG
GGACTCGTGC ATCGCGCGCC CGTTGGTGTC GTGGGCGCAA TCGTGCCATG GAACTTTCCG
CTGATGATCG GCGCCTGGAA GCTCGCGCCC GCGCTCGCGA TGGGCAATTC CGTGGTGCTG
AAGCCGGCTG AGACTGCCTC GCTGTCGCTG CTACGTCTTG CTGAGATCTG CGCAGACTGC
GGCCTGCCGG ATGGGGTGTT GAATGTAGTG ACCGGGCCGG GCGCCGTGAC TGGCGCCGCC
CTATCGGAGC ATATGGATGT GGACGTGATG GTCTTTACCG GTTCTGGCGC GACGGGGCGG
CGCCTGCTGG TGGCCTCGGC GCGGTCCAAC CTCAAGCGCT GCTACCTCGA GCTGGGAGGC
AAGTCCCCCA ATATCGTTTT TGCGGATGCA AAGGATCTCG ATCACGTGGC CAAGGTCTCG
GCCATGGGGA TTTTCCGCAA TTCCGGTCAG GTCTGTGTCG CAGGATCGCG CCTTTTGGTG
GAGGCCTCCA TCCACGAGGA ATTTGTGGCC CGGGTCGTGG CCCATGCACA GGCGCTTCGG
GTTGGCGACC CCCTGGATAT GAACACTCAG ATTGGCGCCG TGAATTCAGA GACCCAGCTT
GCAGCAAACC TTGCCCACGT GGAGCGTGCC GCCGCCCAAG GGGGCGAGGT GCTCTGCGGG
GGCGGTCGCA TCCTCTCCGA GACGGGTGGA ACCTACATGG CGCCGACCGT TGTGGCGGGT
GTCACGCAGG ACGCGGACCT CTTTCAAAAG GAGGTGTTTG GTCCGGTGCT CTCGGTCACG
GCGTTTGAGA GCGAAGACGA AGCACTTCGG CTTGCCAATG CCACCGACTA TGGGCTTGCG
GCAGGGGTCT GGTCGCAGGA TCTGTCGCGC GCGCATCGCT GCGTGGCCGG TATCCGCGCA
GGCGTCGTGC ATGTGAACAC CTATGGCGGG GCTGATAATA CGGTGCCTTT GGGTGGGGTT
GGCCAATCCG GTAACGGTCA CGACAAATCC CTGCATGCGC TCGAGAAATA CGTCGATCTG
AAAACGGCCT GGATTCAGCT TTGA
 
Protein sequence
MATATAKDVD RAVVSARRAF DDGRWSGLAP AARKKVLHRI ADKIEAEALA LTVLGVRDNG 
TEFNMALKAE AGSAAGTFRY YAEALDKVAG EVAPTAPDVL GLVHRAPVGV VGAIVPWNFP
LMIGAWKLAP ALAMGNSVVL KPAETASLSL LRLAEICADC GLPDGVLNVV TGPGAVTGAA
LSEHMDVDVM VFTGSGATGR RLLVASARSN LKRCYLELGG KSPNIVFADA KDLDHVAKVS
AMGIFRNSGQ VCVAGSRLLV EASIHEEFVA RVVAHAQALR VGDPLDMNTQ IGAVNSETQL
AANLAHVERA AAQGGEVLCG GGRILSETGG TYMAPTVVAG VTQDADLFQK EVFGPVLSVT
AFESEDEALR LANATDYGLA AGVWSQDLSR AHRCVAGIRA GVVHVNTYGG ADNTVPLGGV
GQSGNGHDKS LHALEKYVDL KTAWIQL