Gene TM1040_3621 details

Gene Information

Locus tag: TM1040_3621
Symbol:
ID: 4075048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 677733
End bp: 679253
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 59%
IMG OID: 638005140
Product: aldehyde dehydrogenase
Protein accession: YP_611850
Protein GI: 99078592
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
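
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(677733, 679253)
print(length)                              # 1521
print(expected_protein_length_aa(length))  # 506
```

Under these assumptions, both results agree with the listed Gene Length (1521 bp) and Protein Length (506 aa).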


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.096261
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGAAA TGACGCAGGT TGCGGGCGAA TATACGTCGC CCTTCAAGGC GCGCTATGAC 
AATTTCATTG GCGGTAAATT CGTCGCCCCG GTCAAAGGGC AGTATTTCGA CAACATTACC
CCGATCACTG GCGCCAAGGT CTGTGAGGTC GCCCGCTCCA GCGCCGAGGA CGTGGAGCTT
GCGCTCGATG CCGCTCACGC GGCCAAGGAC AGCTGGGGCA AGACCTCTCC TGCGGAACGC
GCGAATACCT TGTTGAAAAT CGCGGACCGC ATCGAGGAAA ACCTCGAGCT GATCGCCACG
GCGGAAACAT GGGACAATGG CAAACCGATC CGAGAGACCA TGGCAGCCGA TATCCCCCTG
TCTGTCGATC ACTTTCGCTA TTTCGCGGGT GTTCTGCGTG CCCAAGAGGG CAGCATGTCC
GAAATCGATC ACGACACGGT CGCCTACCAC TTCCACGAGC CGTTGGGCGT AGTGGGGCAG
ATCATCCCGT GGAACTTCTC GGTTCTGATG GCCGCCTGGA AGCTGGCCCC CGCGCTGGCG
GCCGGCAACT GCATCGTACT GAAGCCCGCC GAGCAGACCC CGGCGGCGAT CCATGTGCTG
ATCGAAGTGA TCGCGGATCT TTTGCCTGCG GGAGTTCTGA ACATCGTGAA TGGCATGGGG
CCAGAGGTGG GCGGCCCGCT TGCCGAGTCC GATCGGATCG CCAAGATCGC CTTTACCGGC
TCGACCGAGA CCGGGCGGAT CATCATGAAG GCAGCGACCA AGAACCTGAT CCCGGTCACG
CTCGAACTTG GGGGAAAATC GCCAAATATC TTCTTTTCGG ATGTGATGGC GCAGGATGAC
GCGTTTCTCG ACAAGGCGGT CGAGGGCTTT GTTCTATTTG CCTTCAACCA GGGCGAGGTC
TGCACCTGCC CCTCGCGAGC GCTGATCCAG GAAGACATCT ACGAAGAATT CATCGAACGC
TGCATCGCGC GCGTCAAAGC GATCAAGCAA GGCGATCCGC GCGACATTGA AACCATGGTC
GGGGCGCAGG CGAGCCAGGA ACAACAGGAA AAAATCATGT CCTACCTGAC CATCGGCGTC
GAAGAAGGCG CTGAGGTACT TGTCGGTGGC GATGCCGCCC GGTTCAACGG GGACATTGCC
AACGGGTTCT ATGTTCAGCC AACCATCCTA AAGGGCCACA ACAAGATGCG CGTGTTCCAA
GAGGAAATCT TTGGCCCTGT CGTGTCGGTG ACCACCTTCA AAGACGAAGA AGAGGCCCTC
GCGATTGCAA ATGACACCAT GTATGGCCTC GGCGCCGGCG TCTGGACCCG TGACGGAACC
CGCGCGTATC GCTTTGGGCG CAATATCGAG GCAGGCCGCG TCTGGGTGAA CAACTACCAC
GCATATCCGG CGCACGCGGC CTTTGGCGGG TATAAACAAT CCGGGATTGG GCGCGAAACC
CACAAGATGA TGCTCGACCA TTATCAGCAG ACCAAGAACA TGCTGGTGAG CTACAACCCC
AACAAACTCG GGTTCTTCTG A
 
Protein sequence
MNEMTQVAGE YTSPFKARYD NFIGGKFVAP VKGQYFDNIT PITGAKVCEV ARSSAEDVEL 
ALDAAHAAKD SWGKTSPAER ANTLLKIADR IEENLELIAT AETWDNGKPI RETMAADIPL
SVDHFRYFAG VLRAQEGSMS EIDHDTVAYH FHEPLGVVGQ IIPWNFSVLM AAWKLAPALA
AGNCIVLKPA EQTPAAIHVL IEVIADLLPA GVLNIVNGMG PEVGGPLAES DRIAKIAFTG
STETGRIIMK AATKNLIPVT LELGGKSPNI FFSDVMAQDD AFLDKAVEGF VLFAFNQGEV
CTCPSRALIQ EDIYEEFIER CIARVKAIKQ GDPRDIETMV GAQASQEQQE KIMSYLTIGV
EEGAEVLVGG DAARFNGDIA NGFYVQPTIL KGHNKMRVFQ EEIFGPVVSV TTFKDEEEAL
AIANDTMYGL GAGVWTRDGT RAYRFGRNIE AGRVWVNNYH AYPAHAAFGG YKQSGIGRET
HKMMLDHYQQ TKNMLVSYNP NKLGFF
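
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons. Table 11 differs mainly in which alternative start codons it accepts, and that does not matter here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGAATGAAA TGACGCAGGT TGCGGGCGAA"))  # MNEMTQVAGE
print(translate("AACAAACTCG GGTTCTTCTG A"))           # NKLGFF
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_content` on the same input should come out close to the listed 59% if the record is internally consistent.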