Gene TM1040_2782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2782 
Symbol 
ID4076550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2941116 
End bp2942498 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content61% 
IMG OID638008107 
Productaldehyde dehydrogenase 
Protein accessionYP_614776 
Protein GI99082622 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.845682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGA CACTGAATTG TATTTCGCCG ATCGACGGAT CGGTCTACGC CAGCCGCGAG 
ACGCTGGATC TGGACGCCGC CCGTAGCGAG GTGGCCGCCG TGCGGTCTGC GCAAAAGGCT
TGGGCCGCGC GTCCGCTACA GGAACGTATC GATCTCGTGA TGGCGGGCGT CGCCGCCATT
GGCGAGATGA ACGATGAAAT CGTGCCTGAG CTTGCCCATA TGATGGGGCG TCCGGTACGG
TTTGGCGGCG AATTTGGTGG GTTCAACGAG CGCGCGTCGC ATATGGCGCA GATCGCTGCC
GAGTCGCTTG CGGACATCGA GGTTGGCGAG GACGCCACTT TCAAGCGTTA CATCAAACGC
ATTCCGCATG GTGTGGTCTT TGTGGTGGCG CCCTGGAACT ACCCCTACAT GACCGCGATC
AACACCGTCG CGCCAGCGCT TATCGCGGGC AACACCGTGG TGCTGAAGCA CGCCACCCAG
ACGCTCTTGG TCGGTGAGCG GATGGCTAAG GCGTTCCATG CGGCGGGCGT GCCTGAGGAT
GTGTTCAAGA ACGTGTTCCT CGATCACGAT ACCACCTCTG CTTTGATCGC GGAGAAGGCG
TTTGATTTTG TGAACTTTAC CGGTTCTGTC GGCGGCGGCC AGGCAATGGA ACGTGCCGCA
GCGGGCACCT TCACGGGCGT TGGGACCGAG CTCGGTGGCA AGGATCCGGG CTACGTGATG
GAAGACGCGG ACGTGGACGC CGCAGTTGAT ACGCTGATCG ACGGCGCGAT GTTCAACTCT
GGCCAGTGCT GCTGCGGAAT CGAGCGCATC TACGTCCACG AGAGCCTCTA TGACGCTTTC
GTTGAAAAAG CGCTGGCGGT GGTCAATGGC TACAAGCTCG GCAACCCGCT GGATGCGGAA
ACCACCATCG GCCCGATGGC CAACGTGCGT TTTGCCCAAG AGGTGCGCGA CCAGATCGCC
GAGGCGATCG AACAGGGTGC CGTGGCGCAC ATCGCCACAA TGGATGCGGA TGATGGGGGT
GCCTATCTGA CTCCGCAGAT CCTGACCAAT GTCACCCATG AGATGCGCGT GATGCGCGAT
GAAAGCTTTG GACCGGTCGT GGGCATCATG AAGGTCTCCT CCGATGAGGA AGCGATCAAA
CTCATGAATG ACAGCGACTT CGGCCTCACC GCTTCCATCT GGACCAAGGA TCTGGAGCGC
GCACAGGCTG TGGGCGACCA AATCGAGACC GGCACCGTGT TCATGAACCG CGCAGATTAC
CTCGATCCGG GGCTCTGCTG GACCGGCTGC AAGAACACCG GTCGCGGCGG TGGTCTTTCG
GTGATCGGCT ATCACAACCT GACCCGGCCG AAGTCCTACC ACCTGAAAAA AGTCACCGCC
TAA
 
Protein sequence
MGKTLNCISP IDGSVYASRE TLDLDAARSE VAAVRSAQKA WAARPLQERI DLVMAGVAAI 
GEMNDEIVPE LAHMMGRPVR FGGEFGGFNE RASHMAQIAA ESLADIEVGE DATFKRYIKR
IPHGVVFVVA PWNYPYMTAI NTVAPALIAG NTVVLKHATQ TLLVGERMAK AFHAAGVPED
VFKNVFLDHD TTSALIAEKA FDFVNFTGSV GGGQAMERAA AGTFTGVGTE LGGKDPGYVM
EDADVDAAVD TLIDGAMFNS GQCCCGIERI YVHESLYDAF VEKALAVVNG YKLGNPLDAE
TTIGPMANVR FAQEVRDQIA EAIEQGAVAH IATMDADDGG AYLTPQILTN VTHEMRVMRD
ESFGPVVGIM KVSSDEEAIK LMNDSDFGLT ASIWTKDLER AQAVGDQIET GTVFMNRADY
LDPGLCWTGC KNTGRGGGLS VIGYHNLTRP KSYHLKKVTA