Gene Moth_2310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2310 
Symbol 
ID3831424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2429708 
End bp2431108 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content59% 
IMG OID637830234 
ProductFAD dependent oxidoreductase 
Protein accessionYP_431140 
Protein GI83591131 
COG category[R] General function prediction only 
COG ID[COG2509] Uncharacterized FAD-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000884654 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGG GATACGATGT TGTCATCGTC GGTGCCGGCC CGGCGGGTAT TTTTACCGCC 
CTGGAGCTGG TGCGGCAGAG AAGCGGCCTG AAGGTGCTGA TCCTGGAGAA AGGGCACGGT
TTACAACGCC GGGTTTGCCC GTCCCAGGAG ACACGTTCCA GTTGCCTCCA CTGTAACCCC
TGCTCGGTGG TATCCGGCTG GGGGGGCGCC GGCGCTTTCA GTGACGGTAA ATTGACCCTG
TCGCCGGAAG TTGGGGGCTG GCTTAATGAA TATATACCCC AGCAGGATGT TACTTCCCTC
ATTGATTATG TCGACAAGAT TTACCTTTAT TTTGGGGCAC CGGACAGGGT ATACGGCGGC
CCTGACGACG AGCGGATTTT AGATATCCAG CGCCAGGCGA TCCTGGCGGA CTTGAAGTTA
ATCCCGGCGC CCATTCGCCA CCTGGGAACG GGTCGTACCC AGGAAATCCT CCAGGCCATG
AAAGACTACC TGGAGGAACG AGGGGTGGAG GTGCGGACGG AGACACCGGT GGAGGAGATC
CTGGTGGATG GCCACCGGGT CGCCGGGGTA GTTACCCGTA AGGGTGAGGA AATCAAGGCC
CGCTACGTGG TCCTGGCCCC GGGCCGGGAG GGTTCCGACT GGCTCCGTAA AGTGGCTGGT
AAACTGGGAT TAAAGCTGGC GGTCAATCCG GTGGACGTCG GCGTTCGGGT GGAAGTCCCG
GCGGCGGTCA TGGAACACCT GACCAGCGTG ATCTATGAAT CCAAGTTTAT CTTTTACTCC
CGGAAATTTG ATGACCGGGT ACGGACCTTC TGCATGAACC CTTACGGTGA GGTCGTGCTG
GAAAATAACG AGGGCCTGGT GACGGTCAAC GGTCATTCCT ATGCGGAAAA GAAGACCACC
AATACCAATT TTGCCCTCCT GGTCAGCAAG ACCTTTACGG AACCCTTTAA AGAACCCATT
GCCTATGGCC GTTATGTGGC GCAACTGGCC AACCTCCTGG GGGGCGGCGT CCTGGTACAG
CGCCTGGGGG ATCTCCTTTC CGGCCGGCGG ACCACAGCCG ACCGCCTGGA AAAAGGGCTG
GTCAACCCGA CCCTGACGGA AGCGACGCCG GGGGACCTCT CCCTGGTTTT CCCCTACCGG
CACTTGACGG CTATTATCGA AATGCTTCAG GCGATGGATA AGATTGCTCC CGGGGTTTAC
TCCCGCCATA CCCTCCTTTA CGGGGTCGAG GTTAAGTTCT ATTCATCCAG GTTAAATTTA
ACCCGTGACC TGGAGACCAA TATCCGCGGC TTGTACGCCG CCGGTGACGG CGCCGGGGTT
ACCCGGGGCC TGGCCCAGGC CTCGGCCGCC GGGGTGATTA CCGCCCGGGC CATTATGGCG
GGTACTTCCG CCCGTGACTA G
 
Protein sequence
MAEGYDVVIV GAGPAGIFTA LELVRQRSGL KVLILEKGHG LQRRVCPSQE TRSSCLHCNP 
CSVVSGWGGA GAFSDGKLTL SPEVGGWLNE YIPQQDVTSL IDYVDKIYLY FGAPDRVYGG
PDDERILDIQ RQAILADLKL IPAPIRHLGT GRTQEILQAM KDYLEERGVE VRTETPVEEI
LVDGHRVAGV VTRKGEEIKA RYVVLAPGRE GSDWLRKVAG KLGLKLAVNP VDVGVRVEVP
AAVMEHLTSV IYESKFIFYS RKFDDRVRTF CMNPYGEVVL ENNEGLVTVN GHSYAEKKTT
NTNFALLVSK TFTEPFKEPI AYGRYVAQLA NLLGGGVLVQ RLGDLLSGRR TTADRLEKGL
VNPTLTEATP GDLSLVFPYR HLTAIIEMLQ AMDKIAPGVY SRHTLLYGVE VKFYSSRLNL
TRDLETNIRG LYAAGDGAGV TRGLAQASAA GVITARAIMA GTSARD