Gene Moth_2277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2277 
Symbol 
ID3831388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2384894 
End bp2386789 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content60% 
IMG OID637830197 
Productthiamine pyrophosphate enzyme 
Protein accessionYP_431107 
Protein GI83591098 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00118672 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000449131 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGCCCGGA TTACGGCGGG CCTGGAGTAC GGGAGAGGTG AAATCGTGCG GGAACTGTTA 
ATCGGCAACC ATGCCCTGGC CCGGGGGGCC TGGGAAGCCG GGGTCCGGGT GGCCGCCGCT
TACCCGGGGA CGCCGAGTAC AGAGATTATA GAAGCCCTGG CCCGCTATCC GGAGGTATAC
GCGGAATGGG CGCCCAACGA AAAGGTGGCC CTGGAGGTAG CCATTGGTGC GGCCATAGGC
GGCGCGCGCT CCCTGGCCGC TATGAAACAT GTGGGTGTTA ACGTCGCCGC CGATCCCCTG
ATGACCCTGG CCTATACCGG CGTCAATGCC GGGCTGGTCC TGGTTTCGGC CGACGACCCC
GGCCTTTTTA GCTCCCAGAA CGAGCAGGAT AACCGCTTTT ACGCCCGCAT GGCCCAGATA
CCCTGCTTGG AGCCTGCCGA CAGCCAGGAA GTTAAGGATA TGGTCATGCA GGCTTTTAAT
TTAAGCGAGG AATTTGATAC CCCGGTAATT TTACGCCTGA CCACCCGGAT TGCCCATTCT
TACAGCCTGG TGGAACTTGG CGACCGCCAG GAAGTGCCGC TTAAAAGTTA TGTCAAACAG
CCCGCCAAGT ATGTCATGCT ACCGGCCTTT GGCAAGGCCC GCCATGTAGT GGTGGAAGAA
CGGCGTCTCA AGCTGGCGGC CTATGCGGAA ACTGCCCCCT TCAACCGGGT GGAATGGGCC
GACCGGCGGG TGGGCATCAT TACCTCCGGT ATTGCCTACC AGTACGTTAA AGAGGCCCTG
CCCGGGGTTT CGGTATTGAA GCTGGGGTTA ACTTATCCTC TGCCGGAGAA GCTGATTTCC
GATTTTGTAA AGGCAGTAAA AACTTGCTAT GTGGTAGAAG AACTGGAACC CTTCCTGGAA
GATCAGATCC GCGCCTGGGG ACTGGGTGTA GTAGGCAAGG AATTGGTGCC AAGGGTAGAT
GAATTGAGTA GCGCCATTGT CGCCCGTACG GTGGGCTCGC AGGTGGCGGC CGTTGCCCCG
GAGCTGGTTG CTCCTGATCT GACTGCGGTG GCCTTGCCCG GTATGAGGAC CCCGGGCGGT
CAAAGGGCAG GCGAAGGATC GGTCCCGGAT GCGAGGGAAA CGACGACTGC CGCGCCGGCT
GAACTCCCCG GTCGTCCACC CCTCATGTGC CCCGGCTGCC CCCACCGCGG CGTCTTTTAC
GTCCTGAAAA AGCTCCGGCT GGTGGTGGCC GGTGACATCG GCTGCTATAC CCTGGGGGCT
ACACCGCCTC TCCAAGCCAT GGATAGCTGT ATTTGCATGG GGGCCAGCCT GGGGGTGGCC
ATGGGGCTGG AGAAGGCCCG CGGTGCGGAC TTCGCCCGGC GGGTGGTAGG GGTTATTGGC
GATTCAACTT TCCTCCACTC GGGAATGACC GGACTCCTGG ACATGGTCTA CAATGGCGGC
ACCGGGACCT TGATTATCCT GGATAATAGC ACCACGGCCA TGACCGGCCA CCAGGACCAT
CCCGGCACGG GTTATACTGC TTCCCATCAG CCGGCCCCTA AGGTTGACCT GGAACAGATA
GCCCGCGCCC TGGGGGTGCA CCGGGTACAG GTGGTTGATA GTTATAATCT AGAGACCCTC
GAGAGGGCGA TCCAGGAAGA AACGGCCGCC AGGGAACCAT CGGTAATCAT TGCCAGGCGG
CCCTGCGCCC TCTTGAAGAA AGAAAAAGAA GCCGTTTACG CTGTAAGCCC CGATAACTGC
CTGAGCTGCC GTTATTGCCT GGACCTGGGC TGTCCCGCCA TTTCCTTTAG CGACGGGCAC
GGAGTGATTG ACCCGGTACT GTGCAACGGC TGCGGCCTTT GTACCCAGGT CTGCCCTGGT
GAGGCCATCA GGAAGGCTGG TGAAGAAGAT GAGTAA
 
Protein sequence
MARITAGLEY GRGEIVRELL IGNHALARGA WEAGVRVAAA YPGTPSTEII EALARYPEVY 
AEWAPNEKVA LEVAIGAAIG GARSLAAMKH VGVNVAADPL MTLAYTGVNA GLVLVSADDP
GLFSSQNEQD NRFYARMAQI PCLEPADSQE VKDMVMQAFN LSEEFDTPVI LRLTTRIAHS
YSLVELGDRQ EVPLKSYVKQ PAKYVMLPAF GKARHVVVEE RRLKLAAYAE TAPFNRVEWA
DRRVGIITSG IAYQYVKEAL PGVSVLKLGL TYPLPEKLIS DFVKAVKTCY VVEELEPFLE
DQIRAWGLGV VGKELVPRVD ELSSAIVART VGSQVAAVAP ELVAPDLTAV ALPGMRTPGG
QRAGEGSVPD ARETTTAAPA ELPGRPPLMC PGCPHRGVFY VLKKLRLVVA GDIGCYTLGA
TPPLQAMDSC ICMGASLGVA MGLEKARGAD FARRVVGVIG DSTFLHSGMT GLLDMVYNGG
TGTLIILDNS TTAMTGHQDH PGTGYTASHQ PAPKVDLEQI ARALGVHRVQ VVDSYNLETL
ERAIQEETAA REPSVIIARR PCALLKKEKE AVYAVSPDNC LSCRYCLDLG CPAISFSDGH
GVIDPVLCNG CGLCTQVCPG EAIRKAGEED E