Gene Moth_1931 details


Gene Information

Locus tag: Moth_1931
Symbol:
ID: 3832423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2005722
End bp: 2006945
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 54%
IMG OID: 637829863
Product: hypothetical protein
Protein accession: YP_430773
Protein GI: 83590764
COG category: [S] Function unknown
COG ID: [COG2461] Uncharacterized conserved protein
TIGRFAM ID:
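
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2005722
END_BP = 2006945
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```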


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000706509
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00330228
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGAAC TGCTCAACAA CCAGGATTAC CGTAAAGAAG CCTTAAAAGA AATTATCCGG 
GAATTGCACC GGGGCAAGAG CGTGGAGGAA GTGAAGGCCA GGTTTAATGA ACTGATCAAG
GATGTGGCCC CGGCGGAGAT CTCCCTCATG GAGCAGGCCC TGATCAACGA AGGCCTGCCG
GTGGAGGAGG TCCAGCGCCT CTGCGACGTC CACGCGGCGG TCTTTAAAGA GTCATTAGAA
AGGGCGCCGC AACCGGAAAC CATCCCCGGT CACCCGGTGC ATACTTTTAA AGAAGAAAAC
CGGGCCCTGG AAGATTTAAT GATCAGAGAG ATTCAACCGC TCCTGGCTGA ATTGCGCCGG
GCGAACCCGG ACGTTGAAAA AGACCTGGCC ATAAAGCTGG CGGAAAAACT GAATCTTCTC
CAGGATGTCA ACAAGCATTA TAGCCGCAAG GAGAATCTCC TCTTCCCTTA CCTGGAGAAG
TACCAGATAG TAGGGCCGCC CAAGGTCATG TGGGGCGTCG ACGACGAGAT CAGGGACCTC
TTAAAGGAAG CCCGGGACCT GGCCGTCAAT TACGTACCGG ATAAAAAAGA AGAACTCATT
ACCAGAACAG AAGCCGCCCT GGCAAAAATC AAAGAAATGA TCTTTAAAGA AGAAAGGATT
CTCTTCCCCA TGGCCCTGGA GACCCTGACC GAGGACGAGT GGTACCGGAT CATGCTTGAC
AGCGCCAGTA TCGGTTATTG CCTCATCGAG CCCCGGGAAG ACTGGCGGCC GGCGCAGGTC
AAACTTGACC AGAAAGAAAC TGTCGCCAGC GAGGAAACCA GGGGATACAT TAAGTTTGCT
ACCGGTATCC TGACGCCCCG GGAGATCAGC TTGATCTTCG ATCACCTGCC AGTAGACATA
ACCTTTGTCG ACAAGGATAA TGTGGTCAAG TATTTTTCCA ATACCAGGGA GCGCATCTTT
ACCCGCAGCC GGGCGGTTAT CGGCCGGCGG GTCGAAAACT GTCATCCCCC GGCCAGCGTC
CAGGTGGTGG AGAAACTCAT TGCCGACTTC AAAAGCGGGC GTAAAGACCG GGAAGCCTTC
TGGCTGCACC TGGGTGATAA GTATGTGTTT ATCCAGTATT TTGCCGTCCG GGACGAAAAA
GGCGACTTTG CCGGCACCCT GGAGGTGACC ATGGACCTCA AGCCTCTCCA GGCCATTAGC
GGTGAGAAGA GGATTATGGA TTAG
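
A GC content figure such as the one in the record can be recomputed from the sequence text. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length. The helper name `gc_content` is illustrative, and the 10-base grouping and line breaks used on this page are stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Example on the first row of the gene sequence; paste the full sequence
# to compare the result against the GC content field of the record.
first_row = "ATGACCGAAC TGCTCAACAA CCAGGATTAC CGTAAAGAAG CCTTAAAAGA AATTATCCGG"
print(round(gc_content(first_row), 1))
```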
 
Protein sequence
MTELLNNQDY RKEALKEIIR ELHRGKSVEE VKARFNELIK DVAPAEISLM EQALINEGLP 
VEEVQRLCDV HAAVFKESLE RAPQPETIPG HPVHTFKEEN RALEDLMIRE IQPLLAELRR
ANPDVEKDLA IKLAEKLNLL QDVNKHYSRK ENLLFPYLEK YQIVGPPKVM WGVDDEIRDL
LKEARDLAVN YVPDKKEELI TRTEAALAKI KEMIFKEERI LFPMALETLT EDEWYRIMLD
SASIGYCLIE PREDWRPAQV KLDQKETVAS EETRGYIKFA TGILTPREIS LIFDHLPVDI
TFVDKDNVVK YFSNTRERIF TRSRAVIGRR VENCHPPASV QVVEKLIADF KSGRKDREAF
WLHLGDKYVF IQYFAVRDEK GDFAGTLEVT MDLKPLQAIS GEKRIMD
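
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It uses the standard codon assignments, which translation table 11 shares; table 11 differs mainly in the set of permitted start codons, which does not matter here because this gene starts with ATG. The name `translate` and its behaviour of stopping at the first stop codon are choices made for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unknown bases become 'X'.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence shown above.
print(translate("ATGACCGAAC TGCTCAACAA CCAGGATTAC"))  # MTELLNNQDY
print(translate("GGTGAGAAGA GGATTATGGA TTAG"))        # GEKRIMD
```

Both outputs match the corresponding ends of the listed protein sequence.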