Gene Moth_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2023 
Symbol 
ID3831398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2110872 
End bp2112164 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content49% 
IMG OID637829952 
Producthypothetical protein 
Protein accessionYP_430862 
Protein GI83590853 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCTT GCTGGTGGCG GGAAATAGAT TTCCGTGGCT GGAATGGGGC TGTCGAGTTC 
GGAAATGACC TGATCCGGGT GGTTATGGTC CCGAACCTGG GTGGAAGAAT TATGGCCTAC
GATCTTGGTG ATTATTCCTT TCTTTACGTT AATAAAGAGT TAGAGGGTAA ACTTTTCACG
CCCGAAGAAA ATTACGGTGA TGGTTCCATT GCCGCCTGGA AGAATTACGG TGGCGATAAA
ACCTGGCCAG CACCCCAGGG GTGGGATACC GAGGAAGAAT GGCACGGGCC GCCGGACCCG
GTACTTGACA GCGGGGTATA TACTGGGCGA TGGCTGGAAT GCAGCCCGGA AAAAGTAAGC
TATGAAGTGG AGAGCCCGCC GGATCCCCGC ACGGGAATTA AGCTCTTTCG TAAGGTTACT
ATCCGGCAGG ACAGCAGCAA GCTGTGGCTG GAGCTGAGGA TGAAGAATAT CAGCTCCCGG
CCAGTGGCGT GGAGCATCTG GAATGTAACC CAACTGGATA CGCGGTTGAG GAATGGCAAA
GGGTATGATC CTAATTGTCG CCTTTATATA CCATTGAACC CTGTGAGCAG GTTTGCAAAA
GGATACCGGG TAATCTTCGG CGAAGAAGAT AACCCGCAAT GGGGCCAACG GGAAGGCAAT
GACCTCTTAG TAATACCTTA CCTATTTTAC GTCGGTAAAA TTGGCGTCGA TTCACCGGTG
GGTTGGATGG CTTTTGTCAA TGATACTGAA GGCTATACCT GGTGCCTGCG CTACCCCTAC
TATCCGGAAG AGAAAGATGC ATATCCCGAC GGGGGTTGCT CGGTAGAATG CTGGACGGTG
GGTCGCGGTG TAGTGACCGG TAAGGATTAT TCCCAGGAGA CCGGTTATCA TATAGAGGCA
GAAGTACTGG GGCCAGTAAG AAAGCTTAAA CCAGGTGAAG AGCAGTTTTT AGAACTGGAA
ATGGGGGTAG CGAAAGGGGG CGGTAGATTT AAAAAAGTCA CAGCAGGGGG TTATATTATC
ATGGGAGGTG GTGCCAGGTT AGAAAAAGGG AAATTAATAA TAAATCTTTC TGGCGGTGTT
TTTTATAAAG GAAGGTTGCA GGTAGTTGTA ACTGATGCCA GGCATAACGT TATTTTGCAA
CAAGATTTAG GAGAAGTATC TCCCCACGAA GAGGTAAAAG TTAACCAAAA AATAGATCTT
CCATTTTCCA GGGTAGTTTT TCCATCCCTG CAAGCTCATT TAATCATTGA CCATCCGGGG
GGTATGGATG AATACTACCT AGCGTACCTC TAA
 
Protein sequence
MNPCWWREID FRGWNGAVEF GNDLIRVVMV PNLGGRIMAY DLGDYSFLYV NKELEGKLFT 
PEENYGDGSI AAWKNYGGDK TWPAPQGWDT EEEWHGPPDP VLDSGVYTGR WLECSPEKVS
YEVESPPDPR TGIKLFRKVT IRQDSSKLWL ELRMKNISSR PVAWSIWNVT QLDTRLRNGK
GYDPNCRLYI PLNPVSRFAK GYRVIFGEED NPQWGQREGN DLLVIPYLFY VGKIGVDSPV
GWMAFVNDTE GYTWCLRYPY YPEEKDAYPD GGCSVECWTV GRGVVTGKDY SQETGYHIEA
EVLGPVRKLK PGEEQFLELE MGVAKGGGRF KKVTAGGYII MGGGARLEKG KLIINLSGGV
FYKGRLQVVV TDARHNVILQ QDLGEVSPHE EVKVNQKIDL PFSRVVFPSL QAHLIIDHPG
GMDEYYLAYL