Gene Moth_2241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2241 
Symbol 
ID3831287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2340019 
End bp2342214 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content54% 
IMG OID637830161 
Producthypothetical protein 
Protein accessionYP_431071 
Protein GI83591062 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0974985 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATGC CGGTACTATC AGTAGAACAG CGCAATAAAC TTGAACGTAC GGTTGTCGAG 
GCCCGGGATG TTGCCGAAGC CGGGGCCAAG GCGGCCCTGG AGGCCCTGGC CGTGCACCAT
CACGAGCCTT ACAGCCATAT GACCCCAGAA CAGCGCCGGC TTCGCAATCA CCTCCGGGCC
CGGGCGCGCC AGCTGGGGGA CAGGCAGGAC CAAAGGGGAA AAATGGAAAT CACCCACCTG
ATCCAGGAGT GGGCTTACGA GCACTGGCAC CGCATGCTCT TTGCCCGTTT CCTGGCGGAA
AACGACCTCC TGATCGAGCC GGAGATGGGG GTGGCTGTCA GCCTTGAGGA GTGTGGGGAA
CTGGCCCAGG AGGAAGGTAC CGATCTCTGG ACCCTGGCCA GCCGTTTTGC CCAGCAGATG
TTGCCGCAGA TCTTTCGCCC CGATGACCCG GTGTTGCAGG TCACCTTTGC GCGCGAATAT
CAACTGAAGC TGGAGCAGTT GCTCAACGAC CTGGAGCCGG GTATCTTTAA AGCCAGCGAT
GCCCTGGGAT GGGCCTACCA GTTCTGGCAG AGCAAGCGCA AGAAACAGGT GAACGAATCG
GGCAATAAAA TCGGCGCCGA TGAATTGCCG GCTGTTACCC AGCTCTTTAC AGAGCCTTAT
ATGGTGAATT TCCTCATTCA CAACACCATT GGCGCCTGGT ACGCGGGCAA GGTGCTGGCG
GAGAACCCGC AGCTTGCCGG GCAGGCAGAA AGTGAAGAAG AGCTTCGCAG GGCGGTTGCG
TTGCCCGGTG TGACTTGGGA TTACCTACGC TTCGCCCGTA GCGGCGATGG GGAGGGGCCG
TGGCGTCCCG CTGCCGGGAC CTTCGAGGGC TGGCCGCAGC GGGCAGCGGA GCTTAAAATC
CTGGACCCCT GCTGTGGTTC GGGCCATTTC CTGGTAGCGA CATTTTATCA TTTAGTTCCG
ATTCGAATGG CCGAGGAAGG CCTTACGGCC CGGGAGGCAT GTGACGCCGT CCTGAGTGAT
AACCTCCACG GCCTGGAGAT CGATGAGCGC TGCACGCAGA TCGCCGCCTT TGCCCTGGCG
CTGGCGGCGT GGACCTACCC CGGCGCCGGT GGCTACCGCC CCTTGCCCGG GTTGCGAATA
GCCTGTTCCG GCATCGCTCC CAATACGAAA AAGGAAAACT GGCTGGCCCT GGCGGGGGAT
GACGAGCGCC TCCGTAACGG CATGGCCCGG CTTTATGACC TTTTCCGGGA AGCGCCTATT
CTAGGAAGCC TTATCGATCC TGGTTCCGCT CTAGAAAACA ATCTGATAGA AGCCGGGTTT
GATGAACTCC GTCCCTTGCT GGAAAAAGTA ATGTCCGCGG AGAAAGACGA TTACGAGCAG
CACGAGTTAG GGGTTGCCGC CTGTGGTATC GCTGATGCAG TTCAAATTCT TAGTGGTCGT
TATCATCTAG TGATTACCAA CGTGCCGTAT CTTGCCCGCG GAAAACAGGG AGCCGGACTC
AAGAATTACC TTGATACCCA TCATGCCCGT TCGAAACAAG ACCTGGCCAC TGCCTTTATC
GAGCGCAATC TAATGTTGTG CCTAGAAGGT GGTTCTACTG CTTTAGTTAC TCCTCAGCAG
TGGCTGTTTC AAACTGGGTA TTCGAAGATT CGTAAGAAGC TGCTTAGAGA TTTACGATGG
GAACTCTTAG CTATACTTGG TGAACACGCA TTCCGAAGCT CTGAAGCTGC AGGGGCATTT
CCTGCAATAT ATGTTTGTTC TAAATTGCTT CCGAAAGACA ATCATCTTTT CTTTAGTATT
AACGTTTCAG AGAGGTCTTC TGCCTGTGAC AAAGACTTAT ATCTCCAACA CGCTTCCTTA
AACAAAATGC AACAGATAAG TCAACTTAAA AACCCGGAAC ACATTATCAT TACAAACACG
TTTTCTGGGT TTAGAGTCTT TGGTGATTAT GCCGATTGTT ACCAAGGCAT ATCAACAGGC
GATAATCCTA GATTTTGTAT AAAGTTTTGG GAACTCCCAT CTATTTTACC AGGTTGGGAA
AGGTTTCAAA CACCACCAGA GGAAACCCAG TACTATGGTG GGCGAGAGCA TCTAATACGT
TGGGAAGAAC GTGCTAGTAT TCAAGAGCTA GGTGCTATTC GAGGGAAAAA TGCTTGGGGA
AAATGGGGCA TTGTCATAGG TGTTGATTGT CAATAG
 
Protein sequence
MPMPVLSVEQ RNKLERTVVE ARDVAEAGAK AALEALAVHH HEPYSHMTPE QRRLRNHLRA 
RARQLGDRQD QRGKMEITHL IQEWAYEHWH RMLFARFLAE NDLLIEPEMG VAVSLEECGE
LAQEEGTDLW TLASRFAQQM LPQIFRPDDP VLQVTFAREY QLKLEQLLND LEPGIFKASD
ALGWAYQFWQ SKRKKQVNES GNKIGADELP AVTQLFTEPY MVNFLIHNTI GAWYAGKVLA
ENPQLAGQAE SEEELRRAVA LPGVTWDYLR FARSGDGEGP WRPAAGTFEG WPQRAAELKI
LDPCCGSGHF LVATFYHLVP IRMAEEGLTA REACDAVLSD NLHGLEIDER CTQIAAFALA
LAAWTYPGAG GYRPLPGLRI ACSGIAPNTK KENWLALAGD DERLRNGMAR LYDLFREAPI
LGSLIDPGSA LENNLIEAGF DELRPLLEKV MSAEKDDYEQ HELGVAACGI ADAVQILSGR
YHLVITNVPY LARGKQGAGL KNYLDTHHAR SKQDLATAFI ERNLMLCLEG GSTALVTPQQ
WLFQTGYSKI RKKLLRDLRW ELLAILGEHA FRSSEAAGAF PAIYVCSKLL PKDNHLFFSI
NVSERSSACD KDLYLQHASL NKMQQISQLK NPEHIIITNT FSGFRVFGDY ADCYQGISTG
DNPRFCIKFW ELPSILPGWE RFQTPPEETQ YYGGREHLIR WEERASIQEL GAIRGKNAWG
KWGIVIGVDC Q