Gene Moth_0386 details

Gene Information

Locus tag: Moth_0386
Symbol:
ID: 3832630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 392221
End bp: 393390
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 47%
IMG OID: 637828323
Product: hypothetical protein
Protein accession: YP_429263
Protein GI: 83589254
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID:
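
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(392221, 393390)
print(length)                     # 1170
print(protein_length_aa(length))  # 389
```

Both values agree with the Gene Length and Protein Length fields of this record.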


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.33173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.334793
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAGG AAAAGCTTTA TCAGGAGAGA CTGCAGCGTT ACATCACAGC CATGGAATGT 
GGCAAACCGG ATAAAGTTCC TATAGCTTTC TCAGTTGGTG AATGGGCTGT CAAGTACACC
GGCAGTACTC TTGAAGAAGT ATATTACAAC CTTGACAAAT CAATTGAAAT AACTTGTGAA
GTAGTACAGG ACCTCGATTT TGACATTTTC GGAGGTGGTC CCACCTTATG GTGGCCACCA
ATGTTCGATG CCATGGGTTC GAAATTGTAT AAGTTCCCGG GTATTCATTT AGAGGAGAAC
TCGCAATTCC AGTATATGGA AGAAGAGTAC ATGAAACCGG AAGACTATGA TGATTTTATT
GCCAATCCTA CTGAATGGCT GGCAACTAAA TATTTGCCTC GTATTAGTGA GGAGTTTGCC
AGGCCAGGCT CATACCGAGC CACGGTAGCG CTAATTAAAA GCGCGGCTGC TTATGCCATA
GCCAGCAATG TCATGGCTAA AGGGTGGGAA AAGTGGACGA AGGAACACGG CGTAGTACCT
TCTACGAGCG GGTTTACCAA GGCGCCTTTC GATACCCTGG GTGACACTCT GCGGGGTATG
AAAGGGATTT TGCTTGACAT CCGCCGTCGA CCCGAGAAAG TCCTGGCCGC GTGCGAAGCG
ATTATACCAC ATAATATAGC CTATGCCATG ATTGGGGCTC GTGGAGATAC CACTTTACCT
TGTAAGGCAA CCCTTCATCG AGGTGCCTAC CCGTTCCTGA GCATGGAACA TTGGGAAAAA
TTCTACTGGC CGTCTTTGAA GGCAGTTATC GAAGGTCTCT GGGCGCAAGG AAAGAGGATG
TACTTCTTTG CTGAAGGAAA TTGGACCCCG TATCTTGAGA AGATAGCTGA ACTGCCGGAT
AAAAGTATTG TATTCATCAT AGATACAACT GATGCCAAAA AAGCGAAAGA AATTCTCGGC
GGTAAGTTCT GCCTGTGGGG TGGCGTTCCC ACTACGCTTC TGACTTACGG CACGCCGGCA
CAAGTGAAGG ATTGTGTGAA GCAGGCTATT GATGAACTGG CCTGTGACGG GGGCTTTGTC
CTTGCACCTG GCGGAGCTGT TATGAGTGAT GCCAAGCGGG AAAATATTTT TGCAATGATT
GAAGCAGGAC GAGAGTACGG CGTTTATTAA
 
Protein sequence
MSKEKLYQER LQRYITAMEC GKPDKVPIAF SVGEWAVKYT GSTLEEVYYN LDKSIEITCE 
VVQDLDFDIF GGGPTLWWPP MFDAMGSKLY KFPGIHLEEN SQFQYMEEEY MKPEDYDDFI
ANPTEWLATK YLPRISEEFA RPGSYRATVA LIKSAAAYAI ASNVMAKGWE KWTKEHGVVP
STSGFTKAPF DTLGDTLRGM KGILLDIRRR PEKVLAACEA IIPHNIAYAM IGARGDTTLP
CKATLHRGAY PFLSMEHWEK FYWPSLKAVI EGLWAQGKRM YFFAEGNWTP YLEKIAELPD
KSIVFIIDTT DAKKAKEILG GKFCLWGGVP TTLLTYGTPA QVKDCVKQAI DELACDGGFV
LAPGGAVMSD AKRENIFAMI EAGREYGVY
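
The gene and protein sequences above are linked by codon translation, which can be sketched in Python as follows. This sketch uses the standard codon-to-amino-acid assignments; to my understanding, translation table 11 shares these assignments and differs only in which start codons are permitted, so the sketch makes no attempt to model alternative starts. The helper names `clean`, `translate` and `gc_percent` are hypothetical. Whitespace is stripped first because the sequences are displayed in blocks of ten characters.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGTAAGG AAAAGCTTTA TCAGGAGAGA"))  # MSKEKLYQER
```

Applied to the first 30 bases of the gene, this reproduces the first block of the protein sequence. The same functions can be run on the full gene sequence to compare against the listed protein sequence and GC content.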