Gene Moth_1022 details


Gene Information

Locus tag: Moth_1022
Symbol:
ID: 3832642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1050230
End bp: 1051759
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 57%
IMG OID: 637828950
Product: Mg chelatase-related protein
Protein accession: YP_429879
Protein GI: 83589870
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0606] Predicted ATPase with chaperone activity
TIGRFAM ID: [TIGR00368] Mg chelatase-related protein
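
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes a single untranslated stop codon. The variable names are illustrative and do not come from the source record.

```python
# Values copied from the gene record.
start_bp = 1050230
end_bp = 1051759

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1530 bp) and Protein Length (509 aa).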


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00111662
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000780236
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCTGGCCA TTGTTAACTC AGTCGTCCTG GTAGGCCTGG AGGGGCAGAG CGTACGGGTA 
GAGGTGGACA TCAGTAATGG TTTGCCCCTT TGTGATATAG TGGGGCTGCC GGATCCCTCG
GTGAGGGAGG CCCGGGAACG GGTGCGAGCA GCTATTAAAA ATTCGGGTTT TGACTTTCCG
CTCCGCCGGA TTATTGTTAA TCTAGCCCCT GGTGACATCA AAAAAGAAGG TCCAATCTAT
GATTTGCCGA TTGCCCTTGG TATCCTGATG GCTGCAGAAG CTATCGGTAG CGGACCAGAA
TTGGCCATTT ATGCTGTAGG CGAGCTTTCC CTGGAGGGGA GCTTGCGGCC CATTCCCGGG
GTTTTGCCTA TGGCATTGGC CCTCCAGGAG ATCCAGCCCG GGGCTATTTT TATCGTCCCG
GCCGCCAATG CCAATGAAGC CGCCCTGGCT ACCCGCTTAA GGGTGCTGGC AGCCGAAAGC
CTGGCCCAGG TGGTGGCTTA CTGGCGGGGA GAGGGTGAAT TACAGGAAAT AAAGGCTGCA
GGAGATAGTC CGCCGCCGGC CTGCAGAGGG GTGGACCTGG CCGACATCAA AGGTCAGGTG
GCCGCCAAAC GCGGTCTGGA AATAGCCGCT GCCGGTGGCC ATAATATCCT CCTTATCGGC
AGTCCCGGGG CCGGGAAAAC CATGCTCGCC AGGAGTCTGC CTACCATTTT GCCCCCCTTA
ACCTATGAAG AAGCTCTGAC GGTGACCAAG ATTTACAGTG CCGCTGGTTT ACTGGCCCCG
GGCCAGGGGC TGGTAACGGA ACGCCCCTTC CGTACTCCCC ATCATACTGC CTCAACAGCC
AGCATCATTG GTGGCGGGCG GTTCCCCAAA CCCGGCGAGG TAAGCCTGGC CACCCACGGC
GTCCTTTTCC TGGACGAAAT GGCCGAATAT CGCCGTGATG TCCTGGAGGC CCTGCGCCAA
CCCCTGGAGG ACCGGGTGGT CACTGTTTCC AGGGTAGCCG CGGCTATTAC TTATCCTGCT
GATTTTCTTT TAGTTGGCAG TATGAATCCC TGTCCATGCG GGTATTATGG TGACCAGGTA
AAGGAATGCC TTTGTACCCC CCATCAAGTA GCGCAATACC GGAAGAGGTT ATCCGGACCT
CTACTGGATC GCATTGACCT GCACCTGGAG GTGCCCCGTT TGACCTACAG GGAGGTTGAA
GCCGTGATCC CGGCAGAAAA TTCAGTTACC GTGAGAAAAA GAGTACAAAT TGCCCGCCAG
CGCCAGTTGG AAAGGTTAAA GGGTACCGGG GTTACCTGTA ATGCCGCCAT GACTCCCCAG
CAGGTGCATC GCTTTTGCCG ACTGGCACCC CAGGCCCGGA GCATATTACG GGATGCCTTC
AATAAATTAG GCCTCTCTAT GCGTGCCCAT GATCGTTTGC TCAAAGTGGC ACGCACTATA
GCCGACTTGG CCGGGGAGGA AACAATCACC GCAGCCCACC TGGCTGAAGC CATCCAGTAC
CGCAGCCTGG ACTGGGGTGA GAGGGCGTAA
 
Protein sequence
MLAIVNSVVL VGLEGQSVRV EVDISNGLPL CDIVGLPDPS VREARERVRA AIKNSGFDFP 
LRRIIVNLAP GDIKKEGPIY DLPIALGILM AAEAIGSGPE LAIYAVGELS LEGSLRPIPG
VLPMALALQE IQPGAIFIVP AANANEAALA TRLRVLAAES LAQVVAYWRG EGELQEIKAA
GDSPPPACRG VDLADIKGQV AAKRGLEIAA AGGHNILLIG SPGAGKTMLA RSLPTILPPL
TYEEALTVTK IYSAAGLLAP GQGLVTERPF RTPHHTASTA SIIGGGRFPK PGEVSLATHG
VLFLDEMAEY RRDVLEALRQ PLEDRVVTVS RVAAAITYPA DFLLVGSMNP CPCGYYGDQV
KECLCTPHQV AQYRKRLSGP LLDRIDLHLE VPRLTYREVE AVIPAENSVT VRKRVQIARQ
RQLERLKGTG VTCNAAMTPQ QVHRFCRLAP QARSILRDAF NKLGLSMRAH DRLLKVARTI
ADLAGEETIT AAHLAEAIQY RSLDWGERA
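
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following Python sketch is one way to strip the spacing, compute a GC fraction, and translate the nucleotide sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the NCBI ordering of amino acids, whose codon assignments are the same for translation table 11 as for the standard code. Alternative start codons are not handled.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCTGGCCA TTGTTAACTC AGTCGTCCTG GTAGGCCTGG AGGGGCAGAG CGTACGGGTA"
last_line = "CGCAGCCTGG ACTGGGGTGA GAGGGCGTAA"

print(translate(first_line))
print(translate(last_line))
```

Translating the first line gives MLAIVNSVVLVGLEGQSVRV and the last line gives RSLDWGERA, which match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` should allow a comparison with the listed GC content.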