Gene Moth_1820 details


Gene Information

Locus tag: Moth_1820
Symbol:
ID: 3832789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1879495
End bp: 1880790
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 67%
IMG OID: 637829750
Product: hypothetical protein
Protein accession: YP_430663
Protein GI: 83590654
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
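
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function name cds_lengths is hypothetical and is not part of the database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


print(cds_lengths(1879495, 1880790))  # (1296, 431)
```

With the coordinates listed for Moth_1820 this reproduces the 1296 bp and 431 aa given in the record.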


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGGAG TAAACGCTTC CTTTAGCCTG GAGGAAGGGC GGGAACTCCT CCTGGCCGGG 
TTAAAACCGA CGGGAAGGGT GAAAATAAAA CTGGCTGAGG CCCCGGGCCG CATCCTGGCG
GAACCGGTGG TGGCGCCGTG CTCCTTCCCG CCTTTCCCGC GTTCCCGGGT GGACGGTTAC
GCCCTGGGAC TGCCCCCGGC CGCCGGCGGG GCATCGGGAA AAATCTACCG GCTGGTGGCT
ACAGTAGCGG CAGGTAGCTG CCCGGCTGTC ACCCTGGGAC CGGGTACGGC GGCGGCTATT
TTCACCGGCG CCCGGCTCCC GGAAGGCACA ATAACCGTTA TTCCCAGAGA ACTGGCGGAG
CGCCAGGGGG ACCTGGTCAT AGTGCCGGAG CTGCCCCCGG GGGGCAGGTT CATGGAAGCT
GCCGGATCGG AGGTGGCGGC CGGGGAAAGG GTCCTGGCCG CCGGCACTGA ACTGGGACCG
GCGGAAATCG GCCTCCTGGC CGCCCTGGGC CTTACGGAAA TAACCGTCTA CCGTAGTCCC
AGGGCGGTCC TGGCGTCCAG CGGCAGCGAA CTGGTGGAAT TGCCTGGCCT CCGGGGAGGC
TGCCCCGGCG GCCGGCAGGC AGGTGGCGAG GCGATAAGCC CTGTTGCCAG CCAGGTTCGA
GCACTCGGGC CCCGCATTTA TAATAGTAAT TTCTATGCGC TGGCGGCAGC CGCCAGCCGC
GATGGCGCTC GGGTAATCCC CCTCGGGCCG CTGGCCGATG AACTGGAGGA ACAGGTAGAG
GCTTACCGGA AAGCCCTGGA GGAGGGCGAT GTACTTCTAA CTACCGGTGG AGCCGGCGGT
AGCATCCGTG ACCTGACGGC GGCGGCCTTT ACCGGTGCCG GGGGAGAAAT CCTCTTTACG
ACAATCCGGA TGCGCCCGGG CCGGCGGGTG ATAGCCGCCC GCCGGGGGGA TAAATTACTC
CTGGGCCTGC CGGGCAATCC ACCGGCAGCG CTGGTGGCTT ACTACCTCCT GGCGGCACCG
GTGATTCGCG CCCTGGGGGG GAGGGAAGTC CTACCGGCAA CCTTCCCGGC GGTACTGACG
GCGGCCATAG ATAAACCGAG GCCCGAACGG GCCTTTATCT GGGCCCGGGC CTGGCCCGGT
ACGACCGGCT GGCAGGTAGC GCCCCTGCCG CGTCGCCCGG GGGGTATCCG CGCCGCTATT
GGGGCCAACG CCCTCATTGA CCTCCCCGCC GGCCCGGCTC CTGGAGCCGG GGAAGAGGTA
AGGGTGGTGC TGCTTACTGC CCGGGGTTCA CAATAA
 
Protein sequence
MAGVNASFSL EEGRELLLAG LKPTGRVKIK LAEAPGRILA EPVVAPCSFP PFPRSRVDGY 
ALGLPPAAGG ASGKIYRLVA TVAAGSCPAV TLGPGTAAAI FTGARLPEGT ITVIPRELAE
RQGDLVIVPE LPPGGRFMEA AGSEVAAGER VLAAGTELGP AEIGLLAALG LTEITVYRSP
RAVLASSGSE LVELPGLRGG CPGGRQAGGE AISPVASQVR ALGPRIYNSN FYALAAAASR
DGARVIPLGP LADELEEQVE AYRKALEEGD VLLTTGGAGG SIRDLTAAAF TGAGGEILFT
TIRMRPGRRV IAARRGDKLL LGLPGNPPAA LVAYYLLAAP VIRALGGREV LPATFPAVLT
AAIDKPRPER AFIWARAWPG TTGWQVAPLP RRPGGIRAAI GANALIDLPA GPAPGAGEEV
RVVLLTARGS Q
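
The gene sequence starts with GTG while the protein sequence starts with M, which fits translation table 11: GTG is an alternative start codon there and is read as methionine at the first position. The sketch below illustrates how the gene sequence maps to the protein sequence. It is a simplified illustration, not the database's own method. The names CODON_TABLE and translate_cds are hypothetical, the elongation codon assignments are those of the standard genetic code (which table 11 shares), and the first codon is assumed to be a valid start codon and is always emitted as M.

```python
from itertools import product

# Standard genetic code in NCBI ordering (bases T, C, A, G; first base
# varies slowest). Table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds):
    """Translate a CDS, ignoring whitespace, until the first stop codon.

    Simplification: the first codon is assumed to be a start codon and is
    translated as M regardless of which codon it is.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGCTGGAG TAAACGCTTC CTTTAGCCTG"))  # MAGVNASFSL
```

The output matches the first ten residues of the protein sequence in the record.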