Gene Moth_2137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2137 
Symbol 
ID3833137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2234786 
End bp2236015 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content64% 
IMG OID637830062 
Productmolybdopterin molybdochelatase 
Protein accessionYP_430972 
Protein GI83590963 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTTT TTCAAGTAAT AACCCTGGCG GAGGCCCGCA GGCAATTGGG GCGCTACTGG 
CCCCTACCTG GGCGGCGGGA GATGGTTGTA CCCCTGACGG AAGCCCTGGG ACGGGAACTG
GTACGACCGG TGGTGGCCGG GGAAGATGTC CCCGGTTTCG ACCGCGCCAC CATGGACGGC
TATGCCGTCC GGGCGGTTGA TACCTTTAGC GCCCGGGAAG GTGAACCCGT CCTTTTGCGC
CTGGCCGGGG AGGTACCCAT GGGGCAAAAA GCCGGAGTCA GGGTTAACCC CGGAGAAGCC
GTAGCCGTGG CCACAGGAAG TATGCTACCG CCCGGGGCGG ATGCGGTGGT TATGATCGAA
AACACCGAGG TCCTCGAGGA CGATCGGGTC GCCGTCTATA AGCCGGCAGC CCCCGGCCAG
GACCTGGTAC GGCAGGGATC CGACGTCCGG GCCGGTGCCA CCGTCCTCGA GGCCGGCCAT
CGCCTGCGCC CCCAGGACCT GGGCGTACTG GCCAGCCTGG GCATCAACAG GGTTCCGGTA
TACGAACCCT GGCGGGTGGG CATCCTGGCC ACGGGGAATG AAATCGTCCC CCCGGAGGTC
CAGGCTGGTC CCGGCCAGGT GCGGGATATT AATTCCTATA CCCTTTACGG CCTGGTACGT
GATTGCGGTG CTGAAGCCAC CCTCTACGGC ATCGCCCCCG ACGACCTGGA AACCTTGACA
GCCCGGGTCC AGGAAGCCCT GGCGGAAAAC CACCTGGTGC TCCTCTCCGG AGGAAGTTCC
GTCGGCACCC GGGATTTAAC CGTGCAGGTC CTGGCTGGCC TGGGACAGCC GGGTATCCTC
TTCCACGGCT TAGCCATTCG CCCGGGTAAA CCAATCCTGG CGGCCCTGGC CGGCACAAAG
ATGGTCTTCG GCCTGCCCGG TCACCCGGTT TCCGCCATGG TGAGCTTTAA GGTCCTCCTC
GAACCCCTGC TACGTTACGG CGGTTATGAG GGCCCGGCCG GCAGGGGGAC GGTCACGGCG
ACCCTGGGCA GTCCAATTCC TTCTACTCCC GGCCGGGAGG ACTATATCCG CGTCCGCCTG
GAAGCAGGCC CGGATGGGTT CCTGGCCGTA CCGGTACCAG GAGGTTCGAG TATAATCTCA
TCCATGATCC AGGCCGACGG CCTGGTAACC ATTCCCCTGG AGGAAGAGGG CCTGGAAGCC
GGTACGAAGG TAGAGGTGGA ACTCTTTTAG
 
Protein sequence
MELFQVITLA EARRQLGRYW PLPGRREMVV PLTEALGREL VRPVVAGEDV PGFDRATMDG 
YAVRAVDTFS AREGEPVLLR LAGEVPMGQK AGVRVNPGEA VAVATGSMLP PGADAVVMIE
NTEVLEDDRV AVYKPAAPGQ DLVRQGSDVR AGATVLEAGH RLRPQDLGVL ASLGINRVPV
YEPWRVGILA TGNEIVPPEV QAGPGQVRDI NSYTLYGLVR DCGAEATLYG IAPDDLETLT
ARVQEALAEN HLVLLSGGSS VGTRDLTVQV LAGLGQPGIL FHGLAIRPGK PILAALAGTK
MVFGLPGHPV SAMVSFKVLL EPLLRYGGYE GPAGRGTVTA TLGSPIPSTP GREDYIRVRL
EAGPDGFLAV PVPGGSSIIS SMIQADGLVT IPLEEEGLEA GTKVEVELF