Gene Mlg_1160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1160 
Symbol 
ID4270666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1358659 
End bp1360101 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content71% 
IMG OID638125909 
Productmolybdopterin biosynthesis protein MoeB 
Protein accessionYP_741999 
Protein GI114320316 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase
[COG1977] Molybdopterin converting factor, small subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.284227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA CCAGCGTAAC CGTGCGCCTG CCGGCGCCGT TGCGGCCCTT CGCCGGGGAC 
CAGCCGGAAC TGCACGTCCC GGGCGCCAGC GTGGGCGAGG TGCTCGAGGC CCTGGCCCGC
GACTACCCGC TGCTGCACGC CCGTCTGGTG GATCCGGACG GCGAGTTGCG GGCCTTCGTC
AATCTTTTCC GGGGTGAGCA GGACGTACGG GAGCTGGCGG GGCAGAGGAC GCCGCTGCGG
CCGGGCGATG TGCTGACCGT GTTGCCGGCG GTGGCCGGCG GGGCGCCCTC CGCCCTGGAG
CGCCTCTCGG CGCGGATTCG CCGGGAGGTG CCGGAGGTCA CACCGGCGGA GGCGCAGAAG
CTGGCGGCGC AGGGGGCAGT GCTGCTGGAT GTGCGGGAGG CCGGGGAGGT GGCAGAGGGC
AGCCCCACCG GCGCGCTGCG CATCGACCGC AACTGGCTGG AGTTGCGCAT CGAGGAGGCG
GTGCCCGAGC CGGAACGGCC CATCCTTACC CTGTGCGCCG TGGGACAGCG CTCGCTGCTG
GCGGCGGACG ACCTGCGTCG CCTGGGCTAT CGCGACGTGC GCAACATCGC CGGCGGCTTT
AACCGCTGGA AGGACGAAGG CCTGCCCTTC GAGGTGCCGC GGGTGCTGGA TGACGCCTCG
CGGGCCCGCT ACGCCCGCCA CCTGCGCATG CCCGAGGTGG GTGAGGCGGG GCAGCTGCGC
CTGGGCGAGA GCCGGGTGGT GCTGGTGGGG GCCGGAGGGC TGGGTTCGCC GGCGGCGCTC
TATCTGGCCG CGGCCGGGGT GGGCACCCTG GTGCTGGTCG ACCATGACGT GGTGGACCGC
AGCAACCTGC AGCGCCAGAT CCTGCATACC GACGACCGGG TCGGCCAGCC CAAGACGGAG
TCCGGGCGGC AGGCGGTGGC CGCGCTTAAC CCCCAGGTGC GCGTGGAGGC CGTCCAGGCC
CGGCTGAACA GCGAGAACAT CGAGGCCGTG CTCGCCGGCG CCGACTTGGT GATCGACGGC
TCAGATAACT TTCCCACCCG CTACCTGGTC AATGACGCCT GCGTGAAACT GGGCCTGCCG
CTGGTCTACG GCGCGGTCTA CCGGTTCGAG GGTCAGGTCA CGGTGTTCAA TGTGGATGAC
GGGCCCTGCT ACCGCTGCCT CTATCCGGAG CCGCCCCCGG CGGAGCTGGC CCCATCCTGT
GCCCAGGCCG GGGTGCTGGG CGTGCTACCG GGGGTGATTG GGCTGCTGCA GGCCACGGAG
GCGGTCAAGC TCCTGCTGGG TGTGGGGGAG CCGCTGTCCG GTCGACTGGT GCACTACGAT
GCGCTGCGGG GGCAGTTTCA GCAATTGCGG ATGAAGGCCA ACCCCGATTG CCCCGTTTGC
GCCCCCGGGC GTCCGTTCCC AGGTTATGTG GACTACGAGG CCTTCTGCAG CAGTTCGGCC
TGA
 
Protein sequence
MSETSVTVRL PAPLRPFAGD QPELHVPGAS VGEVLEALAR DYPLLHARLV DPDGELRAFV 
NLFRGEQDVR ELAGQRTPLR PGDVLTVLPA VAGGAPSALE RLSARIRREV PEVTPAEAQK
LAAQGAVLLD VREAGEVAEG SPTGALRIDR NWLELRIEEA VPEPERPILT LCAVGQRSLL
AADDLRRLGY RDVRNIAGGF NRWKDEGLPF EVPRVLDDAS RARYARHLRM PEVGEAGQLR
LGESRVVLVG AGGLGSPAAL YLAAAGVGTL VLVDHDVVDR SNLQRQILHT DDRVGQPKTE
SGRQAVAALN PQVRVEAVQA RLNSENIEAV LAGADLVIDG SDNFPTRYLV NDACVKLGLP
LVYGAVYRFE GQVTVFNVDD GPCYRCLYPE PPPAELAPSC AQAGVLGVLP GVIGLLQATE
AVKLLLGVGE PLSGRLVHYD ALRGQFQQLR MKANPDCPVC APGRPFPGYV DYEAFCSSSA