Gene Mchl_2040 details


Gene Information

Locus tag: Mchl_2040
Symbol:
ID: 7118740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 2137463
End bp: 2138449
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 66%
IMG OID: 643524790
Product: NMT1/THI5 like domain protein
Protein accession: YP_002420815
Protein GI: 218529999
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
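
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2137463
end_bp = 2138449

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 987
print(protein_length)  # 328
```

Both results agree with the reported Gene Length (987 bp) and Protein Length (328 aa).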


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.841099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.44049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTCGA TCCGATTCGA TACGGGGGCC TCCACCCTTC TGGCGGTCAC CCTGGCCCTG 
GCGATGACCG GGCCGGCACA GGCGGCGGAC AAGGTCGTCT TCCTGACGAG CTGGTACGCC
CAGGCCGAGC ACGGCGGCTT CTATCAGGCC AAGGCCACCG GCCTCTACGA GAAGGCCGGG
CTCGATGTCG AGATCCGCAT GGGCGGGCCG CAGGTCAACG GCCTGCAGCT CCTGCTCGCG
GGCGAGGCCG ACGCGATCAT GGGCTACGAC ATCCAGGTGC TCCAGGCGGT CGAGAAGGGC
CTGCCCGTGG TCACCGTGGC GGCTTCGTTC CAGTACGACC TCCAGGGGAT GATGACCCAC
GACGACGTGA CGTCGCTGGC GGACATCAAG GACAGGGCGA TCCTCGTTTC CTCGGCCGGC
ATGACGGCGT GGTGGCCCTG GCTGAAAAAG AAATACGCGC TCTCGGACGC CCAGGTGCGG
GCCTATACCT TCAACCTGCA GCCGTTCTTC GCCGACAAGA ACGTCGTGCA GCAGGCCTAT
CCTTCCTCGG AGCCGTTCCA GGCGCAGGAG AAGGGCGTTC CGGTCAACTT CCATCTCTTC
GCCAGGGACG GTTATCCGCC CTACGGCACC ACGATCGTGA CGACGCGCAA GCTCGCCGAG
GGCAAGCCGG AGGCGATGCG CCGGTTCGTG GCCGCCTCCA TGGAAGGCTG GAAGAGCTAC
ATGGAGAACC CGGCTCCCGC CAACGTGCTG ATCAAGGCGG CCAACCCGAA GATGAGCGAC
GGCCAGATCG CCTTCGGCAT CACCCGGCTG AAGGCACTCA AGGTGCTGGG CGGCGAGGAA
AACGTCCCCA TCGGCACCAT GACGGAGGCC CGCTGGAAGG CGTCCTACGA CTACCTCGTC
GAGGCGGGGC TGCTCAAAGC CTCCACGGAC TGGAAACGGG CCTTCAGCCT CGATTTCATG
CCCGTCCTCT CGGCAAAAGC CGAGTGA
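
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to compute a GC fraction from such text; `gc_fraction` is a hypothetical helper, and the example runs on the first ten-base block only, not on the full gene.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)

# First ten-base block of the gene sequence.
first_block = "ATGCGGTCGA"
print(gc_fraction(first_block))  # 0.6
```

Passing the complete 987 bp sequence to the same function gives the value to compare with the reported GC content of 66%.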
 
Protein sequence
MRSIRFDTGA STLLAVTLAL AMTGPAQAAD KVVFLTSWYA QAEHGGFYQA KATGLYEKAG 
LDVEIRMGGP QVNGLQLLLA GEADAIMGYD IQVLQAVEKG LPVVTVAASF QYDLQGMMTH
DDVTSLADIK DRAILVSSAG MTAWWPWLKK KYALSDAQVR AYTFNLQPFF ADKNVVQQAY
PSSEPFQAQE KGVPVNFHLF ARDGYPPYGT TIVTTRKLAE GKPEAMRRFV AASMEGWKSY
MENPAPANVL IKAANPKMSD GQIAFGITRL KALKVLGGEE NVPIGTMTEA RWKASYDYLV
EAGLLKASTD WKRAFSLDFM PVLSAKAE