Gene Mchl_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4026 
Symbol 
ID7118031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4236727 
End bp4238055 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content69% 
IMG OID643526745 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_002422754 
Protein GI218531938 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGGA TCAATCTTGA AGGTATCAGC AAGATATTCG GCTCGAACCC CTCCAAGGCG 
CTCGATCTGA TCGGCCAGGG GAAGCGCAAG GGCGACATCG CCGCCGCCTG CGGTGCGGTC
GTCGGATTGC GCGACATCTC GTTCGACATC GAAGAGGGCG AGATCCTCGT CCTGATGGGC
CTGTCCGGCT CCGGCAAGTC GACGCTCCTG CGCTGCATGA ACCGTCTGGT CGAGCCGTCC
TGCGGCCGGA TCGTCGTGGA CGGGGTGGAC GTGACCCGGC TCGGCCGCAA GGAGCTGCTC
GCCTTCCGTC AGAAGACCTT CGGCATGGTC TTCCAGCACT TCGCGCTGCT GCCCAACCGG
ACCATCCTCG GGAATGTCGG GTTCGGCCTC GAGATCAAGC AGGTCCCGGC CAAGGAGCGG
ATCGAGCGGT CGATGCAGGC CATCGAACTC GTCGGCCTCA AGGGCTGGGA GACGAAGTAT
CCCAATGAAT TGTCGGGCGG CATGCAGCAG CGGGCGGGCC TCGCGCGGGC GCTCGCCGCC
GATGCCGACA TCCTGCTCAT GGACGAGGCC TTCAGCGCCC TCGACCCCCT GATCCGCCGC
GATATGCAGG CGGAGCTGCG CGACCTCCAG CGCAAGCTCA AGAAGACCAT CGTCTTCGTC
TCGCACGACC TCGACGAGGC CATCGCGCTC GGCGGCCGCA TCGTCCTGAT GAAGGACGGC
GAGGTGGTGC AGATCGGGCA GCCCGAGGAC ATCGTGGCCC GCCCCGCGAC CGACTATGTC
GAGCGCTTCG TCGAGCATAT CGATCTCGCC GCCGTGCTGC GGGCGGAGCA GGTCGCGGAT
CGCTCCGCCC CCGTGCTCGC CCCCACGCAG ACGGTGGCCG AGGCGCGGAC TGCGCTCGGC
GCGGCAGGTA ACCGCACAAG CGGCCGGGCG TGGCTCGTCG CCGACGGGGA CGGACGGCTG
GTCGGCCGCA TCTTCGCCGA GAGGCTCGCC TCCGCCCGGC CGGCCGAGAC CCTCTCCAGC
CTGCTCGATC TCGGGCAGTC CGTCGTCGAG GCGGACAGCC GGCTGGACAC CATCCTCGCG
ACGGTCGCCG CCGAGGAATC CGTCGCGGTC GTGGGCCGGA ACGGACGCCT GATCGGCTCC
ATCACCAGCC GCGACGTCGT GCAGGCACTC GCCGCGCGGC CCGGCACGCA CGCGCAGCCG
CATGCCGGTG CCCCGATCCT CTCAAAGCCG TCAGGAGCCC CGACATGGAG TGGAACGTCC
CCAAATTCCC CCTCGACACG CTCAGTGACA ACGGCCTCGA CTGGCTCACC GAGCATGGCA
GTTGGCTGA
 
Protein sequence
MGRINLEGIS KIFGSNPSKA LDLIGQGKRK GDIAAACGAV VGLRDISFDI EEGEILVLMG 
LSGSGKSTLL RCMNRLVEPS CGRIVVDGVD VTRLGRKELL AFRQKTFGMV FQHFALLPNR
TILGNVGFGL EIKQVPAKER IERSMQAIEL VGLKGWETKY PNELSGGMQQ RAGLARALAA
DADILLMDEA FSALDPLIRR DMQAELRDLQ RKLKKTIVFV SHDLDEAIAL GGRIVLMKDG
EVVQIGQPED IVARPATDYV ERFVEHIDLA AVLRAEQVAD RSAPVLAPTQ TVAEARTALG
AAGNRTSGRA WLVADGDGRL VGRIFAERLA SARPAETLSS LLDLGQSVVE ADSRLDTILA
TVAAEESVAV VGRNGRLIGS ITSRDVVQAL AARPGTHAQP HAGAPILSKP SGAPTWSGTS
PNSPSTRSVT TASTGSPSMA VG