Gene Mchl_3666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_3666 
Symbol 
ID7115655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp3861696 
End bp3862664 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content72% 
IMG OID643526401 
Productaliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein 
Protein accessionYP_002422413 
Protein GI218531597 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.262321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGA CCCGCCGCCA CTTCGCCCTC TCATCGAGCG CCGCGCTCGC CGCCGGCCTC 
GTCCTCGGTC GCTCCGGCCC GGCCCGCGCC GGACGGACGG TGAAGCTGAG TTATCAGCGC
TCCTCGACGC TCCTCACCGT GCTGAAGGCG CGGGGCACCC TGGAGGAGCG GCTCGGCGCG
CAAGGGCTTA GCGTGAGCTG GCACCTGTTC ACCAAGGTGC TCGAACCGAT GAACACCGGC
GCGGTCGATC TCCACGCCGA TGTGGCCGAC GCGGTGCCGA TCTTCACCCA ATCGGCAGGG
GCCCCGCTGA CCTTCTACGC CATGGAGGCC GGTTCGCCGC GGGCCGAGGC GATCATCGTG
CCGGACGAGT CGCCGATCCG CACGGTCGCG GATCTGAAAG GCCGCACGGT CGGCGTCTCG
AAGGGCTCGG GCTGCCACTT CATCCTCGCG GGGGCGCTGA AGCGGGCGGG CCTGCGGTTC
GCCGACATCC GCCCGGCCTA TCTGGAGGCG GCGGACGGGC TCGCCGCATT CGAGCGGGGC
GGCATCGAGG CGTGGTCGAT CTGGGATCCG TTCCTGGCCA TCGTGCAGGC CAAGCGCCCG
GTGCGAGTGC TGGCCGACGC CACCGGCCTG TCGAGCTACA ACCGCTACTA CACGGTCAAC
GACAGCTTCG CCGCCGAGCA GCCGGAGGTC GTCGCCACGG TCTTTTCCGC CCTGGTCGAG
GCGGGACAAT GGGTGAAGGC CAACCCGTCG GCGGCCGTTG CGCTGCTGGC GCCGATCTGG
GGAGACCTGC CGCCGGCGGT GGTCGCCACC GTCAACGAGC GGCGCTCCTA CGCGGTCAAG
GCGGTCGATC GGGCCGCGCT CTCCGAGCAG CAGGCGATCG CCGACACCTT CCACGAGGCC
GGGCTGATCC CGCGCCGGCT CGACGCCACC GCCGTATCGC TCTGGCAGCC GCCGGCAGGA
CGCGGGTGA
 
Protein sequence
MSLTRRHFAL SSSAALAAGL VLGRSGPARA GRTVKLSYQR SSTLLTVLKA RGTLEERLGA 
QGLSVSWHLF TKVLEPMNTG AVDLHADVAD AVPIFTQSAG APLTFYAMEA GSPRAEAIIV
PDESPIRTVA DLKGRTVGVS KGSGCHFILA GALKRAGLRF ADIRPAYLEA ADGLAAFERG
GIEAWSIWDP FLAIVQAKRP VRVLADATGL SSYNRYYTVN DSFAAEQPEV VATVFSALVE
AGQWVKANPS AAVALLAPIW GDLPPAVVAT VNERRSYAVK AVDRAALSEQ QAIADTFHEA
GLIPRRLDAT AVSLWQPPAG RG