Gene Information |
Locus tag | Mchl_3666 |
Symbol | |
ID | 7115655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 3861696 |
End bp | 3862664 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643526401 |
Product | aliphatic sulfonates family ABC transporter, periplasmic ligand-binding protein |
Protein accession | YP_002422413 |
Protein GI | 218531597 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
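The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
# Sketch only: helper names are hypothetical, not part of any database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(3861696, 3862664)
print(length_bp, protein_length(length_bp))  # prints: 969 322
```

Both values agree with the Gene Length (969 bp) and Protein Length (322 aa) rows of the record.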
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.262321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGA CCCGCCGCCA CTTCGCCCTC TCATCGAGCG CCGCGCTCGC CGCCGGCCTC GTCCTCGGTC GCTCCGGCCC GGCCCGCGCC GGACGGACGG TGAAGCTGAG TTATCAGCGC TCCTCGACGC TCCTCACCGT GCTGAAGGCG CGGGGCACCC TGGAGGAGCG GCTCGGCGCG CAAGGGCTTA GCGTGAGCTG GCACCTGTTC ACCAAGGTGC TCGAACCGAT GAACACCGGC GCGGTCGATC TCCACGCCGA TGTGGCCGAC GCGGTGCCGA TCTTCACCCA ATCGGCAGGG GCCCCGCTGA CCTTCTACGC CATGGAGGCC GGTTCGCCGC GGGCCGAGGC GATCATCGTG CCGGACGAGT CGCCGATCCG CACGGTCGCG GATCTGAAAG GCCGCACGGT CGGCGTCTCG AAGGGCTCGG GCTGCCACTT CATCCTCGCG GGGGCGCTGA AGCGGGCGGG CCTGCGGTTC GCCGACATCC GCCCGGCCTA TCTGGAGGCG GCGGACGGGC TCGCCGCATT CGAGCGGGGC GGCATCGAGG CGTGGTCGAT CTGGGATCCG TTCCTGGCCA TCGTGCAGGC CAAGCGCCCG GTGCGAGTGC TGGCCGACGC CACCGGCCTG TCGAGCTACA ACCGCTACTA CACGGTCAAC GACAGCTTCG CCGCCGAGCA GCCGGAGGTC GTCGCCACGG TCTTTTCCGC CCTGGTCGAG GCGGGACAAT GGGTGAAGGC CAACCCGTCG GCGGCCGTTG CGCTGCTGGC GCCGATCTGG GGAGACCTGC CGCCGGCGGT GGTCGCCACC GTCAACGAGC GGCGCTCCTA CGCGGTCAAG GCGGTCGATC GGGCCGCGCT CTCCGAGCAG CAGGCGATCG CCGACACCTT CCACGAGGCC GGGCTGATCC CGCGCCGGCT CGACGCCACC GCCGTATCGC TCTGGCAGCC GCCGGCAGGA CGCGGGTGA
|
Protein sequence | MSLTRRHFAL SSSAALAAGL VLGRSGPARA GRTVKLSYQR SSTLLTVLKA RGTLEERLGA QGLSVSWHLF TKVLEPMNTG AVDLHADVAD AVPIFTQSAG APLTFYAMEA GSPRAEAIIV PDESPIRTVA DLKGRTVGVS KGSGCHFILA GALKRAGLRF ADIRPAYLEA ADGLAAFERG GIEAWSIWDP FLAIVQAKRP VRVLADATGL SSYNRYYTVN DSFAAEQPEV VATVFSALVE AGQWVKANPS AAVALLAPIW GDLPPAVVAT VNERRSYAVK AVDRAALSEQ QAIADTFHEA GLIPRRLDAT AVSLWQPPAG RG
|
| |
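The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. As a minimal sketch, the code below strips the whitespace, computes the GC fraction, and translates codons using the amino acid assignments of translation table 11, which are the same as those of the standard code. It does not handle alternative start codons or ambiguous bases, and the helper names are hypothetical. Only the first 30 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (T, C, A, G at each position).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalize case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
fragment = "ATGAGCCTGA CCCGCCGCCA CTTCGCCCTC"
print(translate(fragment))    # prints: MSLTRRHFAL
print(gc_fraction(fragment))  # GC fraction of this fragment only
```

The translated fragment matches the first ten residues of the listed protein sequence. Running `gc_fraction` on the full gene sequence would give the value to compare against the GC content row.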