Gene M446_4387 details

Gene Information

Locus tag: M446_4387
Symbol:
ID: 6132440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4832085
End bp: 4833407
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 66%
IMG OID: 641644525
Product: capsular polysaccharide biosynthesis protein-like protein
Protein accession: YP_001771163
Protein GI: 170742508
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
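
The coordinate and length fields above can be cross-checked with a little arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the untranslated stop codon. Both conventions are inferred from the numbers and are not stated in the record.

```python
# Values copied from the record above.
start_bp = 4832085
end_bp = 4833407
gene_length_bp = 1323
protein_length_aa = 440

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```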


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.329255
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACCA GCATCGCCGG CTCCGCCTGG CTATACAAGC AATCCCTCGA CGATCCCGGC 
CGGATCGTCA AACTCGATTC GAGCGGGAAA ATCCTCGGGC GGGATCATTC GCTCGAGTGC
AATTGGCTCT ATGTTGACGG ACGATTAGTG TTCCGTGACT TTTCCGGGCA TGACGTGGCG
GTTTTTCCGC AACCGCTGTT CTTCGATGCC CCGCGCCGGC TGACGGGGTG GTCGGCCTCG
TGCGCGGACG CGCCACGCTG GCTGCTGCGC CTCGACGACC TCGTCCCGCT GCAATCCGGC
TCCGCCTACG ACCTCGCCGC GGAGGTGCGG GAAGTGGCGC ACGCCGTCGC GTTCACCTGC
CCGGGCCCGG CGTACGGCGA CGGCGATGCC GCGCCGACCG GATACACGGC CGGCAAGTAC
AACATGCTGA CCTTGCGGGA CGTCGATCTG CTCACGACGC ACGGGGCGAT CGAGAAGGCC
GGCGTGGTCG CCGAAGAATC TCTGTTCCAC TTTCCATTCC ACCTCGAGGA TCGCTTCATT
CGCTTCCCGG ACGGGACATT CGCGCGCAAT GCGGGCGATC CGCAGCTCTC GATCGCGCGG
GCGCGCTACG CCTGCGCGGG GATCAACGAG AATTACTATC ATTGGATGAT GTTCTTCGTG
GGCAAGATCT GCCTGCATGG CGAGACCCGC GCCTCCGGCG AAGGGGCGGT CGTCCTGGTT
CCCGAGTATC GGAACGACGT GCAGAAGCGG ACCGCGGAGC TGGTGGCGAA GGCCTACGGC
TTGAACTTGG TTCAGCTCCG GCGCTGCGAC CGCGTGCGGG TCGACGAGTT GCTGCTCCCG
CATCAGCACG GCTCCTACGG GATCGATCCC CATCCCGTCG TCCTCAGGGC CTTCGCGCTG
ATCAAGGCGG CCCAGGCCTC CGCCGCGCCC GGCGCCGGCC GCAGGCTCTA CATCTCGCGG
GCCGACTCGC ATTACCGGCG GCTCGAGAAC GAGCGAGAGA TCGAGACGCT CCTGGCCGGT
CGCGGCTTCG ACGTCGTCAG ACTGGCTGAC AGGACCCTCG AACAGCAGAT CTCATTGCTT
GCGACGGCCG AAGTGGTGGT GTCCCCCCAT GGGGCCGGCC TCACCAATCT TGGTTATTGC
GAGCCCGGAA CCAAGGTCTT GGAGTTCCAC AGCCCGCAAT ACCTAAACTG GTGCATGCGC
AATTTGTCGA TCGCGGCCGG CCTGAGATAC GGCTTCCTGA TGGGCGAGGC CACCGAGGGC
GACCGCTACC GGGTCGCGGT CGCGGCCGTC GACGCGGCCG TGACGGCGAT GCTCGCGGCC
TGA
 
Protein sequence
MATSIAGSAW LYKQSLDDPG RIVKLDSSGK ILGRDHSLEC NWLYVDGRLV FRDFSGHDVA 
VFPQPLFFDA PRRLTGWSAS CADAPRWLLR LDDLVPLQSG SAYDLAAEVR EVAHAVAFTC
PGPAYGDGDA APTGYTAGKY NMLTLRDVDL LTTHGAIEKA GVVAEESLFH FPFHLEDRFI
RFPDGTFARN AGDPQLSIAR ARYACAGINE NYYHWMMFFV GKICLHGETR ASGEGAVVLV
PEYRNDVQKR TAELVAKAYG LNLVQLRRCD RVRVDELLLP HQHGSYGIDP HPVVLRAFAL
IKAAQASAAP GAGRRLYISR ADSHYRRLEN EREIETLLAG RGFDVVRLAD RTLEQQISLL
ATAEVVVSPH GAGLTNLGYC EPGTKVLEFH SPQYLNWCMR NLSIAAGLRY GFLMGEATEG
DRYRVAVAAV DAAVTAMLAA
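
The sequences above are listed in blocks of ten characters, sixty per line, so they need the whitespace removed before any analysis. The sketch below is a minimal illustration in plain Python; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool associated with this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGGCCACCA GCATCGCCGG CTCCGCCTGG CTATACAAGC AATCCCTCGA CGATCCCGGC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Running the same two helpers over the full gene sequence listing should give a length that can be compared with the reported gene length of 1323 bp and a GC fraction that can be compared with the reported GC content of 66%.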