Gene M446_5945 details


Gene Information

Locus tag: M446_5945
Symbol:
ID: 6132761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6535756
End bp: 6536988
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 72%
IMG OID: 641646047
Product: capsular polysaccharide biosynthesis protein-like protein
Protein accession: YP_001772659
Protein GI: 170744004
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
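
The coordinates and lengths listed above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is an untranslated stop. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6535756
end_bp = 6536988

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS ends with a stop codon, which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1233 410
```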


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.146298
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACCCG AACACGATCT GGTCCCGGTC TCATGCGACC TCGACCGCCT GTACCGGTCG 
GACGGCTCCT CCGAACTGGC GGACGCGCTG CCCCCGATGG AGCCCTCCAG CCCCGACATC
GACTTCGTCG ACGACCTGTT CCCCGCCTAC GCGTACGAGC GGCCGGCCCC CCTCACGCAG
GTCTGCGGGG ACGAGGAGGC GGCGCGGCAC ATGCGGCAGA TCGAGAGCGA CATGGCCCGG
GGCGCCCGGC GGGGCGCGGC CTCGGCCCTG TTCCGCATCC GCGACGCCGT CCTGTACGAC
AACGTCATCC ATCTCCTGCG CGGCGCGCGC CGCGCCGTCG TCTACGAGAC CGCGCGGCCG
CAGGACCTCG CGCATTTCCC GCTCGACCAG GCGCCGCACC CGATCCGGGA CCAGGATTCC
TCCGACGGCG CCCTCAACCT CGTCTTCACG AACTCCGCCT CGTTCAATTA CGGCCACTGG
CTGGTGGAGG ATCTGCCGCG GCTGAAGGCG GTCCGGGTGC TCCGGCGCCG CTTTCCCGGC
CGGCCCATCA ACCTGATCAT CACGACCTAT CACGAGATCA TCGACCAGGT GCGGCTGCGC
TCGATCAAGC TGATGCTCGA GGGCCTGCGG GGGATCCGGA TCGTGACGAT CACGCGCGAC
CAGCCGCTGC ATTTCGACGT GCTGCACTTC GCCTCGCCGA TCGCCCTGCA CCCGGTGCTG
AAATCCCCCG AGGCGCTCGC CTTCCTGGCC GGGACGCTGC GGCGCCGGGT GCTGCTCGCG
CGCCTGCGCA TCGCCCGCGA CGCCCTGCTG GCGACGCCCC GGCGCCGGCC CCTGCGGCGG
CGGCTCTTCG TCGACCGCGC GCCGGATTAC GGGCGCCGCC TCCTCAACCG GGACGACGTG
CTGGCGCTCC TGTCCGGCGA GGGTTTCGAG GTGGTCGATC CCCTGACCCT GCCGTTCGGC
CAGCAGGTCG CGCAGTTCGC CGATGCCGGG GTGGTGGTGG GCGGCATGGG GGCCGCCATG
ACCAACACGC TGTTCAGCCT GCCCGGGACG CAGGTGATCC ACCTCGCGGC CGAGGGCTGG
AACGACCCGT TCTTCTGGGA CCTCGCGGCG GTGCGCGGGC ACCGCTACCA CGCGCTCTAC
GGCGCGAGCG ACTCGAAGGA GCGGCCGAAC CACGGCGCCT TCACGATCGA CCTCGACGCC
CTGCGGGCCG CCCTGCGGGC GGCGACCGCC TGA
 
Protein sequence
MAPEHDLVPV SCDLDRLYRS DGSSELADAL PPMEPSSPDI DFVDDLFPAY AYERPAPLTQ 
VCGDEEAARH MRQIESDMAR GARRGAASAL FRIRDAVLYD NVIHLLRGAR RAVVYETARP
QDLAHFPLDQ APHPIRDQDS SDGALNLVFT NSASFNYGHW LVEDLPRLKA VRVLRRRFPG
RPINLIITTY HEIIDQVRLR SIKLMLEGLR GIRIVTITRD QPLHFDVLHF ASPIALHPVL
KSPEALAFLA GTLRRRVLLA RLRIARDALL ATPRRRPLRR RLFVDRAPDY GRRLLNRDDV
LALLSGEGFE VVDPLTLPFG QQVAQFADAG VVVGGMGAAM TNTLFSLPGT QVIHLAAEGW
NDPFFWDLAA VRGHRYHALY GASDSKERPN HGAFTIDLDA LRAALRAATA
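
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. The helper name `translate` and the choice to translate only the first and last rows of the gene sequence are assumptions made for brevity, not part of the original record.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above. The last row starts
# at base 1201, so it is still in frame.
first_row = "ATGGCACCCG AACACGATCT GGTCCCGGTC TCATGCGACC TCGACCGCCT GTACCGGTCG"
last_row = "CTGCGGGCCG CCCTGCGGGC GGCGACCGCC TGA"

print(translate(first_row))  # MAPEHDLVPVSCDLDRLYRS
print(translate(last_row))   # LRAALRAATA
```

The two printed fragments match the beginning and end of the protein sequence listed above.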