Gene M446_0253 details

Gene Information

Locus tag: M446_0253
Symbol:
ID: 6134160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 309797
End bp: 310903
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 77%
IMG OID: 641640580
Product: capsular polysaccharide biosynthesis protein-like protein
Protein accession: YP_001767258
Protein GI: 170738603
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
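
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(309797, 310903)
print(length, protein_length_aa(length))  # 1107 368
```

With the values listed for this gene, the sketch reproduces the reported gene length of 1107 bp and protein length of 368 aa.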


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0270966
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGGGCT TCACGGGACA CAGGCTTCGG CGCTGCCGCG ATCTCTGGGG GGCGTGCCGC 
CTCGTCGAGG CCCCGCCGGC CGCGCGCGTG CTGGAGGACG TGCTCCTCGT GCCGCCCGGC
GGCCCGGTCC CGGCGGGCCT GTTCGCGGCG GGGCGGCGGC TGCGCGGCGA GGGCGGCGAG
GCGGCTCCCG GCGCGCTGCC GGTGGGCGAG CCGGCCCCGG ACGGGCTCTA CCTCTTCGTC
CCGGCGCTCG CCCCCCATTA CGGCCACTTC GTCACCGACA CGCTCGCGCA TCTGTGGCCC
CTCGCCGCCT GGGAGGGGCC GCGGCCGCGC CTCGCCCTGC TGGCGCCGCC CCACGCGCTC
GACGGCGCCG ACTATGCCCG CGTGATCCTG GAGCGCCTCG GCCTCGGCCC CGGCGACCTC
GTGCATTTCG ACCGGCCGGT GCGCCTGCCG CGCCTGCTGC TGCCCGAGCC AGCCTTCGAG
GAGCGGGCCT TCGTGCACGC GGTCTACGGC GCGCTCTGCC GCGAGATCGG CCGGCCCTTC
CGGGACGGTC CGGCCCTGGA CCGCCCGGTC TACCTGACCA AGACGCGGCT GCCCGCCGGC
ATCGCCCGCA TCGCCAACGA GGAGGCGATC GTCGAGGAGC TCGACCGGCG GGGCGTCGAG
ATCGTCGCCC CCGAGACGCT GCCCTTCGTC GAGCAGGTGC GCCTCGTCTC GACGCGCCGG
GTCGTCATGG GCTCGACCGG CTCGGCCTTC CACACCACGA TCTTCGCGGC GCCGGGGCGG
CGGGTGCTCG GCCTCAACTG GACCTGGAAG CTGCACGCGA ACTTCGCGCT GCTCGACGCT
GTCACGGGGA CGGCAGGGCG CTACTACTTC CCGCTCGGCA CCCGCTACGG GGCGGCGGAC
TCCTTCCATT TCGGCTGGCA AGTGCGCGAT CCGCGCGCGG TCGCCGCCGA ACTCCTGGCG
CGGGCCGAGG CCTTCGACCG CCTGGACGCG ATCGACGCGG CCGACGACGC GGCCGACCGC
ACCCTCCTCG GCCGGGTCCG CGGCCTGGTC GAGGACCTGC GCTGGCGCCT CTCCCGGGGG
GGGCGCGGCG TGGCCGGCGG GCCCTGA
 
Protein sequence
MTGFTGHRLR RCRDLWGACR LVEAPPAARV LEDVLLVPPG GPVPAGLFAA GRRLRGEGGE 
AAPGALPVGE PAPDGLYLFV PALAPHYGHF VTDTLAHLWP LAAWEGPRPR LALLAPPHAL
DGADYARVIL ERLGLGPGDL VHFDRPVRLP RLLLPEPAFE ERAFVHAVYG ALCREIGRPF
RDGPALDRPV YLTKTRLPAG IARIANEEAI VEELDRRGVE IVAPETLPFV EQVRLVSTRR
VVMGSTGSAF HTTIFAAPGR RVLGLNWTWK LHANFALLDA VTGTAGRYYF PLGTRYGAAD
SFHFGWQVRD PRAVAAELLA RAEAFDRLDA IDAADDAADR TLLGRVRGLV EDLRWRLSRG
GRGVAGGP
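
The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. The sketch below is an illustration only and makes these assumptions: the sequence is read in frame from its first base, whitespace between the 10-base groups is ignored, and the amino acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons). Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGACGGGCT TCACGGGACA CAGGCTTCGG CGCTGCCGCG ATCTCTGGGG GGCGTGCCGC"
print(translate(first_line))  # MTGFTGHRLRRCRDLWGACR
```

The output for the first 60 bases matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence should yield the complete 368-residue protein, with translation ending at the terminal TGA codon.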