Gene M446_4960 details


Gene Information

Locus tag: M446_4960
Symbol:
ID: 6131158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5438979
End bp: 5440178
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 73%
IMG OID: 641645096
Product: capsular polysaccharide biosynthesis protein-like protein
Protein accession: YP_001771722
Protein GI: 170743067
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
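
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates as listed in the record above.
START_BP = 5438979
END_BP = 5440178


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1200 399
```

Under those assumptions the result agrees with the listed Gene Length (1200 bp) and Protein Length (399 aa).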


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCGC CGACCGCCGG ACAGGTCCTC GCCGGAACCG TCGTCGACGA CCACGCGGTC 
GCGGCGGACT ACCCGGACGC CCGCCGCGTC GCGATGCGCG CCCTGCAGCT CTGGCCGGAC
TTCCGCTGGA CCACGCAGGA GCTCGTGAGC TTCACGGCGC GCGACGTTCT GGTGACGAGC
GCCTTCGTGC CGATCGACAC CCGGCGGAGC CGGATGTTCC TCAACCAATC CTACTCGGGC
GTCGAGACGA TCTTCCCGAC CCGGTTTCGC GTCGAGGAGG GCCGCTACCG GCTGGAGGGA
ACGCCCGTCC CCTGCCCGGG GCGCCACATC GTGCTGGGCG GCCCGATCGA CGGCGTGTGG
TATCACTGGC TGTTCAACTG GTGCCCGCGG CTGCTGCTGC TCGGGCAGCT GCGCCCCGAC
CTGCTGGCGT GCGCGGACCT GCGCATCGCC GTCCATCCCC TCGCGCTGCG GGAGCCGTAC
CGGGCCGTGC TGGACAGTTT CGGCCTCCCG GCCGAGCGCT TCCTGGTCCT CGATCCCGGC
CGGGACCACC TGCTGGAGGA GGCCTGCCTG GTCTCGTTCC TCGACCAGAA CAGATTGTAT
CCCGAGATGA TCCGGGCTTT CGCCGCCCAT CTCCTCGCCG CGTGGGGGCT CGACGGGGCG
GGCGATCCCG GGCCGGGGCC GGCGCGCGGG CCGCTCGCCG CGATCGTCCG CCGCTTCGCC
GGCCCAAACC TCGGCGCCCG CGGGCCCGCG CCTGGAGCGG CGCCGCGCGG GCTGTTCGCG
AGCCGCCAGG ATCTGCCCGC GCCCAAGCGG CGCATCGCCA ATTTCGAGGA GGTCGCCCCG
GTCCTGGCCC GGTTCGGGCT CGACGTGGTG GCCTGCGGCG GGCTGCCGGC GCGGGAGCAG
GCCCGGCTGT TCCGGAGCGC CCGGGTGGTG GTCGGCGGCC ACGGCTCCGA CCTGTCCAAC
CTGCTGTTCT GCCGGCCCGG CACGCGGGTG CTGGTCTTCG AGAGCCGCTT CAGCGTCGAG
GCGTTCCTCC ACCGCGGGCT CGAGCAGCTC TGCGCGCTCC TCGGCCTCGA CTACGTGCTC
AAAATCGTGC CGACCGACGG GGAGGCCGGG CCGGGGGCGG GAACCCAGGC GCGCATCAAC
CAGGATTACC GGATCGACCC CGACGACTTG GCGCGGACGC TGGCCGCGCT GACGCGGTGA
 
Protein sequence
MTPPTAGQVL AGTVVDDHAV AADYPDARRV AMRALQLWPD FRWTTQELVS FTARDVLVTS 
AFVPIDTRRS RMFLNQSYSG VETIFPTRFR VEEGRYRLEG TPVPCPGRHI VLGGPIDGVW
YHWLFNWCPR LLLLGQLRPD LLACADLRIA VHPLALREPY RAVLDSFGLP AERFLVLDPG
RDHLLEEACL VSFLDQNRLY PEMIRAFAAH LLAAWGLDGA GDPGPGPARG PLAAIVRRFA
GPNLGARGPA PGAAPRGLFA SRQDLPAPKR RIANFEEVAP VLARFGLDVV ACGGLPAREQ
ARLFRSARVV VGGHGSDLSN LLFCRPGTRV LVFESRFSVE AFLHRGLEQL CALLGLDYVL
KIVPTDGEAG PGAGTQARIN QDYRIDPDDL ARTLAALTR