Gene M446_0455 details


Gene Information

Locus tag: M446_0455
Symbol:
ID: 6132005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 537749
End bp: 539710
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 75%
IMG OID: 641640778
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001767453
Protein GI: 170738798
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
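
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions rather than something the page states, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 537749
end_bp = 539710
gene_length_bp = 1962
protein_length_aa = 653

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span vs. listed length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. listed aa
```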


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.70203
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0478579
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGAC CGACCCTCAA GAGCGCGGCG ATCGTCGCCC ACGACCTCGT GGCGACCGCC 
CTCGCCGTCA CGCTCACCTT CCTGATCCGC TTCGATGACG CGCGGCTGGC CGAGCGCCTC
GCGCATCTGC CGGTGCTGCT CGGCCCCTTC GTGCTCTATG CCGGGCTCGT CTATCGCTGG
TTCGGGCTCT ACCGCACCAA GTGGCGCTTC GCCTCGCTCC CCGACCTCGC CGCCATCGTG
CGGGCGGTCG CCGTCCTGGC CCTGTCGCTG CTGCTCCTGG ATTACGCGCT GGTCTCGCCC
GCCCTGTTCG GGATCTACTT CTTCGGCAAG ATCGCGATCG GGCTCTACTT CGTCCTGCAG
CTCTTCCTGC TCGGCGGCCC GCGCGTCGCC TTCCGCTACC TGAAATACAG CCGCTCGCGC
CAGAGCCACG CCCGCGCCGC GACGACGCCC GCGCTCCTCC TCGGCCGCGG CGCGGAGATC
GAGGTGGTGC TGCGCGCCAT CGAGGCCGGG ACGGTGCGCA AGCTCGCGGC CCAGGGCATC
CTCTCGCCCC GGGCCGAGGA TCGCGGCCAG AGCCTGCGCG GCGTGCCGGT GCTGGGCGCC
TTCGCCGACC TGGAGCAGGT CGTGGCGGAC CTCGCCGCCC GCGGCGTCGC GGTGCGGCGC
CTCGTGGCGA CGCCGAACGC CCTCACACCC GAGGCCGATC CCGACGGGCT CCTCGCCCGC
GCCCGCCGCC TCGGCCTGCC GCTCGCCCGC GTGACCACCC TCGGCGAGGG CCTGCGCGAC
GCCGAACTGG CGCCCCTCGA GATCGAGGAC CTGCTGCTGC GCCCGACCGT CCCGATCGAC
CGGCCGCGCC TGGAGCGCTT CCTCGCCGGC CAGCGCGTCG TGGTGACGGG GGGCGGCGGC
TCGATCGGCT CGGAGATCTG CGCCCGGGCC GTGGCCTTCG GCGCCTCGGC GCTCCTCGTC
GTGGAGAATT CCGAGCCCGC CCTGCACGGG GTGCTGACGC GGCCGGCCCT GGCCGAGAGC
GAGGCGGAGG TGAGCGGCGT CATCGCCGAC ATCCGCGACC GGGAGCGGCT CTTCCACGTC
CTGCGCGCGT TCCGCCCCGA CGCGGTCTTC CACGCCGCCG CGCTCAAGCA GGTGCCCTAC
CTGGAGCGCG ACTGGACCGA GGGCATCAAG ACCAACGTGT TCGGCTCGGT GAACGTGGCC
GACGCGGCGC TGGCGGCGGG CGCCCGCGCG CTGGTGATGA TCTCGACCGA CAAGGCGATC
GAGCCGGTCT CGCAGCTCGG CGTCACCAAG CGCTTCGCCG AGATGTACGC GCAGGCCCTC
GACGCGGCCG GCGGGCCGGC GCGGCTCGTG GCGGTGCGCT TCGGCAACGT GCTCGGCTCG
GTCGGCTCGG TGGTGCCGGT GTTCAAGGCG CAGATCGCCC GCGGCGGGCC GGTCACGGTC
ACCCATCCCG AGATGGTGCG CTACTTCATG ACCGTGCGCG AGGCCTGCGA CCTCGTGCTC
ACCGCCGCCT CCCACGCCGA CCGCGAGGGT CGCGACCCGC GGGCGGGCGA CCAGCGCGCC
GCCGTCTACG TGCTGAAGAT GGGCCAGCCG GTGCGGATCC GCGACCTCGC CGAGCGCATG
ATCCGCCTCG CCGGCTTCGA GCCGGGCCTC GACATCGAGA TCGCCGTGAC GGGCGCGCGG
CCCGGGGAGC GCCTGAACGA GATCCTCTTC GCCCGCGACG AGCCGATGGT GACCCTCGAC
GGGATCGAGG GGGTCATGGC GGCCAAGCCC GTCTTCGCCG ACCGGGCGCA GCTCGCGCGC
TGGCTCGAGC GGCTGCGCGC GGCGGTGGCG CAGGCCGACC GCGCGGCGGC CGAGGCGGTG
TTCGCGGAGG CGGTGCCGGA TTTCGCCCGG CGGCCGGGCG CGGCGCGCGA GGCGCCGGCG
GCCGAACTGG CGCCGCGGGA TGTCGGGGCG GGCGGGGCGT GA
 
Protein sequence
MRRPTLKSAA IVAHDLVATA LAVTLTFLIR FDDARLAERL AHLPVLLGPF VLYAGLVYRW 
FGLYRTKWRF ASLPDLAAIV RAVAVLALSL LLLDYALVSP ALFGIYFFGK IAIGLYFVLQ
LFLLGGPRVA FRYLKYSRSR QSHARAATTP ALLLGRGAEI EVVLRAIEAG TVRKLAAQGI
LSPRAEDRGQ SLRGVPVLGA FADLEQVVAD LAARGVAVRR LVATPNALTP EADPDGLLAR
ARRLGLPLAR VTTLGEGLRD AELAPLEIED LLLRPTVPID RPRLERFLAG QRVVVTGGGG
SIGSEICARA VAFGASALLV VENSEPALHG VLTRPALAES EAEVSGVIAD IRDRERLFHV
LRAFRPDAVF HAAALKQVPY LERDWTEGIK TNVFGSVNVA DAALAAGARA LVMISTDKAI
EPVSQLGVTK RFAEMYAQAL DAAGGPARLV AVRFGNVLGS VGSVVPVFKA QIARGGPVTV
THPEMVRYFM TVREACDLVL TAASHADREG RDPRAGDQRA AVYVLKMGQP VRIRDLAERM
IRLAGFEPGL DIEIAVTGAR PGERLNEILF ARDEPMVTLD GIEGVMAAKP VFADRAQLAR
WLERLRAAVA QADRAAAEAV FAEAVPDFAR RPGAAREAPA AELAPRDVGA GGA
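
The sequences above are laid out in blocks of 10 characters, 60 per line, so the whitespace has to be removed before they can be used in downstream tools. The following is a minimal sketch of that cleanup together with a GC-fraction calculation, shown here on the first line of the gene sequence only. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers introduced for illustration, not part of any tool associated with this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as displayed (10-base blocks).
first_line = "ATGCGCAGAC CGACCCTCAA GAGCGCGGCG ATCGTCGCCC ACGACCTCGT GGCGACCGCC"

dna = clean_sequence(first_line)
print(len(dna))                    # number of bases after cleanup
print(round(gc_fraction(dna), 2))  # GC fraction of this fragment only
```

Applied to the full gene sequence, the same cleanup should yield a length that can be compared with the listed gene length. The listed GC content is rounded to a whole percent, so a computed value may differ slightly.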