Gene M446_4883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4883 
Symbol 
ID6132651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5362721 
End bp5364682 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content78% 
IMG OID641645019 
Productglycosyl transferase family protein 
Protein accessionYP_001771646 
Protein GI170742991 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.830469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0228705 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCCGC CCCCTGTTCG CCTCGAACCC CACGACGACC CCGATTTCGC GCGCCGCCTG 
GTCCTGCCGG GCGCGCCCGG CGGGCGCTGG CTCTCGCTCA CCTACGCGGC CGACCGCTTC
GCCCCGGCGC CGCGGCCGAT GCTGCGCTTC CTCGGCCCCG ACCCGTCGGA GGCGCTGCTG
CCCGGGCCGG TGCTCGGCCG CGCCGCGTGG CTCGGGCCGG TGCCGCCGGG CACGACGGCG
ATCCTGCTCG CCGCCCCGCC CGCGGGGTTC CGGGTCGAGG CGGCGCGGCT CCTCGGGCTC
CCGGCCGTGC TCGCCCGCGC GGCGCGGCGG CGGCCCGCGA CCCTCGCGGT CGCGCTGCTC
CGCGCCGCCC AGCGCGACGG GCGGCGCTTC CGCAACACCC TGCGCATCGC CGCGGCGGTG
ACGCCGCTCT CCGCCTACCC GGCCTGGCGG GCCGAGCGCG CCCGGCCCTT CGAGCCCGCC
CTCGACCGCC TCGTCCCGGT GCCGCGGCGG ATCGGCCTCC TGCTCGCGGC CGCGCCCGGC
GAGGAGGCCG CGCTGCGCGC GAGCCTGTCC TCGCTCCTCG CCCAGAGCCA CCGCGACTGG
CGCCTCGCCC TGTCCTGGCG CGGCGCGCCG CCGGCCGGGC TGCCGCGCGA TCCCCGGATC
GGCGAGGGCG CGGCCTTCCC GGAGGAGGTC GAGGCCGTGG GCCTCCTGCA CCCGGGCGAC
GTGCTCGCCC CGGACGCCCT CGCCCAGCTC GCCCGCGCCC TCGACGGCGC CGACCTCGCC
TACGCGGACG AGGAGGTCGA GACGCCCTCC GGCCTCGCGC CGCGCCTGAA GCCGGCCTGG
AGCCCCGACC TCGCCCTCGC GACCGGCTAT CCGGGCCGGC CGATGCTGCT CGCCCGCGAC
CTCCTCGCGC GCTGCGGCTG GGAGGAGGCG CAGGGGACGC GCGCGCTGGC CCCCGCCGCC
TTCCTGGCCG TCGCGCCCGA GCGGGTGCGC CACGTCCCCC GCATCCTGTG CCGCCGGCCG
GCCGGCCCCG AGGCGGCGGA GCCCGGATTG CTGCGCGAGG CGTTGCGGCG CGCCGGCTCC
CCCGCCCGGG TGCGCGAGGA GGGCGGCGCC CCCGATCTGC TCTGGCCGCT CCCCGATCCG
CCGCCCCTCG CCTCGGTGGT GATCCCGACC CGGGACCGGC CGGACCTGAT CCGCACCGCC
GTGCGCGGCG TGCTGGAGGA GACCGATTAC CCGGCCCTCG ACCTCGTCCT CGTCGATAAC
GGCTCGGTCG ATCCCGCCGT CCACGCCTTC TACGCGGACC TGTCGGGCGA TCCCCGGGTG
CGCCGGATCG ACCGGCCCGG CCCCTTCAAC TTCTCCAGCC TGGTCAATGC CGGGGTGGCG
GCCGCGCGCG GCAGCGTCGT GGTGCTCCTC AACAACGACG TGGCGGTGCG CGACCCGCGC
TGGCTCGCCG AGATGGTCCG CCTCGCCGTG CGGCCCGGGA TCGGCGCCGT CGGGGCCAAG
CTCCTCTACG GCAACGGGCG CCTGCAGCAT GCCGGCGTGG TCGTCGGCCT CGGCGGGCGG
GCCGGCCACA CGCTGCGCAA CCGGCCCGCC GACACGCCCG GCCATCTCGG GCGGCTCCGC
GTGACCCACG AGGTGGCGGG CGTCACCGCG GCCTGCCTCG CGGTGACGCG CGCCGGTTTC
GAGCGGGTCG GCGGATTCGA CGAGGCCGCC TTCGCGGTCG ATTTCAACGA CATCGACTTC
TGCCTGCGCC TGCGGGAGGC GGGGTTGCGC AACCTCTGGT GCCCGCACGC GGTGCTCGAC
CACCACGAAT CGGTGAGCCG CGGCCCCTCG GTCGGGCCGG CGCGGGCGCG GTTCGAGGCG
GAGGCGGCCC GCTTCACGGC GCGCTGGCGG GCGGTCATCC GCGACGACCC GTACTATCAT
CCGGCCTTCT CGGTCACGAC CTTCGGGGAG GATCTGGAGT GA
 
Protein sequence
MSPPPVRLEP HDDPDFARRL VLPGAPGGRW LSLTYAADRF APAPRPMLRF LGPDPSEALL 
PGPVLGRAAW LGPVPPGTTA ILLAAPPAGF RVEAARLLGL PAVLARAARR RPATLAVALL
RAAQRDGRRF RNTLRIAAAV TPLSAYPAWR AERARPFEPA LDRLVPVPRR IGLLLAAAPG
EEAALRASLS SLLAQSHRDW RLALSWRGAP PAGLPRDPRI GEGAAFPEEV EAVGLLHPGD
VLAPDALAQL ARALDGADLA YADEEVETPS GLAPRLKPAW SPDLALATGY PGRPMLLARD
LLARCGWEEA QGTRALAPAA FLAVAPERVR HVPRILCRRP AGPEAAEPGL LREALRRAGS
PARVREEGGA PDLLWPLPDP PPLASVVIPT RDRPDLIRTA VRGVLEETDY PALDLVLVDN
GSVDPAVHAF YADLSGDPRV RRIDRPGPFN FSSLVNAGVA AARGSVVVLL NNDVAVRDPR
WLAEMVRLAV RPGIGAVGAK LLYGNGRLQH AGVVVGLGGR AGHTLRNRPA DTPGHLGRLR
VTHEVAGVTA ACLAVTRAGF ERVGGFDEAA FAVDFNDIDF CLRLREAGLR NLWCPHAVLD
HHESVSRGPS VGPARARFEA EAARFTARWR AVIRDDPYYH PAFSVTTFGE DLE