Gene M446_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4035 
Symbol 
ID6132843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4501783 
End bp4502817 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content63% 
IMG OID641644192 
Productglycosyl transferase family protein 
Protein accessionYP_001770832 
Protein GI170742177 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGTA GGGATCTTAG CCATATTCCA GGCAATAATT CAACTAATAA TCCGCACATA 
TCGGTCGTAA TTTCCGTGAA GAACCGCTTC TTTCTGCTGA AGGACTGCCT GGAAGCTTTG
CTCAGGCAGA CGATTGGCGT CGAGAGCTTC GAAGTCATTG TCGTCGACAA CGTGTCTCAG
GACGATATCG CCGGTCTTTG CACGGCGATG CGGGCCCAGG GTCTGCAGCT CCGCTACCTG
CGCATGCAGC ACGACAAGGG ACCGGCCCCG GCGCGCAACC GAGGCGTGCT TGAGGCGCGG
GCCCCGCTGA TCGCCTTCAC GGACAGCGAT TGCCGGCCCC ATCCCGAATG GCTCGCCCTT
GGCATCGCCG CCCTGGCCGA CCCGGCGGTC GCGTTCTCGA CGGGCCCGGT CCTGCCCAAG
CCGGAGCAAA CGGCATCGCT CTGCTCCAAA CTCACGTTCG TCACGGCGCA GGAGCACCCG
ACCTTTCCGA CCGCCAACAT GGTCGTGCGG AAAAGCGTGT TCGACGCGTT CGGCGGCTTC
GACGAAACCC TCTCGTTCCG TGACCCGCTC AACCGGGCGA CGGAATGTGC CGATACCGAT
CTCGCCTGGC GCATCATCGA GGCCGGCTAC ACCCGCCGCT TCGAGCCGCG CGCCGTCATC
TGGCACGAGA TCGAACAGCA ATCGCTTCTC CAATGGATTC TCGAGCCGAC ACGATTGTTT
CTGGTTCCTG CCCTAGTCAA GCGGCATCCG GAACTCAGGA GACGTCTCCT CGTCGCGCGC
CTGTTCTTCT ATCCTCCGAT ATGGCTGCTT TACCTCGCGG TATGCGTGGC GGCGTTCGCC
GTCATCTGGC AGCCGCTGCT GCTGCTCGTG CTGCCGCCGG CACTGCTCGC GCGGGGCATC
CATCGCACCG GCTCCGTGGA CCCGCGGCAG CTCGCAGCCC ACGCCGGGCG TGTGATTGCT
CACCTGCCGC GGATGGTCGT CATGATAACA TCCCTGCTTT ATGGAAGCAT TCGCTACCGC
GCACTCGTTC TATGA
 
Protein sequence
MDGRDLSHIP GNNSTNNPHI SVVISVKNRF FLLKDCLEAL LRQTIGVESF EVIVVDNVSQ 
DDIAGLCTAM RAQGLQLRYL RMQHDKGPAP ARNRGVLEAR APLIAFTDSD CRPHPEWLAL
GIAALADPAV AFSTGPVLPK PEQTASLCSK LTFVTAQEHP TFPTANMVVR KSVFDAFGGF
DETLSFRDPL NRATECADTD LAWRIIEAGY TRRFEPRAVI WHEIEQQSLL QWILEPTRLF
LVPALVKRHP ELRRRLLVAR LFFYPPIWLL YLAVCVAAFA VIWQPLLLLV LPPALLARGI
HRTGSVDPRQ LAAHAGRVIA HLPRMVVMIT SLLYGSIRYR ALVL