Gene Mext_3301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3301 
Symbol 
ID5831444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3660314 
End bp3661273 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content72% 
IMG OID641369101 
Productglycosyl transferase family protein 
Protein accessionYP_001640759 
Protein GI163852716 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.904037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.657687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCGA CCCCCGTACC CTCGCCCTCT GCGTCGGTCC TCGACGTCGT CATCGTCAAC 
TGGAATGCCG GGGACCAGCT CCGGGCCTGC CTCGCGAGCC TCGCCGCGAG CGAGGGGGCG
GAGCACCTAC GGGTCGTCGT CGTCGACAAC GCCTCCTCCG ACGGTTCGGC GGAGGGGCTG
GATCAGCCCG GCCTCGCACT CACGGTGCTG CGCAACGCAG ACAATCGCGG CTTCGCTCGT
GCCTGCAACC AGGGCGCGGC TTTGGGCTCG GCCGCGGCCA TCCTGTTTCT CAACCCCGAT
ACGGGGGTGA GCCGGGACGG TATCGCCGCC GCCCGCGCAC GGCTCGACGC CGATCCCGGC
ACCGGCATCG TCGGCGCCCG GCTGGTCGAT GACGCCGGGC AGACGCACCG CACCTGTGCC
CGCCACCCGA CGGGCGCGCG CCTGATCGCA CACACCCTGT TCCTCGACCG GTTGCTGCCC
GGCCGCGTCG CGCCGCACTT CCTGCTCGAT TGGGACCATG CTGAGACGCG GGCCGTCGAT
GCGGTGATGG GCGCCTTCCT GATGATCCGG CGCCCGCTCT TCGCTCGGCT CGGCGGGTTC
GACGAGCGCT TCTTCGTCTA CTGGGAGGAT GCCGACCTCT GTGCCCGTGC CGCCGCCGCC
GGCTTTGCGG TGTGTCACGT CGCAGAGGCC GAGATCCGCC ACCGCGGCCA GGGCACCACC
GAGGCGGTGA AGGACCGACG CCTGTTCTAC TTCCTGCGGG CGCAGACGCT CTACGCGCAC
AAGCATCACG GCCGGGCGGT GTCCCTCGCG GTGCTGGCGG CCGCGCTGGC CGTGAACCTA
CCCGTCCGCC TCGGCCGGGC GCTCGTGCGC GGTTCGGGCG GAGACGCAGG TGCGGTGATC
CGCGCAGGGC TTATGCTGAT ACGGGCCGTG CCGCGCCTGT TGACCGGCAG CGGTCGATGA
 
Protein sequence
MTATPVPSPS ASVLDVVIVN WNAGDQLRAC LASLAASEGA EHLRVVVVDN ASSDGSAEGL 
DQPGLALTVL RNADNRGFAR ACNQGAALGS AAAILFLNPD TGVSRDGIAA ARARLDADPG
TGIVGARLVD DAGQTHRTCA RHPTGARLIA HTLFLDRLLP GRVAPHFLLD WDHAETRAVD
AVMGAFLMIR RPLFARLGGF DERFFVYWED ADLCARAAAA GFAVCHVAEA EIRHRGQGTT
EAVKDRRLFY FLRAQTLYAH KHHGRAVSLA VLAAALAVNL PVRLGRALVR GSGGDAGAVI
RAGLMLIRAV PRLLTGSGR