Gene Mext_0202 details


Gene Information

Locus tag: Mext_0202
Symbol:
ID: 5831844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 217798
End bp: 218946
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 71%
IMG OID: 641365987
Product: hypothetical protein
Protein accession: YP_001637699
Protein GI: 163849656
COG category: [R] General function prediction only
COG ID: [COG4671] Predicted glycosyl transferase
TIGRFAM ID:
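
The coordinate and length fields in this record are consistent with each other, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 217798
end_bp = 218946

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1149
print(protein_length_aa)  # 382
```

Both results match the Gene Length and Protein Length fields listed in the record.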


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.45206
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGC CGATTGCGTT TTTTGTCCAT CATCAGGGCC GGGGCCATGC CAACCGCACC 
ATGGCGGTGG CCGCCGAGTT CGCCCGCGAC CGTCCGGTCT CGGTGCTGAC CGCCGGCCCG
CACCTGTTCG ACGGATTTTC CCGCGACATC GAGATCGTGA CGCTGCCGAA CATGATCGGC
GCGGCGGTGC CGACCCCGCG CCTCTACGCG GAGCCGACGC CGCCGGTGAT GCACTGCGTG
CCGCTGGGGC TCGCCGAAAT GCGTCGCACC ATGCGCCAGA TCCTCGACCA TCTCGACGAG
CGCGCGGCCG GTCTGTTCGT GGTCGACGTG TCGGCGGAGA TCGCGATGCT CGCGCGCATC
GCCAGTGTCC CCGCGGTCCA GATCCGCATG CACGGCGACC GCAACGACAT CGCCCATCTC
GGCGCCTACG AGGCCTGCGT CGGAATGCTC GCCCCCTTCG ACGAACGGCT GGAGCAGGAC
GACTACCCGG CGCATCTGCG CGACAAGACG TTCTATAGCG GCGGGCTCTG CACCAGCGTC
GATCGCGTGC CGGATCGTGC CGAGGCGCGG GCCCGTCTCG GCCTCGACCC GCAGCGCGAG
ATCGTCGTCG CGGTCACCGG CGGCGGGGGA AGCGGCACGC CCTACGCGCC GCTGACGGTC
GCCGCCCGCG CCGCGCCCGA CGCACTCTGG CTGACTCTGG GGCCGACCCA CCGCGAAGGC
CATGAGACCG ACTTCGCCAA CCTGCGCGAA CTCGGCTGGG TGCCGTCGGT CACCGACTAT
CTCGCGGCGG CCGACATCGT GGTCGCCTCG GCGGGCGACA ACACGGTGCA CGAAGTCGCG
CGCGTGGCGG GGCGCCTGAT CGTCATGCCG GAATGGCGCT ATTTCGGCGA GCAGGCCCGC
AAGGCCGAGG CTTTGGTCCG CTTCGGCGCC GCCGTGCAGG CGCCCCATTG GCCCGGCGAC
TTTCACGGAT GGCGCGATCT TCTCGACCGC GCCCGCAGCC TCGACGGGAC CATCCTGCGC
AGCCTCTACG CACCGGACGC CGCCACGCGC GCGGCCGGTT GGCTCGAAGG GCTCACCGAC
GCGCTCTGGC AGGGCGGATC GGCCGTGCAG GAGCCGGACG CCACGCCGCT GCGCGTCGTC
GCCGGCTGA
 
Protein sequence
MKKPIAFFVH HQGRGHANRT MAVAAEFARD RPVSVLTAGP HLFDGFSRDI EIVTLPNMIG 
AAVPTPRLYA EPTPPVMHCV PLGLAEMRRT MRQILDHLDE RAAGLFVVDV SAEIAMLARI
ASVPAVQIRM HGDRNDIAHL GAYEACVGML APFDERLEQD DYPAHLRDKT FYSGGLCTSV
DRVPDRAEAR ARLGLDPQRE IVVAVTGGGG SGTPYAPLTV AARAAPDALW LTLGPTHREG
HETDFANLRE LGWVPSVTDY LAAADIVVAS AGDNTVHEVA RVAGRLIVMP EWRYFGEQAR
KAEALVRFGA AVQAPHWPGD FHGWRDLLDR ARSLDGTILR SLYAPDAATR AAGWLEGLTD
ALWQGGSAVQ EPDATPLRVV AG
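
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons, and it ignores alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any database tool.

```python
# Build the codon table from the conventional TCAG ordering of the
# standard genetic code (assumed here to apply to translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is removed
    before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAAGAAGC CGATTGCGTT TTTTGTCCAT"))  # MKKPIAFFVH
print(translate("GCCGGCTGA"))                         # AG
```

The two outputs correspond to the first ten and last two residues of the protein sequence shown in this record.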