Gene Mext_2300 details

Gene Information

Locus tag: Mext_2300
Symbol:
ID: 5835649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2548985
End bp: 2550271
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 74%
IMG OID: 641368099
Product: O-antigen polymerase
Protein accession: YP_001639766
Protein GI: 163851723
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
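
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(2548985, 2550271)
print(length)                  # 1287
print(protein_length(length))  # 428
```

Both results match the Gene Length and Protein Length fields of this record.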


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00840255
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.00785546
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGCAGA AACAAACGCC TAGAGCGTCG TCCTGGATCA GGTATCCAGG ACGACGCTCT 
AGCGCGGCGG CGCGGCTCTA CGCTGCCGGC GCCGTCGCGC TGGCGCTGGT GGCGCCGATG
ATGGCCCTCG CCAACCGGTC GAGTCCGCTT CTCGTCGGCG TGGCCGCGCT GCTGTTCCTG
GCGGGCACCG TCGCCGAGCG CGGCGGGCGC GCCGCCTCCG ACCTGATCAC CCCCCTGCGC
GCCCCGCTCG GCCTCGCCGC GCTGGCCTTC CTCGCCTGGT GTCTCGTCTC GCTGGCCTGG
AGCCCGTTTC CCGCGCTCTG GGGCCGTGTG CTGTCCGAAT TCCTGCCGAC GCTCGCCGCC
GCGGCGATCC TCGCCCGGCT CGCGCCGGCC CGGCTGCCGC CCTGGGCGCT GCCCCTCGGC
GCCGGCCTGC TCGCCGCAGC CTGCCTTTTC ATCGCGGCAA GCCTCGCCCT CGGGCTGGCG
CCGCAGGCCT GGCTCGGGCA GCGCGTGGCC CTGTTCATGT TCAACCGCCC GCTGCTGACG
GTGCTGCTGC TGGCCGGGCC CATCGCCGCC TTCCTCGCCC TGCGCGGCCA CCGCCTCGCC
GCCGTGATCC TGCTCGGGGT GACGGCGCTG GCGATCCTGC GCTCGATCAG CGGCGCGGCC
ATGCTCGGGC TGCTCGCGGG CGCTGTGATG TTCGCGGTCG GGCGCTTGGC GCCCCGATCC
GTGGCCCTGG CGCTCGCGGC GCTGACCCTC GGGCTCGCCT TCGCCCTTGC CCCGGTCGAG
GGCGACATCC TCCACCGGCT GATGCCGGAG GCGGCGCATG AGCGGCTGAC GCAGTCTTCG
TCGCGGGCGC GGGTCGCCAT CGCCCAGAGC TTCGCCGCCG CGGTGGCGCA GGCGCCCTGG
ATCGGCTCCG GCTACGGCAT GGGCCTGCGC TTCGCCGAGG TACCGGCGTC GCAAGCTCTC
GAGCCGGAGA TGCGGGCGAT GCTGGCCGTC GGCCATCCGC ATAACAGCTT CCTCCAGATC
TGGGCCGAAC TCGGCTTCGT CGGCGCGGCG CTCGCGGCTT TGGTCGCCTT CCTGGCCCTG
CGGGCGGCGG CCGCCTTGCC GCGGCTCCTG TTCGCCACGG CGCTCGGCTT GCTGGGCGCG
GCGGTGGCGG TGATGTTCGT CGAGCACGGC GCGTGGCAGG GATGGTGGAC GGCGGGCCTC
GGGGCCGCCA TCACATGGCT GCGGGCGGCG GCTTGCGCCA AGCCCCCATA CGAGCCCGCA
CACGAGAGTG AAGACGCGCG CGCATGA
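
The sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below does that and computes a GC fraction. The helper names are hypothetical, and only the final line of the sequence is used, to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


last_line = clean_sequence("CACGAGAGTG AAGACGCGCG CGCATGA")
print(len(last_line))
print(f"{gc_content(last_line):.1%}")
```

Applied to the full cleaned gene sequence, the same function should reproduce the GC content field up to rounding.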
 
Protein sequence
MMQKQTPRAS SWIRYPGRRS SAAARLYAAG AVALALVAPM MALANRSSPL LVGVAALLFL 
AGTVAERGGR AASDLITPLR APLGLAALAF LAWCLVSLAW SPFPALWGRV LSEFLPTLAA
AAILARLAPA RLPPWALPLG AGLLAAACLF IAASLALGLA PQAWLGQRVA LFMFNRPLLT
VLLLAGPIAA FLALRGHRLA AVILLGVTAL AILRSISGAA MLGLLAGAVM FAVGRLAPRS
VALALAALTL GLAFALAPVE GDILHRLMPE AAHERLTQSS SRARVAIAQS FAAAVAQAPW
IGSGYGMGLR FAEVPASQAL EPEMRAMLAV GHPHNSFLQI WAELGFVGAA LAALVAFLAL
RAAAALPRLL FATALGLLGA AVAVMFVEHG AWQGWWTAGL GAAITWLRAA ACAKPPYEPA
HESEDARA
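
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G) instead of relying on an external library. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as starts. Alternative start codons are not handled here, which is an acceptable simplification only because this gene begins with ATG. The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons in NCBI order: TTT, TTC, TTA, TTG, TCT, ...
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGATGCAGA AACAAACGCC TAGAGCGTCG TCCTGGATCA GGTATCCAGG ACGACGCTCT"
last_line = "CACGAGAGTG AAGACGCGCG CGCATGA"
print(translate(first_line))  # MMQKQTPRASSWIRYPGRRS
print(translate(last_line))   # HESEDARA
```

The two outputs match the first twenty and last eight residues of the protein sequence listed above. Translating the last line on its own works because the preceding 1260 bases are a whole number of codons.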