Gene Mext_1623 details


Gene Information

Locus tag: Mext_1623
Symbol:
ID: 5834894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1808487
End bp: 1809716
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 73%
IMG OID: 641367421
Product: glycosyl transferase group 1
Protein accession: YP_001639093
Protein GI: 163851050
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
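
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 1230 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(1808487, 1809716)
print(length, expected_protein_length(length))  # prints: 1230 409
```

Both values agree with the Gene Length and Protein Length fields of this record.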


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0129932
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGAG GCAGCCTGTC CGGCGGCCCG GTGCAGTTGT CCGAGGCTTC GCCGTTGGCG 
GGCGTCACGG TGCTGCAGAT CATCCCGGCG CTTGAGGCGG GGGGCGCCGA GCGCACCACC
GTCGACGTCG CGGCCGCCCT CGCCGAGGCC GGCGCGCGGC CGCTGGTAGC CACGGAGGGC
GGGCGGCTCG TTGGCGAGTT GCAGGCCAAG GGCGGGATCT GGGTGCCGTT TCCCGCCAAC
ACCAAGAACC CGTTCGCCAT GGCGCTCAAC GTCGAGCGCC TCGCCCGGCT CTGCCGCCGC
GAGAACGTAC AGATCCTGCA CGCCCGCTCC CGCGCTCCGG CCTGGGTCGC GCTCGGCGCC
GCGCGCCGGC TGAAGCTGCC CTTCGTGACG ACCTATCACG GCAGCTATTC GGGCCGGACC
AGCGTCAAGG TCCTGTACAA TTCGGTGATG GCGCGGGGCG ACGTCGTGAT CGCCAACTCG
CACTACACCG CCGACCTGAT CCGCCGGACC CATCCCGACC AAGCCGGCGG CCGGATCAGC
GTGATCCACC GCGGCACGGA TCTGGCGGCG TTCACGCCCT CGGCGGTCGC GGCGGCACGG
GTCGAAAGCC TGCGCCGGGC CTGGAACGTG GCACCGCACG AGCGGGTCGT GCTGCTCGCC
GCCCGGCTCA CCGCCTGGAA GGGCCAGCGG GTGCTGATCG AGGCCGCCGC GCGCCTGCGC
GATCTCGGCC TCACCGACTT CGCCGTCGTG CTCGCGGGCG ATCCGCAGGG ACGCACCGCC
TATGAGCGCG AACTCGACGC GCTGATCGAG ACACGCGGCC TGTCGGGCAT CGTGCGCCGG
GTCGGCCATT GCACCGACAT GCCGGCGGCC TTCCGCGCGG CCTCCGTCGT CGCGGTCCCC
TCGGTGGAGC CGGAAGCGTT CGGCCGCTCG GCGGTCGAGG CGCAGGCGCT CGGCATTCCG
GTGGTCGTCT CCGATCTCGG TGCCGTGCCC GAGACCGTGC TGGCGCCCCC CGATGTCGAG
CCCGGCCAGC GCACCGGCTG GCGGGTGCCG CCCGGCGATG CCGCGGCTCT GGCCGAGGCG
TTGAAGGACG CGCTCTCCCT CGGCGCCAGC GCCCGCGACG GCCTCGCGCG CCGGGCGCAG
GCCCATGTCG AGGCGAATTT CTCGCTCGAT CGCATGATCG AGGGCACCCT GAACGTCTAC
GCCGACCTTC TGAACCGAGC CAAAACGTGA
 
Protein sequence
MSGGSLSGGP VQLSEASPLA GVTVLQIIPA LEAGGAERTT VDVAAALAEA GARPLVATEG 
GRLVGELQAK GGIWVPFPAN TKNPFAMALN VERLARLCRR ENVQILHARS RAPAWVALGA
ARRLKLPFVT TYHGSYSGRT SVKVLYNSVM ARGDVVIANS HYTADLIRRT HPDQAGGRIS
VIHRGTDLAA FTPSAVAAAR VESLRRAWNV APHERVVLLA ARLTAWKGQR VLIEAAARLR
DLGLTDFAVV LAGDPQGRTA YERELDALIE TRGLSGIVRR VGHCTDMPAA FRAASVVAVP
SVEPEAFGRS AVEAQALGIP VVVSDLGAVP ETVLAPPDVE PGQRTGWRVP PGDAAALAEA
LKDALSLGAS ARDGLARRAQ AHVEANFSLD RMIEGTLNVY ADLLNRAKT