Gene Mext_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4067 
Symbol 
ID5831646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4523172 
End bp4524335 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content69% 
IMG OID641369858 
Producthypothetical protein 
Protein accessionYP_001641508 
Protein GI163853465 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.39928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGCTC GGATACGCTC GCTTCGGACC GCTCTACTGG TTTCGGCCTT GGCGCTGACC 
CCGCTCGCCG CCGGCCTCGC TCCGATCGCG GCCATCGCCG CGCCGGCCGC CGCCTATCCC
GGCCAGCAGG AGGGCGACCA CGTCGTCGAG AACTTCAAAT TCGCGAGCGG CGAGAGCCTG
GATCGGGTCA AGCTGCACTA CACCACCCTC GGCACGCCGC ATCGCGGCGC GGACGGCGAG
ATCGACAACG CGGTGCTCGT CCTGCACGGC ACCACCGGCA CGGGCAAGAG CTTCCTGATC
CCGACGCTCG GGCCGGAGCT GTTCGGCGAA GGCGCGCCGC TCGACGCGCG GCGCTGGTAC
GTGATCCTGC CCGACGGGCT CGGCCGCGGC GGCTCCTCGA AACCGTCCGA CGGCTTCAAG
GCGCATTTCC CCCGCTACGG CTACGGCGAC GTCGTGGAGG GCCAGCACCG GGTCGTCACC
GAGGCGCTCG GCGTCAAGCA TCTGCGCCTC GTGCTCGGCA CCTCCATGGG CGGGATGCAG
GCCTGGATGT GGGGCGAGCG CTATCCCGGC GAGATGGACC TGCTGATGGC GGTGGCGAGC
CAGCCGATCC CGGTGAGCGG GCGCAACGCC CTGTGGCGGC GCCTCCTGAT CGAGGGCATC
CGCACCGATC CCGACTGGAA GGACGGCGAG TACACCGCGC AGCCGCGCAG CTTCGGCCGC
ATCCTGCCGA TCTTCAACAT CATGACCGAG AGCGTGCTCG GCCTTCAGAA GCAGGCCCCG
ACCCGCGCGG CGGCCGACAC GGCCTACGAC AAGATGATCG CCGGCTACGA GAACAAGGCC
GACGCCAACG ATTGGCTGTA CTGGTTCGAT TCCTCCTACG ATTACGACCC CTCGCCGGAC
CTCGAAAAGA TCACCGCGAA GGTGCTCGCG GTGAACTTCG CCGATGACGA GCTGAACCCG
CCCCAGCTCG ACGTGATGAA CGCAGCGCTG GCGCGGGTGA AGGACGGCCG CTTCGTGCTG
GTCCCGACCT CGCCCGAGAC GCACGGCCAT CAATCCCTGC GCTTCGCGGG CCTGTGGAAG
GGCTACCTCG CCGAATTCGT GAGACAGCCC GAGGCGACGA CGGAGAAGGA GAGCTCGTCG
GAGCGGCCGG AGGGCAGCCG GTAG
 
Protein sequence
MGARIRSLRT ALLVSALALT PLAAGLAPIA AIAAPAAAYP GQQEGDHVVE NFKFASGESL 
DRVKLHYTTL GTPHRGADGE IDNAVLVLHG TTGTGKSFLI PTLGPELFGE GAPLDARRWY
VILPDGLGRG GSSKPSDGFK AHFPRYGYGD VVEGQHRVVT EALGVKHLRL VLGTSMGGMQ
AWMWGERYPG EMDLLMAVAS QPIPVSGRNA LWRRLLIEGI RTDPDWKDGE YTAQPRSFGR
ILPIFNIMTE SVLGLQKQAP TRAAADTAYD KMIAGYENKA DANDWLYWFD SSYDYDPSPD
LEKITAKVLA VNFADDELNP PQLDVMNAAL ARVKDGRFVL VPTSPETHGH QSLRFAGLWK
GYLAEFVRQP EATTEKESSS ERPEGSR