Gene GM21_0990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0990 
Symbol 
ID8136311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1170097 
End bp1171176 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content62% 
IMG OID644868604 
Productchorismate mutase 
Protein accessionYP_003020813 
Protein GI253699624 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase 
TIGRFAM ID[TIGR01801] chorismate mutase domain of gram positive AroA protein
[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGAAA AGCAGACTCT GGAAAAACTC CGTCAGGAAA TAGATACCGT CGACGACCAT 
ATACTCGATC TGTTGAACCA GCGGGCCAAG CTCGTGATGG AGGTCGGCGC GGTCAAGACC
AAAAGCCACA GCGATTTCCA CGTCCCCAGC CGGGAGAGGG AGATCTACGA GCGCCTCACC
GCGGCGAACC CCGGGCCTTT CCCGTCGGAC GCCGTGCGCG GCGTGTTCCG CGAGATCATC
TCGGCTTCGC TCGCGCTGGA GAAACCGCTG AACGTCGCCT TCCTCGGACC AAGCGCCACC
TTCAGCCACC TGGCCGCCAT GCAGCACTTC GGCCTGTCGG CCTCGCTCTC CCCTGAGCGC
TCCATCCCCG CCGTCTTCGA GGCGGTGGAG AAGGGGGAGG CGTACTACGG CGTGGTGCCG
GTGGAGAACA CCACCGAGGG GATGATCTCC CACACGCTGG ACATGTTCAT GGAAAGCGAG
CTGAAGATCA ACGCCGAGGT GCTCCTGGAG GTCTCCCACT TCCTCCTTTC CCGCACCGGG
CGCTTCGAGG ACATCAAGAA GGTCTACTCG CACCCGCAGC CCCTGGCCCA GTGCCGCAAG
TGGCTCGCCG AGAACCTCCC CAACGTGCCG CTGGTCGACG TCGCCTCCAC TACGCTCGCG
GCGCAGATCG TGTCCGAGGA CTACACCGCC GCGGCCATCG CCAGCGAATA CGCCTCCTCC
ATCTACAACC TCAAGGTGGT CAAGGCCCGC ATCGAGGACC AGGTCAACAA CTTCACCCGC
TTTTTGGTCA TCGGCCGCAA GATGGCCGAC AAAAGTGGAG ACGACAAGAC CTCGCTCATG
TTCTCGGTCC GCGACGAGCC CGGCATCCTG CACCGGATGC TGGAGCCTTT CGCCAAGCGC
GGCATCAACC TCTCCAAAAT CGAATCCCGC CCCCTGAAGC GCAAGGCCTG GGAATACATC
TTCTACCTCG ACCTTTCCGG CCACATCTCC GATCCGGAGG TCGCCGAGGC GGTCAAGGAA
CTTTCCGTCT GCTGCCAATT CGTCAAGGTG TTGGGCTCCT ACCCCCGGGC GCGTTCATGA
 
Protein sequence
MKEKQTLEKL RQEIDTVDDH ILDLLNQRAK LVMEVGAVKT KSHSDFHVPS REREIYERLT 
AANPGPFPSD AVRGVFREII SASLALEKPL NVAFLGPSAT FSHLAAMQHF GLSASLSPER
SIPAVFEAVE KGEAYYGVVP VENTTEGMIS HTLDMFMESE LKINAEVLLE VSHFLLSRTG
RFEDIKKVYS HPQPLAQCRK WLAENLPNVP LVDVASTTLA AQIVSEDYTA AAIASEYASS
IYNLKVVKAR IEDQVNNFTR FLVIGRKMAD KSGDDKTSLM FSVRDEPGIL HRMLEPFAKR
GINLSKIESR PLKRKAWEYI FYLDLSGHIS DPEVAEAVKE LSVCCQFVKV LGSYPRARS