Gene Spea_1059 details


Gene Information

Locus tag: Spea_1059
Symbol:
ID: 5661458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella pealeana ATCC 700345
Kingdom: Bacteria
Replicon accession: NC_009901
Strand:
Start bp: 1282497
End bp: 1284470
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 45%
IMG OID: 641235605
Product: chorismate mutase
Protein accession: YP_001500921
Protein GI: 157960887
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
        [COG1605] Chorismate mutase
        [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1
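
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1282497
end_bp = 1284470
protein_length_aa = 657

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which adds no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)                                # 1974
print(remainder)                                     # 0
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the computed span matches the listed gene length of 1974 bp, and 658 codons minus one stop codon gives the listed 657 aa.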


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.850871
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAC CACAACCCTT AAACCACACT CGAGAGCAGA TCACTAGCCT CGATAATGAA 
CTACTGGCCC TGCTTGCTAA GCGCCGTGAA TTAAGTTTAG ATGTCGCCCG CAGTAAAGAG
GTTGATGTTA GGCCTATACG CGATACCATT CGAGAAAAAG AGCTGTTATC TCGCTTGGTG
AAACAAGGCC GAGAACAAGG TTTAGATGCC CACTACGTTA TATCTCTTTA TCAAAGCATT
ATCGAAGACT CTGTACTTAA CCAACAAGCC TATTTGCATG GCCGTGCAAA CCCAGAAACT
CAACAGCAAC AATATTGTAT CGCCTACTTA GGGGCGCGCG GCTCTTATTC ATATCTTGCA
GCGAGTCGTT ACTGCGATAG ACGCCAAGTA GAGATGCAAG ATTTAGGCTG CCAAAGTTTT
GATGAAATCG TTCAGGCCGT CGAATCTGGT CACGCGGATT ATGGTTTCCT ACCCATTGAG
AACACTTCAT CAGGCTCAAT CAACGAGGTT TATGATGTGT TACAGCACAC CAGCCTTGCC
ATTGTCGGTG AAACCACTAT TGAAGTTGGC CATTGTTTAT TAGCAAAGAG CGGCAGCAGC
ATTAACGATA TCAAAACCGT TTACGCCCAC CCACAGCCAA TAAGTCAGTG CAGTCGATAC
CTGAGCCAAC ATGGTGAATT CAAACTCGAG TATTGCTCAA GCAGCGCAGA AGCGATGGAG
ATGGTATGCA ATGCTAATGA TAACAGCGTG GCTGCTATCG GCAGCGCTGA AGGTGGCGCT
CTATATCAAT TAGAAGCCGT TGAAAGTGGA CTTGCCAATC AAAAAATCAA TCAAAGCCGC
TTTATTGTTG TTGCAAGAAA AGCTGTTGAA GTGCCTTCTC AGCTTCCAGC TAAGTGCACA
CTGATTATGG CGACAGGGCA AAAGCCTGGC GCATTGGTTG AAGCCTTACT CGTACTTAAA
GCCCGAAACC TTAATATGAG CAAACTCGAG TCGCGTCCAA TTCCTGGTAC CCCATGGGAA
GAGATGTTCT ATCTCGATAT TGATGCCAAC TTGTCGAGTG AGCCAATGCA GGCAGCGCTG
AAAGAGTTAG AAAGAACCAC TCGCTTTATC AAGGTGCTTG GCTGCTACCC ATGTGAAACG
GTTAAACCAA CTCAACTAAG TAATAGTCAA TTAATGATAG AGCCTGACAC TTCTAAGTCG
ACCACGGTTA CCGCTACGCC GCCAAACAAA CAGATGCGCG TGAGTAAGCA ATACAAGAGT
GAGTCAACTC AAGTGATTTG CGGCCAACTG AGCCTTGGTA ATGGTAACTT AGGAGCTATA
GCTCAAGTAC AACTGCCACT CGATTTACAG CAATTTGAAC AAATGGCAAA AGAGCTTAAA
GAGTCTGGAT TCCAGGCGGT TGTTATTCAG GGCTTGAGTC AGCAGAGTGA TCTTATTAGC
TCGATAGAAA AGTTTAAGCA TGCCCTAGGC CAATTCAATC TAGTTTGTGT GTTAGCTATT
GAGCACGAAA CAGATTTAAG CATGGCAATT CAATTTGCCG ATGTGGTGAT GCTTGCAGGT
AACTTGATGT ATAACCAAGC CATACTAAGC CAACTCGGTA GCTTACTGGT CCCGGTCATT
TTAGAGCGTA ATGCAATGGC AAGTGTAGAT GACTTGCTTA ATGCCGCTGA AGAAGTTTTA
AGTCAGGGCA ATCAGCAGCT TATACTTTGC GAGTCAGGCG TAACCACACT TAACAATTCA
GGCAAACCAT CGCTAGATTT GGCCGCATTA GTTGAGTTAA AAGCCCTAAG TCACTTACCG
GTTGTTGTTA ACCCAAGTTA TGCCATTGAT ACCGATCAGC TAGCAACATT TGCTAAAGCA
ATTAAGCAGC TTAAAGGTGA TGGTGTCATG ATGAACCTAA GCGCGATAGA TACAGCGGCT
CAGGCTCATC CAGAGGAACT CGTTAAAGCA GTATTAGGTA ATTTGTACCA CTAG
 
Protein sequence
MSKPQPLNHT REQITSLDNE LLALLAKRRE LSLDVARSKE VDVRPIRDTI REKELLSRLV 
KQGREQGLDA HYVISLYQSI IEDSVLNQQA YLHGRANPET QQQQYCIAYL GARGSYSYLA
ASRYCDRRQV EMQDLGCQSF DEIVQAVESG HADYGFLPIE NTSSGSINEV YDVLQHTSLA
IVGETTIEVG HCLLAKSGSS INDIKTVYAH PQPISQCSRY LSQHGEFKLE YCSSSAEAME
MVCNANDNSV AAIGSAEGGA LYQLEAVESG LANQKINQSR FIVVARKAVE VPSQLPAKCT
LIMATGQKPG ALVEALLVLK ARNLNMSKLE SRPIPGTPWE EMFYLDIDAN LSSEPMQAAL
KELERTTRFI KVLGCYPCET VKPTQLSNSQ LMIEPDTSKS TTVTATPPNK QMRVSKQYKS
ESTQVICGQL SLGNGNLGAI AQVQLPLDLQ QFEQMAKELK ESGFQAVVIQ GLSQQSDLIS
SIEKFKHALG QFNLVCVLAI EHETDLSMAI QFADVVMLAG NLMYNQAILS QLGSLLVPVI
LERNAMASVD DLLNAAEEVL SQGNQQLILC ESGVTTLNNS GKPSLDLAAL VELKALSHLP
VVVNPSYAID TDQLATFAKA IKQLKGDGVM MNLSAIDTAA QAHPEELVKA VLGNLYH
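
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may serve as starts, so a standard codon table is enough here. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any database tool, and only the first and last 10-base-grouped lines of the gene sequence are used.

```python
from itertools import product

# Standard codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both begin in frame
# because every full line holds 60 bases (20 codons).
first_line = "ATGAGTAAAC CACAACCCTT AAACCACACT CGAGAGCAGA TCACTAGCCT CGATAATGAA"
last_line = "CAGGCTCATC CAGAGGAACT CGTTAAAGCA GTATTAGGTA ATTTGTACCA CTAG"

print(translate(first_line))  # MSKPQPLNHTREQITSLDNE
print(translate(last_line))   # QAHPEELVKAVLGNLYH
```

The two outputs correspond to the first 20 and last 17 residues of the protein sequence listed above, with the terminal TAG read as the stop codon.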