Gene Shewmr4_2833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2833 
SymboltyrA 
ID4253404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3388014 
End bp3389153 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content49% 
IMG OID638119468 
Productbifunctional chorismate mutase/prephenate dehydrogenase 
Protein accessionYP_734961 
Protein GI113971168 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01799] chorismate mutase domain of T-protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00724041 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAA AAACCACAAC TGAATTAGAA CACCTTCGAG GTCTCATCGA TGGTGTCGAC 
CAACAATTGC TGCATTTACT ACGTAAACGC TTAGATTTAG TCGCTCAGGT GGGAACGGTA
AAACACGCCG CAGGCCTGCC AATTTATGCG CCGCAACGCG AAGCGGCTAT GCTGGCAAAA
CGCCGCGAAG AAGCCAAGAA CATGGGCATA GCGCCACAAC TGATTGAAGA TATTTTGCGC
CGCTTGATGC GTGAATCCTA TCTCAACGAG AAGGATGTCG GCTTTAAGCA AGTAAAAAAC
GATCTCGGTT CAGTCGTGAT TGTTGGTGGT AAGGGTCAGC TTGGTGGACT GTTCCAACAA
ATGCTGACGC TCTCGGGTTA TCAGGTTAAA GTGCTTGATA AAGACGACTG GCAGCAGGCG
GAAACCTTAT TTGCCGACGC CGGATTGGTA CTGGTGACTG TGCCTATCGC CATCACCTGC
GACATTATCC GTGAGAAACT GACCCAATTA CCGCAGGAAT GTATCTTAGC CGACTTAACC
TCAATCAAGA CTGAACCTAT GAATGCCATG TTGGCCGCTC ACAAGGGGCC TGTTGTCGGC
TTTCATCCCA TGTTTGGCCC AGATGTCGGC AGTTTGGCTA AGCAGGTGGT GGTGGTGTGC
CATGGCCGCG AAGCCGATAA ATACCAATGG TTGCTCGAGC AAATTGGAAT TTGGGGCGCA
CGGATTGTAG AAGCTGAGCC TGAACGTCAC GACAATGCGA TGCAATTGGT ACAGGCGATG
CGCCACTTCT CGACCTTTGT GTATGGCTTG AACCTTTGCA AAGAAGAAGC GGATATTGAA
ACCCTGCTGC AATTTAGCTC ACCTATCTAT CGCTTAGAAC TCGCCATGGT CGGGCGCTTG
TTTGCCCAAA GCCCGGAGCT TTATGCCGAT ATTATTTTTG CCCAGCAGGA TAGCCAGCAT
GCGATTGGCG ATTATTTGGA TAACTATCGC GAAGCATTAG AGCTATTAAA GCGGGGCGAT
AGGGACGCGT TTATCAGCCA GTTCCAAACG GTAGCAAAAT GGTTTGGTGA TTTTGCTCCT
CAGTTTCAGC GTGAAAGTCG CATGATGCTG CAATCGGTCA GTGATATGAA AACGAACTGA
 
Protein sequence
MNEKTTTELE HLRGLIDGVD QQLLHLLRKR LDLVAQVGTV KHAAGLPIYA PQREAAMLAK 
RREEAKNMGI APQLIEDILR RLMRESYLNE KDVGFKQVKN DLGSVVIVGG KGQLGGLFQQ
MLTLSGYQVK VLDKDDWQQA ETLFADAGLV LVTVPIAITC DIIREKLTQL PQECILADLT
SIKTEPMNAM LAAHKGPVVG FHPMFGPDVG SLAKQVVVVC HGREADKYQW LLEQIGIWGA
RIVEAEPERH DNAMQLVQAM RHFSTFVYGL NLCKEEADIE TLLQFSSPIY RLELAMVGRL
FAQSPELYAD IIFAQQDSQH AIGDYLDNYR EALELLKRGD RDAFISQFQT VAKWFGDFAP
QFQRESRMML QSVSDMKTN