Gene Sbal223_3098 details


Gene Information

Locus tag: Sbal223_3098
Symbol: tyrA
ID: 7087876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3676562
End bp: 3677701
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 48%
IMG OID: 643461982
Product: bifunctional chorismate mutase/prephenate dehydrogenase
Protein accession: YP_002359006
Protein GI: 217974255
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0287] Prephenate dehydrogenase; [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01799] chorismate mutase domain of T-protein
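
The start, end, gene length and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3676562
end_bp = 3677701
gene_length_bp = 1140
protein_length_aa = 379

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is made of whole codons. The last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, remainder, coding_codons)  # prints: 1140 0 379
```

Under those assumptions the four fields are mutually consistent: 1140 bp is 380 codons, of which 379 encode amino acids.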


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000040953
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.11712
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATGAAA AAACAACGGC CGAATTAGAA CATCTTCGCG GTCTGATTGA TGGTGTCGAT 
CAGCAATTAT TGCATTTACT GCGTAAGCGC TTGGATCTCG TGGCGCAAGT GGGCACAGTG
AAACACGGCG CCGGTTTACC GATTTATGCA CCACAGCGTG AAGCCGCCAT GCTCGCTAAA
CGCCGCGAAG AAGCGAAAAA CATGGGGATT GCGCCGCAAT TAATTGAAGA TATTTTACGT
CGTCTGATGC GTGAGTCTTA TCTCAATGAA AAAGATGTTG GCTTTAAGCA AGTTAAAAAA
GATCTCGGCT CAGTGGTGAT TGTCGGCGGT AAAGGGCAAC TCGGTGGACT GTTTTCACAA
ATGCTGACCT TATCTGGCTA CCAAGTGAAT CTGCTCGATA AAGATGATTG GCAGCAAGCA
GATAGCCTAT TTGCCGATGC GGGCATGGTG TTAGTGACTG TGCCGATTGC GATTACTTGC
GAGCTTATTC GCGAAAAGCT GACCCAATTA CCAGCCGACT GTATTCTGGC GGATTTGACC
TCCATCAAGA CAGAGCCGGT TAAAGCCATG CTTGAGGCGC ATTCTGGTCC TGTCGTCGGT
TTCCATCCTA TGTTTGGTCC CGATGTGGGC AGTTTGGCGA AACAAGTTGT GGTGGTGTGC
CACGGTCGCT CGCCGGAGAA ATACCAATGG CTACTCGAGC AGATCGCTAT TTGGGGCGCG
CGGATTGTCG AAGCAGAGCC CGAACGTCAC GACAGTGCAA TGCAGTTAGT GCAGGCGATG
CGTCACTTCT CGACCTTTGT GTATGGTTTG AATCTGTGCA AGGAAGAGGC AGATATTGAT
ACTTTACTGC AATTTAGCTC GCCGATTTAC CGTTTAGAAT TGGCTATGGT AGGGCGCTTA
TTCGCCCAAA GCCCAGAGCT TTACGCCGAT ATTATTTTTG CCCAGCAAGA TAGCCAACAT
GCAATCGGTG ATTATTTAGA TAACTACCGT GAAGCGTTAG AGCTGCTAAA ACGCGGCGAC
AGGAACGAGT TTATTAAGCA GTTCCAAAGC GTCGCTAAAT GGTTTGGGGA TTTTGCCCCT
CAATTCCAGC GCGAAAGCCG TATTATGCTG CAATCGGTCA ATGATATGAA AACCAATTAA
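
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in 10-base blocks. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, for demonstration only.
first_line = "ATGAATGAAA AAACAACGGC CGAATTAGAA CATCTTCGCG GTCTGATTGA TGGTGTCGAT"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the whole gene sequence to `clean_sequence` and then to `gc_fraction` gives the gene-wide value for comparison with the reported percentage.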
 
Protein sequence
MNEKTTAELE HLRGLIDGVD QQLLHLLRKR LDLVAQVGTV KHGAGLPIYA PQREAAMLAK 
RREEAKNMGI APQLIEDILR RLMRESYLNE KDVGFKQVKK DLGSVVIVGG KGQLGGLFSQ
MLTLSGYQVN LLDKDDWQQA DSLFADAGMV LVTVPIAITC ELIREKLTQL PADCILADLT
SIKTEPVKAM LEAHSGPVVG FHPMFGPDVG SLAKQVVVVC HGRSPEKYQW LLEQIAIWGA
RIVEAEPERH DSAMQLVQAM RHFSTFVYGL NLCKEEADID TLLQFSSPIY RLELAMVGRL
FAQSPELYAD IIFAQQDSQH AIGDYLDNYR EALELLKRGD RNEFIKQFQS VAKWFGDFAP
QFQRESRIML QSVNDMKTN
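
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this for the first 30 bases. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (to my understanding the two differ only in which codons may act as start codons), and it does not handle alternative start codons, which is harmless here because the gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: amino acid assignments match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAATGAAA AAACAACGGC CGAATTAGAA"))  # prints: MNEKTTAELE
```

The result matches the first ten residues of the protein sequence listed above.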