Gene VC0395_A0227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0227 
SymboltyrA 
ID5137799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp235397 
End bp236524 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content49% 
IMG OID640531687 
Productbifunctional chorismate mutase/prephenate dehydrogenase 
Protein accessionYP_001216190 
Protein GI147674187 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01799] chorismate mutase domain of T-protein 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGTAG AGTTGAATCA GTTACGCGAC CAAATCGATG AAGTCGATAA GCAGATGGTG 
GAGCTACTGG CGCGCCGTCT GGCATTGGTG GAGCAGGTCG GGCAAGTGAA AAGTCGATAT
GGGTTACCGA TTTATGCTCC CGATCGTGAA GCGGCAATGC TCGCTTCACG TCGAGCGGAA
GCGGAAAGCA AAGGTGTTCC GCCACAACTG ATTGAAGATA TTTTGCGCCG AACCATGCGT
GAATCCTACG CCAGTGAAAA GGACTCCGGC TTTAAATGCC TCAATCCTGA GTTACGTTCC
GTGGTGATCA TCGGTGGTAA CGGTCAGCTT GGGCGACTAT TTGGCCGTAT GTTTAAACTC
TCTGGCTATC AAGTCAAAGT GCTGGGTAGC CAAGATTGGG ACAAAGCGGA TGAACTGCTC
AGTGATGCTG GCTTAGTGAT AGTCACGGTA CCTATCCATT TGACGCTCGG TGTTATCGAA
AAGCTGCGCC AGTTGCCGGA CGATTGCATT TTGTGCGATC TCACCTCAAT CAAAGCCAAG
CCGCTTGCCG CTATGCTACA AGTGCACAAA GGTCCAGTGG TTGGGCTGCA CCCTATGTTT
GGCCCTGATG TTCCAAGCCT GGCGAAGCAG GTGATTGTTT ACTGTGATGG TCGAGGCAAT
GAACACTACC AATGGCTCTT GCAACAGTTT GCTATTTGGG GTGCAAGCTT GTGTCAGATT
GATGCGACTG AACATGATCG TGGTATGACG CTTATTCAAG CTCTGCGCCA CTTCACTTCC
TTTGCTTATG GCTTGCATCT GACCAAAGAG AACCCGAACT TGGCACAACT GCTGAAACTC
AGTTCACCGA TTTACCGTTT AGAGCTTGCT ATGGTCGGAC GGCTATTTGG GCAAGATCCC
CATCTATACG GCGATATTAT TCTCTCATCA CCAGAAAATA TTGAGATGAT CCAGCGTTTT
CATCGCTGCT TAAGCGAGGC GGTTGAGTTG GTGAGCGCGG GCGATAAGGC GAGTTTTGTG
GCTCAATTTG AACGAGTTAG CCAGTGGTTT GGTGATTATT CACAGCAGTT TATGCATGAG
AGCCAAAACT TGCTCAAACA AGCGAATGAT GCGATCCACA GAGGTTAA
 
Protein sequence
MAVELNQLRD QIDEVDKQMV ELLARRLALV EQVGQVKSRY GLPIYAPDRE AAMLASRRAE 
AESKGVPPQL IEDILRRTMR ESYASEKDSG FKCLNPELRS VVIIGGNGQL GRLFGRMFKL
SGYQVKVLGS QDWDKADELL SDAGLVIVTV PIHLTLGVIE KLRQLPDDCI LCDLTSIKAK
PLAAMLQVHK GPVVGLHPMF GPDVPSLAKQ VIVYCDGRGN EHYQWLLQQF AIWGASLCQI
DATEHDRGMT LIQALRHFTS FAYGLHLTKE NPNLAQLLKL SSPIYRLELA MVGRLFGQDP
HLYGDIILSS PENIEMIQRF HRCLSEAVEL VSAGDKASFV AQFERVSQWF GDYSQQFMHE
SQNLLKQAND AIHRG