Gene VC0395_A0235 details


Gene Information

Locus tag: VC0395_A0235
Symbol: pheA
ID: 5136282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 245605
End bp: 246780
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 47%
IMG OID: 640531695
Product: chorismate mutase/prephenate dehydratase
Protein accession: YP_001216198
Protein GI: 147674498
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1
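
The start, end, gene length, and protein length fields in this record are consistent with one another, and the arithmetic can be checked directly. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(245605, 246780)
print(length)                  # 1176
print(protein_length(length))  # 391
```

Both values match the Gene Length and Protein Length fields of the record.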


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGACA AACAATACTC ACTCGACGAT ATTCGCTTAC GGCTAAACGA ACTCGACGAT 
CAACTGCTAA ACCTGCTGTC AGAACGACGC AAAATGAGTA TTGAGGTCGC CAAAAGCAAA
GTCGAAACCG CGAAACCTGT ACGTGATCCG GCTCGTGAAC AGCAGCTACT GGTAAAACTC
ATCAATGCTG GCAAAGAGAA ATATCAGCTC GATCCTCAAT ACATCACCAA AATTTTCCAC
ACCATCATTG AAGATTCGGT TTTGCTTCAG CAATCCTATC TGCAAAACCT TGCGAATCCA
CAAAGTCGTA AACCATTGGC TAGAGTCGCC TTTTTAGGCG CTAAAGGCTC TTATTCACAT
CTGGCGACTC GCGAGTATTT CAGCCGCAAA AATACAGAGC TGATTGAGCT CAACTGCGAC
CATTTTAAAG AGGTGGCAAG AACCGTGGAA TCCGGCCATG CGGATTATGG TGTGCTACCG
ATTGAAAACA CCAGCTCAGG CTCCATCAAC GAAGTTTACG ATTTGCTGCA GCACACCACA
CTGTACATAG TGGGTGAGTT AACGCAGCCA ATAGAGCATT GCCTGGTGGC CACGCAAGAG
ATTCGTTTGG AGGATCTGAA AGTCCTCTAT TCCCATCCAC AACCTCACCA GCAGTGCAGC
GAATTTCTTA GCCGCTTAAA AGGGGTCAAG TTAGAAAGTT GCGCCAGTAC TGCAGATGCC
ATGAAAAAAG TGCAAGAGCT CAATCGTGCG GATGTAGCAG CGATTGGCAA CTCAGCCAGC
GGAAAACTGT ACGGACTGCA ACCGATTCAA GGTAATATTG CCAACCAAAC CGAAAATCAC
ACTCGCTTTA TCGTGGTAGC TCGTAAACCC GTCGAAGTTT CACCACAAAT TCCAGCCAAA
ACCACCTTGA TTATGTCAAC TTCACAAGAG GCGGGCTCGC TGGTTTCAAC CTTACTCGTG
CTGCAACGTT ACGGCATTAA TATGACTAAG CTGGAATCGC GTCCGATTAT GGGTAATCCG
TGGGAAGAAA TGTTCTACGT AGATTTAGAA GCGCACATTG ACTCCGATGA GATGCAGCAA
GCGTTGGCAG AACTCACTCA ACTGACTCGA CACCTCAAAG TGCTCGGCTG CTACCCTAGT
GAAAACGTCA AACCCACTCA GGTGAAATTC ATTTAG
 
Protein sequence
MTDKQYSLDD IRLRLNELDD QLLNLLSERR KMSIEVAKSK VETAKPVRDP AREQQLLVKL 
INAGKEKYQL DPQYITKIFH TIIEDSVLLQ QSYLQNLANP QSRKPLARVA FLGAKGSYSH
LATREYFSRK NTELIELNCD HFKEVARTVE SGHADYGVLP IENTSSGSIN EVYDLLQHTT
LYIVGELTQP IEHCLVATQE IRLEDLKVLY SHPQPHQQCS EFLSRLKGVK LESCASTADA
MKKVQELNRA DVAAIGNSAS GKLYGLQPIQ GNIANQTENH TRFIVVARKP VEVSPQIPAK
TTLIMSTSQE AGSLVSTLLV LQRYGINMTK LESRPIMGNP WEEMFYVDLE AHIDSDEMQQ
ALAELTQLTR HLKVLGCYPS ENVKPTQVKF I
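
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It hard-codes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs only in which start codons it permits). The helper names are hypothetical and not part of any database tool.

```python
# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGACAGACA AACAATACTC ACTCGACGAT"))  # MTDKQYSLDD
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.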