Gene VC0395_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_1117 
SymboltnaA 
ID5134703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp1081033 
End bp1082451 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content48% 
IMG OID640531439 
Producttryptophanase 
Protein accessionYP_001215953 
Protein GI147671520 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3033] Tryptophanase 
TIGRFAM ID[TIGR02617] tryptophanase, leader peptide-associated 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATT TTAAACACTT ACCAGAACCT TTTCGTATTC GTGTTATTGA ACCTGTTAAA 
CGTACCACAC GGGAATATCG TGAAAAGGCG ATTTTAAATG CCGGAATGAA CCCCTTTTTG
CTCGATAGTG AAGATGTATT TATTGACCTG CTCACCGACA GCGGCACTGG CGCTATCACG
CAGGAGATGC AAGCCGCCAT GTTCCGTGGT GATGAAGCCT ACAGCGGCAG CCGCAGCTAC
CACGCACTCG CGAGAGCGGT TAAAGATATT TTTGGTTATG AATACACGAT TCCAACCCAC
CAAGGTCGTG GTGCAGAGCA GATTTATATT CCTGTTTTGA TTAAAAAGCG TGAAAAAGAG
AAAGGACTCG ATCGCAGTAA AATGGTCGCC CTATCTAACT ACTTTTTCGA TACCACTCAA
GGCCATACCC AAATCAACTG CTGTGTTGCT AAAAACGTGT ACACCGAAGA GGCATTTGAT
ACCGGTGTTA AAGCCGATTT TAAAGGTAAC TTCGACTTAG AAAAGCTCGA ACAAGCCATC
CTTGAAGCGG GCCCGGCAAA CGTCCCATAT ATTGTCAGCA CCATCACTTG TAACTCTGCG
GGTGGCCAGC CGGTTTCGAT CGCCAACTTA AAAGCCGTGT ATGAGATTGC CCAGCGTTAC
GACATTCCCG TGATCATGGA TTCTGCTCGT TTTGCTGAAA ATGCGTATTT TATTCAGCAA
CGTGAGCGCG ATTACCGCAA CTGGAGTATC GAAGAGATCA CGCGTGAAGC TTACAAATAC
GCCGATGGAC TCGCGATGTC GGCCAAAAAA GATGCCATGG TGCAAATGGG CGGTTTACTC
TGCTTCAAAG ACGAAAGCTT CTTTGACGTA TACACCGAAT GCCGAACCCT GTGTTTGGTG
CAAGAAGGCT TCCCTACATA CGGTGGCTTA GAAGGCGGTG CGATGGAGCG GTTGGCGGTT
GGCTTATATG ACGGTATGCG CCAAGATTGG CTCGCTTATC GCATTAACCA AGTGGAGTAT
CTGGTCAATG GTTTAGAAGC GATTGGGGTT ATTTGCCAAC AAGCTGGCGG CCATGCTGCG
TTTGTCGATG CGGGTAAACT GCTGCCTCAC ATCCCAGCAG ATCAATTCCC TGCTCACGCT
TTAGCTTGTG AACTCTATAA AGTCGCAGGC ATTCGCGCAG TAGAAATTGG TTCACTCCTG
CTTGGCCGTG ATCCTGCAAC CGGAAAACAG CATCCTTGCC CAGCCGAATT GCTCCGTTTA
ACCATTCCAC GCGCGACTTA TACGCAAACA CACATGGATT TCATCATCGA AGCATTTGAG
AAGGTGAAAG CCAATGCTCG TAACGTCAAA GGATTGGAGT TTACTTACGA GCCACCCGTG
CTACGCCACT TTACGGCTCG CTTAAAAGAA AAAGCCTAA
 
Protein sequence
MENFKHLPEP FRIRVIEPVK RTTREYREKA ILNAGMNPFL LDSEDVFIDL LTDSGTGAIT 
QEMQAAMFRG DEAYSGSRSY HALARAVKDI FGYEYTIPTH QGRGAEQIYI PVLIKKREKE
KGLDRSKMVA LSNYFFDTTQ GHTQINCCVA KNVYTEEAFD TGVKADFKGN FDLEKLEQAI
LEAGPANVPY IVSTITCNSA GGQPVSIANL KAVYEIAQRY DIPVIMDSAR FAENAYFIQQ
RERDYRNWSI EEITREAYKY ADGLAMSAKK DAMVQMGGLL CFKDESFFDV YTECRTLCLV
QEGFPTYGGL EGGAMERLAV GLYDGMRQDW LAYRINQVEY LVNGLEAIGV ICQQAGGHAA
FVDAGKLLPH IPADQFPAHA LACELYKVAG IRAVEIGSLL LGRDPATGKQ HPCPAELLRL
TIPRATYTQT HMDFIIEAFE KVKANARNVK GLEFTYEPPV LRHFTARLKE KA