Gene VC0395_A0796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0796 
SymboltrpE 
ID5135227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp805797 
End bp807368 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content51% 
IMG OID640532254 
Productanthranilate synthase component I 
Protein accessionYP_001216746 
Protein GI147673289 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGG CCATCAACAT AAAGAAAACC GCGCCGCTTG AGGTGCTACA CAGTGAATTG 
CCTTATACGC AAGATCCTAC GGCTTTATTT CATGCTCTTT GCGCTGGGCG CAGCGACTGC
CTACTACTGG AATCTGCAGA AATCGACTCC AAGCAAAATT TGAAGAGCCT TCTGTTAGTG
GATGCTGCCG TGCGCATTGT GTGTGAAGGC CATCAAGTGA CCTATCATGC GCTCAGTGCA
AACGGCCAAG CGCTGCTCAA CATCATTCAT AGCAACCTCA CTGATCGGAT TCCTTGCAAA
GTAGAGAAAG CGAAACTTAC CCTCACCTTC TCTACACCCT GTGATACGTT GGATGAAGAC
TCACGATTGC GCGAAGCGTC TTCTTTCGAT GCGCTACGTT TAGTGCAGCA CAGTTTTGAT
CTCACTGACC ACGGTAAATT TGCGCTGTTT TTAGGTGGCT TATTTGCCTA CGATTTAGTG
GCTAACTTTG AACCGCTCGG TGAAGCACCT GCCGACAATC AATGCCCTGA TTACGTATTT
TATGTAGCAG AAACTCTGAT GGTGATTGAT CATCAGCGTG AAACTTGCCA ACTGCAAGCC
ACCCAATTCC AACCGGGCGA TGCGCTGCAC AGCCAACTCA AAAGCCGGAT GCGTGAAATT
CGTGCGCAAG TGAATCAAAA ATTGCCTTTG CCGAGTGCGC AATCTTTATC TGATGTGGAA
GTGACCACCA ATATTAGCGA TGCAGCATTT TGCGATATCG TGCGTGACCT CAAGCAGTAC
GTGGTCAAAG GCGATGTGTT CCAAGTTGTG CCTTCGCGCC GTTTTCGCTT ACCTTGCCCT
TCACCACTCG CAGCTTATCA ACGGCTGAAA CAGAGTAACC CAAGCCCTTA CATGTTCTAC
ATGCAAGATG AACGCTTTAC CCTGTTTGGC GCATCCCCCG AAAGCGCACT CAAGTATGAA
ATGCACACCA ACCAAGTGGA AATCTACCCG ATTGCAGGGA CTCGCCGCCG CGGTAAGCGC
GCCGATGGCA GCATCGATTT TGACCTCGAT AGCCGCATTG AGCTTGAACT GCGCACCGAT
AAAAAAGAGA ACGCCGAACA CATGATGCTG GTTGACTTAG CACGCAACGA TGTCGCGCGC
ATTAGCCAAG CCGGTACTCG CCATGTCGCT GACTTGCTGC AAGTAGATCG CTACAGCCAT
GTGATGCACT TGGTGTCGCG CGTGGTGGGT CAGTTACGTG AAGATCTGGA TGCGCTGCAT
GCTTATCAAG CTTGCATGAA CATGGGCACG CTGACTGGCG CACCGAAAAT TCGCGCGATG
CAGTTAATCC GCGATGTGGA ACAAGCGCGT CGCGGCAGCT ACGGCGGCGC GGTGGGTTAT
CTCACGGGTG AAGGCGATTT GGATACCTGT ATCGTGATCC GTTCTGCTTA TGTGGAAAAC
GGCATCGCCC AAGTCCAAGC TGGCGCGGGT GTCGTTTACG ACTCCGACCC ACAAGCCGAA
GCCGATGAAA CGCGCGGCAA GGCGCAAGCG GTAATCTCCG CTATTTTATA TGCTCATCAA
GGGAAGGAAT GA
 
Protein sequence
MNKAINIKKT APLEVLHSEL PYTQDPTALF HALCAGRSDC LLLESAEIDS KQNLKSLLLV 
DAAVRIVCEG HQVTYHALSA NGQALLNIIH SNLTDRIPCK VEKAKLTLTF STPCDTLDED
SRLREASSFD ALRLVQHSFD LTDHGKFALF LGGLFAYDLV ANFEPLGEAP ADNQCPDYVF
YVAETLMVID HQRETCQLQA TQFQPGDALH SQLKSRMREI RAQVNQKLPL PSAQSLSDVE
VTTNISDAAF CDIVRDLKQY VVKGDVFQVV PSRRFRLPCP SPLAAYQRLK QSNPSPYMFY
MQDERFTLFG ASPESALKYE MHTNQVEIYP IAGTRRRGKR ADGSIDFDLD SRIELELRTD
KKENAEHMML VDLARNDVAR ISQAGTRHVA DLLQVDRYSH VMHLVSRVVG QLREDLDALH
AYQACMNMGT LTGAPKIRAM QLIRDVEQAR RGSYGGAVGY LTGEGDLDTC IVIRSAYVEN
GIAQVQAGAG VVYDSDPQAE ADETRGKAQA VISAILYAHQ GKE