Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2351 |
Symbol | |
ID | 5136845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2503295 |
End bp | 2504602 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640533803 |
Product | protein HipA |
Protein accession | YP_001218251 |
Protein GI | 147674647 |
COG category | [R] General function prediction only |
COG ID | [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes |
TIGRFAM ID | [TIGR03071] HipA N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCATAATC CTATACTTTA CGCACATGCC ACAAATAACA ACATGCTTGT GGGGCAGTTT GTCCAACAGT CGAAGACCAA CTTATCATTT TCATACTCTG ATGAGTGGTT AGCATATGAT TCAGCCTTTC CGCTTTCGTT AAGCCTGCCG CTCGTAAAGG GAGAATGTGC ATCATTTTAC GCACTGAATT TTATTCACAA CCTGCTTCCT GACTTGAAAG AAGAACGATT CAGTCTTGCT CAGTCGGTGG GTGTGCAATC CAACGATGTA TTCACTTTAC TCTCAAAAGT TGGTCACGAT TGCACGGGCG GCATTTCATT TACTGAGAGT AGGGAGCCGC CAAAGATAGG GTGGAAATAT CGAGAGATCT CGGCAAGTGA ATTAAATGAA CTCGTTACTC AGCGTAAGTC ATTTCTTCCA TGTTTTGGTG ACTACAGGCC ATGTATATCT GGTACACAAA GAAAAACCAC GCTGATGAAA ACAAATGGTA GGTGGTATGT CCCACAGGGA ACCGCGTTCA GTAGTCACAT AGTTAAGTAT CCGATGGATG TGATTACCCA AAGTAACTCC GTGCTAGATA TGAGTAGCAG CATTGAGAAT GAATTCATCT GTACTCAAAT AGCAAAGGAA CTTGGTTTTA AAGTCCCCGA CATTGAGATA ATAACCGCAG AATCAGGAAC TAAAGCCCTT GCAGTAAAAC GGTTCGACCG ATGTTTTGGT GATGGAGTTG TGACTCGTAG ACACCAAGAA GATTTCTGCC AGATTTTGGG TGTACCTGAA CATCAAAAAT ATCAATCGGA GAACAACCTA AGTGTCTCCA AAATTGTTGA TGTTTTAAGT CTCTCTGCGC AAAGCAAGGC GAACAATCAT GACTTTTTTA AATTCATGAT TTTACAGTGC CTTCTTGGTG CCACTGACGG TCATCTCAAG AATTTTTCCG TGCATATTGA TACTGGCGGA TACTATAAAC TTGCACCGTT CTACGATTTG CTATCAGCTT ACCCTGCTGT CAGTGCTACA GGGTTAAATA AGCGCAAGCT AAAACTAGCA ATGGGACTGC AAGCATCACG AGGATACAAA TACCATATCA GCAAGATTTG CTTACGGCAT ATAGAGCAGA CAGCGAGTCA GTTCGGTATC AGTAATGCTG AATGCCATGA AATCTTCTAT GCCTTTCTCG CTCAATTTAG TAGCGCACTG AGTTCTATAG ACAAAAGATT CCCTGGACAG GAGTTTGCGT TGGTAAAGGA TGCAATATTT CAACATGCCA CTGAAATCGT TGAAAAGTTA AACAGAACAA TTAAGTAA
|
Protein sequence | MHNPILYAHA TNNNMLVGQF VQQSKTNLSF SYSDEWLAYD SAFPLSLSLP LVKGECASFY ALNFIHNLLP DLKEERFSLA QSVGVQSNDV FTLLSKVGHD CTGGISFTES REPPKIGWKY REISASELNE LVTQRKSFLP CFGDYRPCIS GTQRKTTLMK TNGRWYVPQG TAFSSHIVKY PMDVITQSNS VLDMSSSIEN EFICTQIAKE LGFKVPDIEI ITAESGTKAL AVKRFDRCFG DGVVTRRHQE DFCQILGVPE HQKYQSENNL SVSKIVDVLS LSAQSKANNH DFFKFMILQC LLGATDGHLK NFSVHIDTGG YYKLAPFYDL LSAYPAVSAT GLNKRKLKLA MGLQASRGYK YHISKICLRH IEQTASQFGI SNAECHEIFY AFLAQFSSAL SSIDKRFPGQ EFALVKDAIF QHATEIVEKL NRTIK
|
| |