Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2081 |
Symbol | |
ID | 5137082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2237360 |
End bp | 2238430 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640533537 |
Product | hypothetical protein |
Protein accession | YP_001217997 |
Protein GI | 147674275 |
COG category | [R] General function prediction only |
COG ID | [COG0795] Predicted permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000000262997 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTAAAA TTCTTGATTG GTATATCGGC CGCACTATTG TTGCTACAAC AGCCTTGGTT TTGGTTACCT TTGTTGGCCT CTCTGGCATC ATCAAATATG TTGAGCAGCT CCGTAAAGTG GGTGAGGGAA GTTACGATTT ATTGCAAGCC TTGCTGTTTG TGGTGCTGAG TATTCCTCGT GATGTGGAAA TGTTTTTTCC GATGGCGGCT CTATTGGGCG CATTGATTGG TTTGGGGGCG TTAGCTTCCA GCTCTGAATT GGTGGTGATG CAGGCTGCTG GTTTTTCTAA GCTTGATATC GGCCTTTCCG TTCTTAAAAC CGCCATTCCG CTGATGATTA TTGTCACCTT GCTCGGAGAG TGGGGCGCAC CACAAGCGCA AAAAATGGCA CGTGATATGC GCGCATTTGC CACCTCGGGT GGTGCGATTA TGTCGGTGCG TACTGGGGTT TGGGCACGGG ATGCGAATGA TTTTATCTTT ATCGCCAAAG TTGAAAACGA ACAGTTGTAT GGATTGAATC TGTGGCGCTT TGACGAAAAT AAAAAACTGA GTACGGTGAT TTTTTCTGAG CAAGTCGATT ACGTTGCTAA CAATGAATGG CTGATGAAAG ATGCAGTATT GACACGTTTG GTGAATGACA TCGAGATCAG CAAAGAATCG TTGCCTGAGT ACCGTTGGCG AACCTCCCTT GCTCCAGACA AACTTGCCGT AGTGACGGTT AAGCCGGAAG AGCTATCTCT CACAGGCTTA AGCGATTACG TGCATTACTT AAAAGCATCA GAGCAAGACT CATCGCGTTA TGAGTTGGCT TTGTGGCGCA AAGTGACTCA GCCGATCTCG ATTGCGGTGA TGATGTTGAT GGCGTTGTCG TTCATTTTTG GCCCATTGCG TAGCGTCACC ATGGGGGCGC GGATTTTATC TGGCGTCATT GCCGGATTTA CTTTCTACAT CTCAAGTGAG TTTTTTGGCC CACTCAGTTT GGTGTATGGA TTACCGCCAT TGTTTGGTGC GCTAGCGCCA AGCTTAGTTT TCTTGGCCAT CGCGTTAGGG CTATTGGGTA GGAAGCTATA A
|
Protein sequence | MFKILDWYIG RTIVATTALV LVTFVGLSGI IKYVEQLRKV GEGSYDLLQA LLFVVLSIPR DVEMFFPMAA LLGALIGLGA LASSSELVVM QAAGFSKLDI GLSVLKTAIP LMIIVTLLGE WGAPQAQKMA RDMRAFATSG GAIMSVRTGV WARDANDFIF IAKVENEQLY GLNLWRFDEN KKLSTVIFSE QVDYVANNEW LMKDAVLTRL VNDIEISKES LPEYRWRTSL APDKLAVVTV KPEELSLTGL SDYVHYLKAS EQDSSRYELA LWRKVTQPIS IAVMMLMALS FIFGPLRSVT MGARILSGVI AGFTFYISSE FFGPLSLVYG LPPLFGALAP SLVFLAIALG LLGRKL
|
| |