Gene Information |
Locus tag | VC0395_0101 |
Symbol | |
ID | 5134958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | - |
Start bp | 110558 |
End bp | 112432 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640530424 |
Product | hypothetical protein |
Protein accession | YP_001214942 |
Protein GI | 147672238 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
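The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 110558
end_bp = 112432
reported_gene_length = 1875      # bp
reported_protein_length = 624    # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is a whole number of codons; the final codon is the stop
# (assumption), so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # 1875 624
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows.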
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.517413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAAGC GTGATGAGCA AGAATCCCAG TTTAGTGAAA TGATTGAAGC GCAACTGTCT AGACGCCATT TTTTGGCGGG CAGTGCGGCA GTGAGTGCCG GCGCGTTTTT AGCCTTAAAT CCGGTTGCGA GTGCGGTGGC TGCGCCTGCC ACATCAAACC TACTGAATTT CTCAGCAATC CCTGTGTCAA CAGAAGATAA AGTGATTGTG CCTAAAGGCT ATAAAGCCAC TCCACTCATG TCTTGGGGCG ATCCTATTTT TGCGAATGCG CCAGAATTTG ACCAAAGCGG CAAGCAAGAC TCAAAAGCAC AAGAGAAGCA GTTTGGCGAT AATACGGATG GCATGAGCTT TTTCCCGATC AGCGAAGATC GCGGTGTGCT CGCGATCAAC AATGAATACA CCAATTACGA GTATCTGTTT GATCATCAAG GTAAAGCGAT GACGGCGGAT GACGTACGTA AAGCGCAAGC GGCAGTCGGT GTCACCATTG TTGAAGTGGT ACGTAAAAAT GGTCAGTGGA TGGTAGACCG TCAAGGTGAA CGTAATCGTC GGATCACCGC TTATACACCA ATGATGATGA CCGGTCCTGC TGCAGGCCAT GATCTGCTGA AAACGGCTGA AGATCCTAGC GGATTAAAAG TGCTGGGTAC TTTCAATAAC TGTGCGAATG GTGAAACTCC TTGGGGCACT TACCTCACTT GTGAAGAAAA CTTCGATGAT TTCTTTGGTG CGGACCAAGA AGGCAGTGTC GATGCTGATC AGAAGCGTTA CGGAATTGCT GCTGAACCTA GTGATTACCA ATGGCATAAG CACGATGCGC GTTTCGACAT AACCAAGAAC CCTAAAGAGC CAAACCGCTT TGGTTGGGTT GTGGAAATTG ATCCACATAA TCCGAACTCT ACACCACTGA AACGTACCGC TCTTGGTCGT TTTAAACATG AAAATGCGGC ACTGGTGATT AATAATGATG GCCATGTGGT GGTTTATCTT GGTGATGATG AACGCGGCGA ACATCTGTAT AAATTCGTTT CCAAGCATCG CTATCAAGCC GGTAATGATC AGCAAAACCG TAATCTGCTG GAAGAGGGCA CCTTGTATGT CGCTAAGTTC GATATCAATG AAAACGAACT GAAAGGCAGC GGACGCTGGA TGGAGCTGAG CTTTGGCAAG AATGGCCTCA CTCCTGAAAA CGGATTTAAA GATCAAGCCG AAGTGCTGAT TTTTGCTCGT CGTGCGGCGA CCCAAGTGGG CGCAACGACC ATGGATCGAC CAGAATGGGT GGCCGTGCAT CCTGATAAAA AGCATGTGTT CTGTACGCTC ACCAACAACA AAAACCGTGG CAAAGAAGGT CAACCTGTTG GTGGCCCGAA TCCACGCGAG AAGAATAACT ACGGGCAGAT CGTTCGTTGG ATGCCAGCAC AGGGTGATCA CACCAGTGAT GTGTTCGCTT GGGACCTCTA CTTAATTGCA GGTAATCCAA CCGTTCACAA AGGCACCCTG TACGCAGGTA GCGAAAACAT TTCGGCTGAC AATATGTTTA ACAGCCCTGA TGGGATTGGT TTTGACACTG CAGGTCGCTT GTGGATCCAA ACCGATGGTA ACTACTCTAA CCAAGGTGAT TTTGCTGGGC AGGGGAATAA CCAAATGCTG TGTGGTGACC CAATCACAGG TGAAGTGAAA CGCTTCTTAA CAGGTCCTAT TGCGTGTGAG ATCACGGGAT TAACCTTCAG CCCTGATCAC AAAACCATGT TTGTTGGTGT TCAGCATCCG GGTGAAGAGG CTGCTCCATC GCATTTCCCT 
TACGGTGGTA CAAGCAAACC TCGTTCAACC ATCATGATGA TCACCCGTGA AGACGGCGGT GTCATCGGCG CATAA
|
Protein sequence | MWKRDEQESQ FSEMIEAQLS RRHFLAGSAA VSAGAFLALN PVASAVAAPA TSNLLNFSAI PVSTEDKVIV PKGYKATPLM SWGDPIFANA PEFDQSGKQD SKAQEKQFGD NTDGMSFFPI SEDRGVLAIN NEYTNYEYLF DHQGKAMTAD DVRKAQAAVG VTIVEVVRKN GQWMVDRQGE RNRRITAYTP MMMTGPAAGH DLLKTAEDPS GLKVLGTFNN CANGETPWGT YLTCEENFDD FFGADQEGSV DADQKRYGIA AEPSDYQWHK HDARFDITKN PKEPNRFGWV VEIDPHNPNS TPLKRTALGR FKHENAALVI NNDGHVVVYL GDDERGEHLY KFVSKHRYQA GNDQQNRNLL EEGTLYVAKF DINENELKGS GRWMELSFGK NGLTPENGFK DQAEVLIFAR RAATQVGATT MDRPEWVAVH PDKKHVFCTL TNNKNRGKEG QPVGGPNPRE KNNYGQIVRW MPAQGDHTSD VFAWDLYLIA GNPTVHKGTL YAGSENISAD NMFNSPDGIG FDTAGRLWIQ TDGNYSNQGD FAGQGNNQML CGDPITGEVK RFLTGPIACE ITGLTFSPDH KTMFVGVQHP GEEAAPSHFP YGGTSKPRST IMMITREDGG VIGA
|
| |
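The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is shown on the coding strand (5' to 3'), and it uses the amino acid assignments of NCBI translation table 11, which match the standard code. Alternative start codons permitted by table 11 are not handled. The names `translate`, `first_30` and `last_15` are hypothetical.

```python
# Amino acid assignments for translation table 11 (same as the standard code),
# with the 64 codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases of the gene sequence as displayed in the record.
first_30 = "ATGTGGAAGC GTGATGAGCA AGAATCCCAG"
last_15 = "GTCATCGGCG CATAA"

print(translate(first_30))  # MWKRDEQESQ
print(translate(last_15))   # VIGA*
```

The two results correspond to the first ten and last four residues of the protein sequence, followed by the stop codon. Because `translate` strips whitespace, the 10-base grouping and line wrapping of the displayed sequence do not need to be removed by hand.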