Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_1084 |
Symbol | |
ID | 5134533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | - |
Start bp | 1052396 |
End bp | 1053880 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640531407 |
Product | hypothetical protein |
Protein accession | YP_001215921 |
Protein GI | 147672196 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.387482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACACCA TTATGCCGTT ACCCACTCAT TTTTATACTA CGCAGCAGCT CAAACAAGGC GAACAAGATG CCGCGAGTGA GCGAGGTCTT GAGCTCTTTC ATTTAATGGA ACGCGCAGGA CAAGCGGTAT TCACCATCGC TTTTGCTCAG TATCCCACCA GCCACCACTG GTTAATTTGT TGCGGTGGCG GAAATAATGG TGGGGACGGT TATATTGTCG CGGTATTAGC CAGACATATG GGGATTGATG TTACCGTATG GCAGTTAGGC GATCCTGAAA AACTGCCAGC CGATGCCCAT CGTGCTTATC AGCAATGGAA AGAGTTGGGT GGTGCGGTCT ACGCTCCACA GTCTGAAGTG CCCGAATCGA CGGATGTGAT TATTGATGCG CTGTTTGGCA TTGGTTTAAA AGAGGTGTTA CGCCCGCAGG TGGTACCGCT CGTCGAGTTA CTCAACCAAA GTGGCAAGCC GATTGTGGCG GTCGATGTGC CTTCAGGGCT GTGTGCCGAT ACCGGTCAAG TGATGGGGAC ATGCATTAAA GCGCAGCATA CGGTGAGTTT GATTGGATTA AAACAAGGCT TAGTGACTGG CCAAGCCCGC TGTTATGTTG GAACGTTACA CTATGCCGGG CTTGGGGTTG AAGAAGTGTT TGCCCAGCAC AATACGCCAT CCTTAGTCTC CATCGATGGT AAGCTAAGGC ACAGTTTATT GCCGCCACGT CAAGCTTGTA CTCACAAAGG CCAAAATGGC AAAGCTTTGA TCGTTGGAGG CAATGAGGGT ATGGGCGGAG CCTTGATTTT GTGTGCTTCC GCTTGTGCTC GTTCGGGAGC TGGGCTGAGC GCGGCAATGA CCCATCCCGA TAACGTTACC GCTATGCTGA CGATTACACC GGAAGTGATG AGCACAAGCT GGAATAAACA GCATTTATTT GAAGAGCGCA TTGAATGGTG TGATGCCCTT GCTTTGGGGC CCGGATTAGG GCGAGATGCG CAAGCGCAGC AGATTATGCA GCGCTTAAGT AGCTTGAAGG TTCCGAAAGT GTGGGATGCG GATGCACTCT ATTTTCTAGC GCATAACCCC AGCTATGATG CGCAGCGGAT CATTACACCG CATCCCGTCG AAGCGGCGCG TTTATTGGGC TGTGAAGTGG AAGAGGTGGA GCAAGATCGT TTTGCGGCGA TTCGCCAGCT TCAGCAACGC TATGGAGGCG TTGTCGTGCT CAAAGGTGCG GGGACTTTAG TGGATGATGG GAAAGAGATC GCGGTCTGCT TACAGGGGAA TCCTGGAATG GCCAGTGGGG GGATGGGCGA TGTACTTACT GGCATTATTG TGGCGCTATT AGCGCAAAAA ATTCCCTTAG CGGATGCGGC AAAACTCGGG GTTTGGCTAC ACAGTAGCGC TGCGGATCTC AATACCAAAT CGCATGGCCA GAGAGGACTT CTGGCCAGCG ATTTATTGCC TCATCTGCGT GAGTTATTGA ATTAA
|
Protein sequence | MDTIMPLPTH FYTTQQLKQG EQDAASERGL ELFHLMERAG QAVFTIAFAQ YPTSHHWLIC CGGGNNGGDG YIVAVLARHM GIDVTVWQLG DPEKLPADAH RAYQQWKELG GAVYAPQSEV PESTDVIIDA LFGIGLKEVL RPQVVPLVEL LNQSGKPIVA VDVPSGLCAD TGQVMGTCIK AQHTVSLIGL KQGLVTGQAR CYVGTLHYAG LGVEEVFAQH NTPSLVSIDG KLRHSLLPPR QACTHKGQNG KALIVGGNEG MGGALILCAS ACARSGAGLS AAMTHPDNVT AMLTITPEVM STSWNKQHLF EERIEWCDAL ALGPGLGRDA QAQQIMQRLS SLKVPKVWDA DALYFLAHNP SYDAQRIITP HPVEAARLLG CEVEEVEQDR FAAIRQLQQR YGGVVVLKGA GTLVDDGKEI AVCLQGNPGM ASGGMGDVLT GIIVALLAQK IPLADAAKLG VWLHSSAADL NTKSHGQRGL LASDLLPHLR ELLN
|
| |