Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1759 |
Symbol | ipk |
ID | 5135763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1879880 |
End bp | 1880752 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640533216 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001217698 |
Protein GI | 147673745 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000000124262 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCACG GCACAACCGT GTGGCCTTCA CCGGCCAAAC TCAACCTGTT CCTTTACATC ACAGGTCGTC GAGCTAACGG CTATCACGAT CTGCAGACCT TGTTTCAGTT TCTCGATCAC GGTGATGAGT TAACCATTAC CGCCAACAAC AGCGGCAACA TCACCCTCTC TCCTGCTCTG GCCGATGTCG CGTTAGAAGA TAACCTGATT TACAAAGCCG CGATGGCACT CAAAAATGCG GCGCAATCAC CACTCGGCGC AGACATTCAA CTGCACAAGG TGTTGCCTAT GGGCGGCGGA ATTGGTGGCG GATCATCCAA TGCGGCCACT ACCTTAGTCG CACTCAATTA CTTGTGGCAA ACTGGGCTTA GCGATGATCA ATTGGCTGAA ATTGGGCTGG CACTCGGAGC GGATGTCCCT GTCTTTACTC GTGGTTTTGC GGCTTTTGCT GAAGGAGTTG GCGAAGAATT ATCCGCAGTA GAGCCAGAGG AAAAATGGTA TCTCGTGGTT CGCCCTGCGG TCAGCATCGC GACAAAAGAT ATTTTCACTC ATCCACAGCT GATGAGAAAC ACGCCAAAGC GTGATCTGGC AAGCCTTCTT ACCACCCCGT ACGAAAACGA TTGCGAAAAA ATTGTCCGAT CACTGTACCC CGAGGTTGAT AAGCAACTTT CATGGCTGCT ACAATACGCG CCGTCAAGAT TGACCGGGAC GGGATCTTGC GTTTTTGCTG AGTTTTCGAG CAGGAAAGAT GCACAGGCCG TCTTTGCTCA ATTATCTGAC AACGTCTTAG CGTTTGTCGC CCAAGGGCGC AATGTTTCAC CGCTCAGAAA GACGTTGGCT GACTACCAAT CAGCTAAAAT CCGACCTTAC TAA
|
Protein sequence | MIHGTTVWPS PAKLNLFLYI TGRRANGYHD LQTLFQFLDH GDELTITANN SGNITLSPAL ADVALEDNLI YKAAMALKNA AQSPLGADIQ LHKVLPMGGG IGGGSSNAAT TLVALNYLWQ TGLSDDQLAE IGLALGADVP VFTRGFAAFA EGVGEELSAV EPEEKWYLVV RPAVSIATKD IFTHPQLMRN TPKRDLASLL TTPYENDCEK IVRSLYPEVD KQLSWLLQYA PSRLTGTGSC VFAEFSSRKD AQAVFAQLSD NVLAFVAQGR NVSPLRKTLA DYQSAKIRPY
|
| |