Gene VC0395_A1759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1759 
Symbolipk 
ID5135763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1879880 
End bp1880752 
Gene Length873 bp 
Protein Length290 aa 
Translation table11 
GC content51% 
IMG OID640533216 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_001217698 
Protein GI147673745 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000124262 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCACG GCACAACCGT GTGGCCTTCA CCGGCCAAAC TCAACCTGTT CCTTTACATC 
ACAGGTCGTC GAGCTAACGG CTATCACGAT CTGCAGACCT TGTTTCAGTT TCTCGATCAC
GGTGATGAGT TAACCATTAC CGCCAACAAC AGCGGCAACA TCACCCTCTC TCCTGCTCTG
GCCGATGTCG CGTTAGAAGA TAACCTGATT TACAAAGCCG CGATGGCACT CAAAAATGCG
GCGCAATCAC CACTCGGCGC AGACATTCAA CTGCACAAGG TGTTGCCTAT GGGCGGCGGA
ATTGGTGGCG GATCATCCAA TGCGGCCACT ACCTTAGTCG CACTCAATTA CTTGTGGCAA
ACTGGGCTTA GCGATGATCA ATTGGCTGAA ATTGGGCTGG CACTCGGAGC GGATGTCCCT
GTCTTTACTC GTGGTTTTGC GGCTTTTGCT GAAGGAGTTG GCGAAGAATT ATCCGCAGTA
GAGCCAGAGG AAAAATGGTA TCTCGTGGTT CGCCCTGCGG TCAGCATCGC GACAAAAGAT
ATTTTCACTC ATCCACAGCT GATGAGAAAC ACGCCAAAGC GTGATCTGGC AAGCCTTCTT
ACCACCCCGT ACGAAAACGA TTGCGAAAAA ATTGTCCGAT CACTGTACCC CGAGGTTGAT
AAGCAACTTT CATGGCTGCT ACAATACGCG CCGTCAAGAT TGACCGGGAC GGGATCTTGC
GTTTTTGCTG AGTTTTCGAG CAGGAAAGAT GCACAGGCCG TCTTTGCTCA ATTATCTGAC
AACGTCTTAG CGTTTGTCGC CCAAGGGCGC AATGTTTCAC CGCTCAGAAA GACGTTGGCT
GACTACCAAT CAGCTAAAAT CCGACCTTAC TAA
 
Protein sequence
MIHGTTVWPS PAKLNLFLYI TGRRANGYHD LQTLFQFLDH GDELTITANN SGNITLSPAL 
ADVALEDNLI YKAAMALKNA AQSPLGADIQ LHKVLPMGGG IGGGSSNAAT TLVALNYLWQ
TGLSDDQLAE IGLALGADVP VFTRGFAAFA EGVGEELSAV EPEEKWYLVV RPAVSIATKD
IFTHPQLMRN TPKRDLASLL TTPYENDCEK IVRSLYPEVD KQLSWLLQYA PSRLTGTGSC
VFAEFSSRKD AQAVFAQLSD NVLAFVAQGR NVSPLRKTLA DYQSAKIRPY