Gene Pnec_1624 details


Gene Information

Locus tag: Pnec_1624
Symbol: ipk
ID: 6182966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 1426668
End bp: 1427558
Gene Length: 891 bp
Protein Length: 296 aa
Translation table: 11
GC content: 45%
IMG OID: 641672141
Product: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase
Protein accession: YP_001798312
Protein GI: 171464199
COG category: [I] Lipid transport and metabolism
COG ID: [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
TIGRFAM ID: [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
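
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1426668
end_bp = 1427558

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count towards the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not encode an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 891 bp
print(protein_length, "aa")  # 296 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.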


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.431144
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAATC TCGATTCTTT ATCTCTTCGC TCACCAGCTA AGCTCAATCT TTTTTTGCAT 
ATCGTTGGTC GCAGGACTGA TGGTTATCAC CTGCTTCAAT CTGTCTTTCA ATTAATCGAT
TGGTGCGACA CGCTGCACTT GAAACGTATT TCTGAAAATG TAGTGCGGCG AATCAACCCA
ATTCCCGGAG TTGCACCAGA ACACGATCTA GTGGTTCGCG CAGCAAATTT ACTAAAAGAT
TTTTGCCAAT TTGAAGGCGG CGTTGAAATT AACCTGCAAA AAGAAATTCC GATGGGCGCT
GGTATGGGCG GAGGATCTTC AGACGCAGCG ACTACTTTGA TCGGACTTAA CGCCCTTTGG
AGTCTCAACC TTTCCAAAGA AACGCTTTGC GCCTTAGGCC TAAAGCTGGG AGCCGATGTT
CCATTCTTTA TTTTTGGCAA AAATGCCTTT GTTGAGGGTG TCGGGGAGAA AATGCGAGAA
ATCTCCCTCG AAACCCCTGA TTTTTTGGTC ATATTTCCCA ACCGGGGAAT TGCAACCGCT
AGCATTTTTC AAGACCCGGA ATTGACCCGA GATCACGCTC AGATTACAAT TGATGGCTTT
CTTACATCGC CATTATTGTA TCAATCGAAT GATTGCCAAG CGGTAGCGAT GAGGATTTAC
CCAGAAGTGA AGCAAGCTTT GGATTGGATT ACCCAGGCAG TACCGGGCTC ACAGCCCCGT
ATGTCAGGCT CTGGAAGTAG TGTTTTTGCA GTCTTAGACT CTAAGACTGA CATCGCAAAA
CTAAAAAATT TTCTTCAAAA TCTTCCTAAA GGGTGGGTAG GTCGGGTTGT TCGGGGGCTA
AATAAAAACC CCGCTTACAA TTTGATTTCA TTTCTTCAGA TTGACCTGTA G
 
Protein sequence
MVNLDSLSLR SPAKLNLFLH IVGRRTDGYH LLQSVFQLID WCDTLHLKRI SENVVRRINP 
IPGVAPEHDL VVRAANLLKD FCQFEGGVEI NLQKEIPMGA GMGGGSSDAA TTLIGLNALW
SLNLSKETLC ALGLKLGADV PFFIFGKNAF VEGVGEKMRE ISLETPDFLV IFPNRGIATA
SIFQDPELTR DHAQITIDGF LTSPLLYQSN DCQAVAMRIY PEVKQALDWI TQAVPGSQPR
MSGSGSSVFA VLDSKTDIAK LKNFLQNLPK GWVGRVVRGL NKNPAYNLIS FLQIDL
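
The protein sequence follows from the gene sequence through translation table 11, and the GC content is the fraction of G and C bases in the gene sequence. The sketch below is one minimal way to reproduce both with the Python standard library. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the code assumes the sequence is given on the coding strand, in frame, starting at the first base of the start codon. Table 11 assigns the same amino acids to codons as the standard code; it differs in which codons may act as start codons, and this sketch does not handle alternative start codons (the gene here starts with ATG, so that does not matter).

```python
from itertools import product

# Codon-to-amino-acid assignments for translation table 11, written in the
# conventional T, C, A, G base order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
first_block = "ATGGTGAATC TCGATTCTTT ATCTCTTCGC"
print(translate(first_block))  # MVNLDSLSLR
```

Passing the full gene sequence to `translate` in the same way should give the protein sequence listed above, and `gc_fraction` on the full gene sequence can be compared with the GC content field.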