Gene Noc_1455 details

Gene Information

Locus tag: Noc_1455
Symbol:
ID: 3706024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1610568
End bp: 1611548
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 53%
IMG OID: 637737944
Product: hypothetical protein
Protein accession: YP_343473
Protein GI: 77164948
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID: [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family
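
The reported gene and protein lengths follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the function names are illustrative, not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1610568, 1611548)
print(length, protein_length(length))  # 981 326
```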


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.435019
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCTT TACATAACGA TAATGTAGAA TCTACACCAA CCACAAAGAC GCTAGAACCA 
GGCTCGCCCA ATTCGCGCCT ATTTCTCATT CTTAATCCGG TCGCGGGCAG TTGCAGCGCC
GAGCGGGTCA GGTTTACTCT GAAGCAATAC TGCGAGCAGC ATGATGTAGG CTACGAAATT
TATGAGACCA CCGGCAAAGA GCACTTGCCC AGTATTGTGC GCCAGGCACG GGAGGAAGAC
TATAGCGTCA TCGTTGCAGC GGGCGGTGAT GGCACCGCTT CGATGGTGGC CGGTGAGTTA
ATCCACAGCC CGATTCCTCT GGGTATTATC CCAGTGGGCA CGGCCAATTT ACTAGCCCGT
GAGTTGGCCA TCCCGTTAGA TCTGGAGTCC GCCTGTCAAC TCGTGGTTAC CGGGGGTGCC
ATAAGAAAGA TTGATGCCAT GCGGGTGGGC CGTCAGGTTT TGATTTCTCA TATTAGCCTG
GGTTCTTATT CGCGCATTGC GGAGAGAACC AGCGTGGAGG CTAAACGGCG TTTTCGCCAA
CTCGCCTATA TCTGGAATGG GATAGCCGAA TTTATCGGCA CTCGGGTATG GCGTTTTGAC
CTCGTTGTGG ACGGTCAGCG GCAGCGCATT AAAGCCGCTT TTATTATGAT CGCTAACGTA
GGCGCCATGG GAGCGGCTAC CCTGCGCTGG GGTGAAGAGG TCAAGCCTGA TGACGGGAAA
GTAGATATTT GTATTGTCCG AACCCGGGGC CTTCTCCATT ACTCGTCTTT TTTGTGGCAT
GCCTTGAGAG GACGGCATAA GGAATCTCCC CATACGGACT ATTTATGGGC CGAAAAAAAT
ATAAAGGTAA CGGCAAAAAA GAATTTGCCG GTGCGGGGCG ATGGGGAAAT TATTGGTCGC
TCCAGCGTGG AGATAGAGAT TATCCCAAGG GCCGTTCCCA TCATCGTCCC CGCTCCCGTG
CCTGATGAGA TAGCCTCCTG A
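
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line. A small sketch of how one might strip that layout and compute a GC fraction to compare with the reported GC content; the helper names are hypothetical, and only the first line of the sequence is used here for brevity:

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence; paste the full block to check the whole gene.
first_line = "ATGAAATCTT TACATAACGA TAATGTAGAA TCTACACCAA CCACAAAGAC GCTAGAACCA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

A single line is not expected to match the gene-wide figure; the comparison is only meaningful on the full 981 bp sequence.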
 
Protein sequence
MKSLHNDNVE STPTTKTLEP GSPNSRLFLI LNPVAGSCSA ERVRFTLKQY CEQHDVGYEI 
YETTGKEHLP SIVRQAREED YSVIVAAGGD GTASMVAGEL IHSPIPLGII PVGTANLLAR
ELAIPLDLES ACQLVVTGGA IRKIDAMRVG RQVLISHISL GSYSRIAERT SVEAKRRFRQ
LAYIWNGIAE FIGTRVWRFD LVVDGQRQRI KAAFIMIANV GAMGAATLRW GEEVKPDDGK
VDICIVRTRG LLHYSSFLWH ALRGRHKESP HTDYLWAEKN IKVTAKKNLP VRGDGEIIGR
SSVEIEIIPR AVPIIVPAPV PDEIAS
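
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final 21 bases of the gene sequence shown above.
print(translate("ATGAAATCTT TACATAACGA TAATGTAGAA"))  # MKSLHNDNVE
print(translate("CCTGATGAGA TAGCCTCCTG A"))           # PDEIAS
```

Both fragments agree with the start and end of the listed protein sequence.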