Gene Noc_1697 details

Gene Information

Locus tag: Noc_1697
Symbol:
ID: 3705610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1899493
End bp: 1900773
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 54%
IMG OID: 637738178
Product: carbohydrate kinase, FGGY
Protein accession: YP_343699
Protein GI: 77165174
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID:
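
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record, not statements made on the page, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1899493
END_BP = 1900773


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1281 426
```

Under these assumptions the result agrees with the listed gene length (1281 bp) and protein length (426 aa).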


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.667396
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTCTG ATCTTTTTCT TGGTATTGAC ATGGGCACTT CAGGCTGCCG AATCATTGCT 
ATCAATGAGA TGGGGGAAAT CCGGGGCCGC AGCCATGTTT CCCTGCCTCC TCCCCAGCGG
CAAGGAAGGG CGATAGAACA GGATCCAGAG CAATGGTGGC AAGCCATCAA GCAAGCTTTT
ACAAATCTGT TTCAGGAGGT GCCCGCCAAG GCAGTTCGCT CCCTTGCTGT TGATGGAACT
TCCGGTACGG TGGTCTTAGT AGATAAAAAA GGGAGTCCTT TAACTCCCGC CTTACTCTAT
AATGACAGCC GCAACCATGC TGAAGCGCAC CTCATTGCCG AGCAGGCCCC TCCCCATAGC
GGCGCCTTGG GTCCCACTTC TAGTCTAGCG AAACTGCTCT ATTTGCAGAC CCGTCCAGAG
GCGCTTCAGG CAGCTTACCT GCTGCATCAA GCAGATTGGA TCGCCTTCCG TTTAGGGGCA
AAGTTGGGGA TCAGCGACGA AAATAATTGC CTTAAGACAG GATATGACTC CAACCAGCAA
GAATGGCCTG ACTGGCTCGA CCGAGTGGGT GTACGCCGCG AACTGCTGCC GACGGTAGTT
CCTCCCGGAA CCCTAATTGG CACCATTGAC CCCTCCCATA CAGAAACGTT TCAAATCCCC
CCCCAGGCGA AGTTAGTGGC GGGCACCACC GATAGTATTG CTGCCTTTAT AGCTACCGGG
GCGGGAAAAC CAGGAGACGC TGTCACCTCC CTTGGTTCTA CCTTAGCGCT CAAGGTAGCC
TCCGAACGCC CCATTTTTAG CGCAAAATAT GGCATTTACA GCCATCGCCT AGGCAGGCTT
TGGCTGGCTG GTGGCGCTTC TAATAGTGGT GGCATTGTTC TGCGGCAGTA TTTTACTCAA
GCTCAACTCG ATGAGATGAC TCCCCATCTT CAGCCACAGC AAATTACGGG ATTGAATTAC
TATCCTCTAC CCGCCCAGGG CGAGCGCTTT CCCGTTCCAG ACCCCCACTA TTCGCCATGT
CTCGCTCCCC GTCCCCGTGA TGATATAACC TTCTTTCAAG CTATTCTGGA GGGTATTGCC
CGTATTGAAG CCCAAGGTTA TCGTCAACTT CAATCGCTAG GCGCCCCTTT CCCCAGCCTG
GTGAAAACCA CTGGCGGAGG TGCTCACAAT CCCGCTTGGT TGCAAATAAG GGAACACACC
CTTCGGGTTC CGGTGATTGC TGCCCATGAG ACCGAGGCCG CCTATGGGAG CGCATTATTA
GCGCGTCAGG CTCTCTCCTG A
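
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC fraction. It assumes the listed "GC content" is the fraction of G and C bases in the gene sequence, which the page does not state explicitly, and the helper names are hypothetical. Only the first line of the sequence is used here as an excerpt; reproducing the listed 54% would require the full sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short excerpt.
first_line = "ATGTCCTCTG ATCTTTTTCT TGGTATTGAC ATGGGCACTT CAGGCTGCCG AATCATTGCT"
excerpt = clean_sequence(first_line)
excerpt_gc = gc_content(excerpt)
print(len(excerpt), f"{excerpt_gc:.2f}")  # 60 0.45
```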
 
Protein sequence
MSSDLFLGID MGTSGCRIIA INEMGEIRGR SHVSLPPPQR QGRAIEQDPE QWWQAIKQAF 
TNLFQEVPAK AVRSLAVDGT SGTVVLVDKK GSPLTPALLY NDSRNHAEAH LIAEQAPPHS
GALGPTSSLA KLLYLQTRPE ALQAAYLLHQ ADWIAFRLGA KLGISDENNC LKTGYDSNQQ
EWPDWLDRVG VRRELLPTVV PPGTLIGTID PSHTETFQIP PQAKLVAGTT DSIAAFIATG
AGKPGDAVTS LGSTLALKVA SERPIFSAKY GIYSHRLGRL WLAGGASNSG GIVLRQYFTQ
AQLDEMTPHL QPQQITGLNY YPLPAQGERF PVPDPHYSPC LAPRPRDDIT FFQAILEGIA
RIEAQGYRQL QSLGAPFPSL VKTTGGGAHN PAWLQIREHT LRVPVIAAHE TEAAYGSALL
ARQALS
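
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used to produce this record. It assumes translation table 11 assigns internal codons the same amino acids as the standard genetic code, and it does not handle alternative start codons (this gene starts with ATG, so none is needed). The `translate` helper is hypothetical, and only the first line and the last line of the gene sequence are used as excerpts.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence; both begin in frame
# because every full line holds 60 bases.
gene_start = "ATGTCCTCTG ATCTTTTTCT TGGTATTGAC ATGGGCACTT CAGGCTGCCG AATCATTGCT"
gene_end = "GCGCGTCAGG CTCTCTCCTG A"

n_terminus = translate(gene_start)
c_terminus = translate(gene_end)
print(n_terminus)  # MSSDLFLGIDMGTSGCRIIA
print(c_terminus)  # ARQALS
```

Both excerpts match the corresponding ends of the protein sequence listed above, and the final codon, TGA, is a stop codon.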