Gene Noc_2788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2788 
Symbol 
ID3705518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3163140 
End bp3164132 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content51% 
IMG OID637739264 
Productsugar phosphate isomerase 
Protein accessionYP_344765 
Protein GI77166240 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0517] FOG: CBS domain
[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000495576 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTTG CATACAATGA TGATATGGAT AAGCGGCTTA TTCAACTAGG AGCGGCTGTT 
ATCGACACTG AGGCCCATGC GATAGCAGCG CTACGAACGC GAATCAATGG GAACTTTGCT
GCCGCCTGTA AATACATGCT GGCATGCGAA GGCCGTATCG TTATATTAGG GATGGGTAAA
TCAGGCCATA TTGGCGGCAA AATTGCCGCT ACTCTTGCCA GTACGGGTAC GCCAGCTTTT
TTCGTCCACC CTGGTGAGGC CAGCCACGGG GATCTGGGTA TGATTACCGA AAAGGATGTG
GTACTAGCAC TGTCAAATTC TGGAGAGACG GAAGAAATCT GTACGATTCT CCCTCTCATC
AAACGCTTAG GCGTTCCACT GATTGCCTTA ACTGGCCAGC CCCGGTCAAC TCTAGGAAAG
GTTGCTGACA TCCACATCGA TATTAGCGTC GAAAAAGAAG CCTGCCCTCT CGGACTAGCG
CCGACGGCCA GCAGCACAGC AACTTTAGCC ATGGGCGATG CCCTGGCCAT TGCCTTGCTG
GAAAGCCGGG GGTTCACTGC AGAGGATTTT GCTCGTTCCC ATCCGGGCGG TCGTCTGGGG
CGCCGCCTCT TACTTCGCAT TAGCGATATC ATGCATAAAG GCGAGGAAAT TCCGGCTATC
CCAGAAAATG TGCTATTGAG CAGCGCTTTG CTAGAAATGA CTCGCAAAGG GTTAGGAATG
ACGGCTGTGG TCAATGCTCA AAACCACGCC GTAGGAATCT TTACCGATGG CGATTTGCGT
CGCGCTCTTG ATCAGGGAAT CGATGTCCAT ATTACCCCAA TTGCCAAAAT CATGACCGCC
AATTGTAAAA CCCTAGGTCC GGATCTGCTC GCCGCTGAAG CGTTGCAAAT AATGCAGCGT
CATCGTATTA ATGCACTCCT CGTAGTAGAC ACTGAGCAAC GCTTAATTGG CGCCCTCAAT
ATGCATGATT TGCTTCGAGC CGGTGTCTTG TAA
 
Protein sequence
MPVAYNDDMD KRLIQLGAAV IDTEAHAIAA LRTRINGNFA AACKYMLACE GRIVILGMGK 
SGHIGGKIAA TLASTGTPAF FVHPGEASHG DLGMITEKDV VLALSNSGET EEICTILPLI
KRLGVPLIAL TGQPRSTLGK VADIHIDISV EKEACPLGLA PTASSTATLA MGDALAIALL
ESRGFTAEDF ARSHPGGRLG RRLLLRISDI MHKGEEIPAI PENVLLSSAL LEMTRKGLGM
TAVVNAQNHA VGIFTDGDLR RALDQGIDVH ITPIAKIMTA NCKTLGPDLL AAEALQIMQR
HRINALLVVD TEQRLIGALN MHDLLRAGVL