Gene Noc_2718 details


Gene Information

Locus tag: Noc_2718
Symbol:
ID: 3704744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3084992
End bp: 3086107
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 50%
IMG OID: 637739200
Product: HAD family hydrolase
Protein accession: YP_344701
Protein GI: 77166176
COG category: [R] General function prediction only
              [S] Function unknown
COG ID: [COG0561] Predicted hydrolases of the HAD superfamily
        [COG3100] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
            [TIGR01485] sucrose-6F-phosphate phosphohydrolase
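
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; both are assumptions about this record's conventions rather than something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3084992
end_bp = 3086107

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is read in codons of 3 bp. Assumption: the last
# codon is the stop codon and adds no amino acid.
total_codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = total_codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1116 0 371
```

Under these assumptions the result agrees with the listed gene length (1116 bp) and protein length (371 aa).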


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACAAA AAATTTTGCT TTGTTCTGAT CTGGATCGGA CCCTGCTTCC CAACGGCCAT 
CAAGCGGAGT CGCCCCAGGC GCGGCTTCGG CTGCAACGAT TGGCCCAGCG TCCTGGCATT
ATTTTGGCCT ATGTCAGCGG TCGTCATAAG GCGCTCATTC AAAGCGCTAT TCGAGAATAT
GATCTTCCTC TTCCTGACTT TGCCATTGGT GATGTGGGGA CGACTATCTA TCAAATTACG
GATAATCAAT GGCATCCTTG GGAAGACTGG AGTAAAGAAA TCAGCCAGGA TTGGCAAGGA
ATAAACCAGG CTGGGCTGGC AAAACTTTTT GCTGACATTA CTCCACTGCG GTTGCAAGAG
CCGGAAAAGC AAAACCGCTA TAAGCTCAGC TATTATGCTC CACCGGAGCT AGATTGGGAG
AATCTGATTC CCCAATTGGC CCAGCGGTTA CAAGCGCAGG GAATCCAGGC TTCTTTCATT
TGGAGTGTAG ATGAAACTGC CCAGATCGGT TTGCTCGATA TCCTGCCAAA GCGGGCCAAC
AAACTCCATG CCATTCGCTT TTTAATGGAG CGTCAGCATT TTGATAAAAG CCATACCGTT
TTTGCGGGGG ATAGTGGCAA CGACCTGGAG GTGTTAGCTA GCGGTCTCCA AGCTATTCTG
GTCCGCAATG CCCAGGAAGA AGTGCGCCAG GAAGCTCTCC GCCGCCTCCC GCCAGAGCAT
AGTCAGCAGC TTTATCTCGC CCGGGGTGGC TTTATGGGGC TTAACGGTTA TTACAGCGCG
GGAGTGCTAG AGGGCCTAGC CCATTTTTTT CCTGAAACCC GGGCATGGAT GGAAACAGGG
AGAGAAGAGT CAGCGGAGGA AGAAACGGCA CAATCCTGCG CCATTTATCG AAGCTGTAAA
AGAAATGATA GCTACTTATA TGTGGAATCT CAGGATGATT TTTCCCGCGT TCCTGGAAAA
TTGTTGGAAA TGCTTGGAAA GCTAGAGTTT GTCATGAGAC TGGAGCTGCG TCCCGAGATC
TCCCTGGCCC AGGCCAATAC CAGGGAAGTG ATGCAAATGC TCAGGGAGAA AGGCTATTTT
TTGCAGTTAT CATCAAGGGA ATACAGGCGG TCTTAA
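
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, applied only to the first line of the listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing.
first_line = "GTGAAACAAA AAATTTTGCT TTGTTCTGAT CTGGATCGGA CCCTGCTTCC CAACGGCCAT "

seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))  # 60 0.45
```

The value for this 60-base fragment need not match the 50% GC content listed for the whole gene; passing the full cleaned sequence to `gc_fraction` would be the way to check that figure.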
 
Protein sequence
MKQKILLCSD LDRTLLPNGH QAESPQARLR LQRLAQRPGI ILAYVSGRHK ALIQSAIREY 
DLPLPDFAIG DVGTTIYQIT DNQWHPWEDW SKEISQDWQG INQAGLAKLF ADITPLRLQE
PEKQNRYKLS YYAPPELDWE NLIPQLAQRL QAQGIQASFI WSVDETAQIG LLDILPKRAN
KLHAIRFLME RQHFDKSHTV FAGDSGNDLE VLASGLQAIL VRNAQEEVRQ EALRRLPPEH
SQQLYLARGG FMGLNGYYSA GVLEGLAHFF PETRAWMETG REESAEEETA QSCAIYRSCK
RNDSYLYVES QDDFSRVPGK LLEMLGKLEF VMRLELRPEI SLAQANTREV MQMLREKGYF
LQLSSREYRR S
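
The protein sequence can be related back to the gene sequence by codon-wise translation. Translation table 11 (bacterial) uses the standard amino-acid assignments but permits alternative start codons such as GTG, which is read as methionine when it initiates translation; that is why this gene begins with GTG while the protein begins with M. The sketch below is a hand-rolled illustration, not the database's own method: `translate` is a hypothetical helper, and `START_CODONS` is only an illustrative subset of the starts allowed by table 11.

```python
# Standard amino-acid assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: illustrative subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, at_start: bool = True) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    If at_start is True and the first codon is a recognised start codon,
    it is translated as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and at_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the listing.
first_line = "GTGAAACAAA AAATTTTGCT TTGTTCTGAT CTGGATCGGA CCCTGCTTCC CAACGGCCAT"
last_line = "TTGCAGTTAT CATCAAGGGA ATACAGGCGG TCTTAA"

print(translate(first_line))                # MKQKILLCSDLDRTLLPNGH
print(translate(last_line, at_start=False)) # LQLSSREYRRS
```

Both fragments agree with the corresponding ends of the listed protein sequence, and the final codon TAA is a stop codon. The last line begins in frame because the 18 preceding lines hold 1080 bases, a multiple of 3.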