Gene Noc_2053 details


Gene Information

Locus tag: Noc_2053
Symbol:
ID: 3705029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2361833
End bp: 2363671
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 54%
IMG OID: 637738528
Product: glycoside hydrolase 15-like protein
Protein accession: YP_344043
Protein GI: 77165518
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
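
The length fields in this record follow from the coordinates. The sketch below reproduces that arithmetic. It assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed values are consistent with it) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2361833, 2363671)
print(length, protein_length(length))  # 1839 612
```

Both results match the Gene Length and Protein Length fields of this record.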


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.440439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAAC ACAATTACCC TGCCATTAGT GATTATGGCT ATATTTCCGA TTGCCATTCC 
TCCGCCCTGA TCTCGAAATC CGGTTCCATC GACTGGTGCT GCATGCCACG GGTGGATTCC
CGCAGCTGTT TTGGCCGTCT TCTGGGCTGG GAGCAAGGCG GGTACTGTCA AATCGCTCCG
CCAGAACCCC ATGAAGTATC CCGCCGTTAC CTCCCTCAGA CGCTGATTCT GGAAACCACC
TTTCGAACGA GCGAGGGAGA AGCCCGTCTC CTGGATTGTT TTACGCTGCG GGAAGGGGGG
AAACAGCACC CTCACCGGCA GATCCTCAGG GTACTTGAAG GGCTAAAAGG GCAGGTGAGC
TTCCGCGTCG ATATTGCGCC ACGCTTCGAT TACGGCGCTA TCAAACCTTG GATTCAGCGG
CGCCATAATA ATCATTCTGG CGATTATTAT ATTGTGATAG GAGGCAGTGA TGGGTTACTC
ATCTCCAACG ATTTTCGCCT GGAAATGAAG GATCGCCATA ATCTTCAAGG CGCTTGCCAT
ATTAAAGAAG GGCAACGAGT TCATCTCTCC CTTTTATACC GGCGCCCGGA AAGTCTTGAT
GAAGGCTGGG CTAATATCCC TACTATCGAA ACGCTGGACC AGCGCTTGGA AGAAACTATC
AAATGGTGGC ATGCCTGGTT CTCCCAGGGC GAATTTAACG GCCCCCATGC TGAACAAGCG
CAGCGTTCGG CCCTTGTCCT CAAGGGCTTA TGCAATGCGC CTACGGGGGC TATCGCGGCG
GCCTCCACAA CATCTCTTCC GGAAGCGCCC GGCGGGGAGC GGAACTGGGA TTATCGTTTT
ACTTGGATTC GGGACTCTAC CTTTACGGTC AGATCACTGG CGGATCTTGG ATATATCAAG
GAGGCAGATG GTTTTCGCCG TTTTATCGAG CGGACTGCAG CTGGATGTGC GGACGAAGTC
CAGATTTTAT TTGGTGTGGG GGGAGAACGG CGACTGCATG AATTTGAGAT TAAAGAATTG
CCAGGATACC GGGGAGCAAA GCCAGTGCGC CAAGGCAATG CGGCGGAAAA ACAAATCCAA
CTAGATGTTT ATGGAGAATT ATTGGAGTTA GCCTGGCGCT GGCGCCAGCG GGGGCAAACC
CCAGACGAAG ATTATTGGGA ATTCCTAGCG GGCCTTGTGA ATGCAGCGGG TGAGCGTTGG
AAAGAGCCAG ATCAAGGTCT TTGGGAGATG CGCGGTGAAC CCCGTCATTT CGTCCACTCC
AAGGTCATGT GTTGGGCGGC CTTAGATCGA GGGATCAAAC TGGCTGCAGA CCTTGATAAT
CATGCGCCTC TTGAGTGGTG GAAGCAGGAA CGGAAAGCGG TCCGCCAAGC AGTGGAAGAG
AAGGGCTATG ATTTCCAGCG CGGTATTTTT ATTCAGGCCT TTGATCATGT TGAGATGGAC
GCGGGTTTAT TGTTATTGCC CGTGGTGGGA TTCGTGGATT ATCAGGACGA ACGCATGATA
CGGACCACAA ACGCCGTATG GCGGGACTTG GAACAAGAAG GTCTGCTGCG CCGCTATAGA
GCGGAAAGTC ACGATGATGG CCTGCAGGGC AAGGAAGGCG TGTTTCTGGC TTGCTCCTTT
TGGCTGGCGG AATGCCTGGC TTACCAAGGC CGCCTGGAAG AGGCGCGCGA GGTGTTCACG
CAGGCAGCGG CTACCGGCAA TGATCTTGGC CTTTATTCAG AGGAATACGA TACCGAAAAA
AAGGAGATGT TGGGCAACTT TCCCCAAGGT TTGACTCACC TTTCCCTGAT TGCCGCCGCG
GTAGCCCTGT CAAAGGTGGC AGAAGTGGGA GGGAACTAA
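
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in this layout. The following is a minimal sketch and not the database's own method: `gc_fraction` is a hypothetical helper, and it assumes the sequence contains only A, C, G and T once the spaces and line breaks that separate the 10-base blocks are removed.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGGATAAAC ACAATTACCC"
print(f"{gc_fraction(first_blocks):.0%}")  # 35%
```

Passing the full gene sequence as one string gives the value for the whole gene.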
 
Protein sequence
MDKHNYPAIS DYGYISDCHS SALISKSGSI DWCCMPRVDS RSCFGRLLGW EQGGYCQIAP 
PEPHEVSRRY LPQTLILETT FRTSEGEARL LDCFTLREGG KQHPHRQILR VLEGLKGQVS
FRVDIAPRFD YGAIKPWIQR RHNNHSGDYY IVIGGSDGLL ISNDFRLEMK DRHNLQGACH
IKEGQRVHLS LLYRRPESLD EGWANIPTIE TLDQRLEETI KWWHAWFSQG EFNGPHAEQA
QRSALVLKGL CNAPTGAIAA ASTTSLPEAP GGERNWDYRF TWIRDSTFTV RSLADLGYIK
EADGFRRFIE RTAAGCADEV QILFGVGGER RLHEFEIKEL PGYRGAKPVR QGNAAEKQIQ
LDVYGELLEL AWRWRQRGQT PDEDYWEFLA GLVNAAGERW KEPDQGLWEM RGEPRHFVHS
KVMCWAALDR GIKLAADLDN HAPLEWWKQE RKAVRQAVEE KGYDFQRGIF IQAFDHVEMD
AGLLLLPVVG FVDYQDERMI RTTNAVWRDL EQEGLLRRYR AESHDDGLQG KEGVFLACSF
WLAECLAYQG RLEEAREVFT QAAATGNDLG LYSEEYDTEK KEMLGNFPQG LTHLSLIAAA
VALSKVAEVG GN
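
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only. It builds the codon table from the NCBI standard ordering (bases T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, which does not matter here because this gene begins with ATG. The `translate` helper is hypothetical and does not handle ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATAAAC ACAATTACCC"))  # MDKHNY
print(translate("GGGAACTAA"))              # GN
```

The two calls use the first 20 bases and the last 9 bases of the gene sequence, and their output matches the start (MDKHNY) and end (GN) of the protein sequence.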