Gene Noc_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1087 
Symbol 
ID3707076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1192577 
End bp1193626 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content52% 
IMG OID637737589 
Productpeptidase M42 
Protein accessionYP_343122 
Protein GI77164597 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTCGA TGAGTTATGA ACGGCTATTT GCGCAGATAG AAGATCTGGT CTTGCGCCAT 
TCTCCCAGTG GGGCCGAGCA AGAAATAGAT GGATGGTTGT TAGAACGTTT CACGGCGTTA
GGGCAGACTG TCTGGCAAGA TGCCGCGGGC AACATTGTTG TGAAAGTTAG GGGGAAAAAT
CCCGGGGCAA TCGCGATTAC CGCCCATAAG GATGAAATTG GCGCAATCGT GAAAACTAGT
AATAGCCAAG GGCAAGTGGA GGTGCGGAAG CTAGGCGGCG CTTTCCCCTG GGTCTACGGA
GAAGGGGTAG TGGATTTGTT AGGCGACCAT GCCTGCATTA GCGGTATTCT TTCGTTTGGT
TCCCGCCATG TTTCCCATGA GAGCCCGCAG AAAGTCCAGC AAGATCAGGC GCCACTGCAA
TGGCAGCATG CCTGGGTGGA AACCAAATGC ACGCAAAAAG AGCTGGAGGC CGCCGGAGTG
CGTTCAGGTA CGCGGATGGT GGTGGGAAAA CACCGCAAGC GCCCCTTCCG CCTTAAGGAT
TACATTGCCA GTTATACGTT GGATAACAAG GCTTCAGTGG CGATTTTGCT GGCATTGGCG
CAGAAGCTTC ATAAACCCTT AGTAGATGTT TATTTGGTGG CCTCTGCCAA GGAAGAAGTG
GGAGCAATAG GAGCGCTTTA CTTTAGCCGG TGTCATTCTC TGGATGCTTT GATTGCTTTG
GAGATATGTC CCTTGGCGCC TGAATATTTC ATCAAAGAAG GCACCGCTCC TGTGCTGCTT
TCCCAGGACG GTTATGGTCT TTACGATGAG GGACTCAATG GACTTATCAG GCAAGCCGCG
GCCCGGCGAG AAATCCCCTT GCAACTGGCG GTGATTAGCG GTTTCGGCAG TGATGGTTCC
ATTGCCATGA AATATGGCCA CGTGCCACGG GCCGCTTGTT TAGGTTTCCC TACCCAAAAC
ACACATGGTT ATGAGATCGC CCATTTAGGC GCCATTGCCC ATTGTATTGA AATACTTTAC
GCCTACTGCA CCCAGACGGA GGAGGGTTAG
 
Protein sequence
MDSMSYERLF AQIEDLVLRH SPSGAEQEID GWLLERFTAL GQTVWQDAAG NIVVKVRGKN 
PGAIAITAHK DEIGAIVKTS NSQGQVEVRK LGGAFPWVYG EGVVDLLGDH ACISGILSFG
SRHVSHESPQ KVQQDQAPLQ WQHAWVETKC TQKELEAAGV RSGTRMVVGK HRKRPFRLKD
YIASYTLDNK ASVAILLALA QKLHKPLVDV YLVASAKEEV GAIGALYFSR CHSLDALIAL
EICPLAPEYF IKEGTAPVLL SQDGYGLYDE GLNGLIRQAA ARREIPLQLA VISGFGSDGS
IAMKYGHVPR AACLGFPTQN THGYEIAHLG AIAHCIEILY AYCTQTEEG