Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0781 |
Symbol | |
ID | 3707047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 853870 |
End bp | 855108 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637737283 |
Product | allantoate amidohydrolase |
Protein accession | YP_342824 |
Protein GI | 77164299 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACGC CTCTTAGGGT AAATTTTAAG CGGCTGCAAG CGGATGTGGA AACCCTGGCC CATATCGGCC GGCGGGCCGA TTACGGTCTT TATCGCATGG CCTTTAGCAA AGGCGATCAG GCAGCCCGTG AATGGTTCCA AGAGCGCATT CATGAGGCAG GGCTGGATCT TTATATAGAC GGCGCCGCCA ACATTCATGC CCGCTTCAAC TGGAACGGAG AACGCCCCAG CGTCATGACC GGCTCCCATC TCGATACCGT CCCTGGTGCT GGCCACCTGG ATGGAGCCCT GGGCGTATTA GTCGGGCTTG AATGTTTGCG CCGCTTCAAA GAACTCGACC TCTCTTTACG CTATGCAGTA GAGGCGATTG CCTTTACCGA TGAGGAAGGA CGCTTTGGCG GCCTGTTGGG ATCCCAGGCT ATCAGCGGCC GCCTCACCCC GGAAGCCATC CATAATGCCC GCGACTTGGA CGGAATCAGC CTCTCCCAGG CCATGACCGC CCAGGGACTA AATCCCGCGG ACATCCTGCG AGCAAGGCGC AAACCAGAAA GTCTCATCGC CTTTTTGGAA CTCCACATTG AACAAGGTCC CATCCTTGAG CGGCAAGGCG TTAGCGTGGG AGTCGTCGAA GGAATCGTGG GCCTGTTCAA ATGGGAAGTC ACCCTTAAGG GCACCGCCAA CCATGCCGGC ACCACACCTA TGGATATGCG CCAGGATGCC TTGCAAGGTC TGGCCGAATT CGCAGGAGAA ATTACCCGAG TTCTGGAAGA AAATGGCGGT CCCCGCAGCG TGGCCACTAT CGGCCGGGTA GAGGTTTTTC CTGGCGCTGC AAATGTAATC CCAGGAAGCG TCAAGTTTTC TCTGGATGTG CGGGATACCG AGGCAATCAT TCTCAAGGAT TTGACCCACG CCTTCCGCCT CGCCCTCTCG GCAATCGCCC GCCGCCGCGG GCTCATGTTC GAATTTGAAG TGTTGAGCGA AATTGAACCG GTTAAGTGCG ATCCTGGCAT CATGGAGACC ATCTTTAATG CGGCCCGGAG CCTCGGGGTA GAGCCTTTGC AAATGCCAAG CGGAGCCGCC CATGACACCC AAATCATGGC AACCCTGACC CGGGCAGGCA TGATTTTCGT TCCTAGCCAA GGAGGGCGCA GCCATTCTCC AGCGGAATGG ACTCCCTGGG AAGACATTGA AACGGGCGCA AACGTGGCCT TGAATACGCT CTATCAATTA GCCCATTAA
|
Protein sequence | MKTPLRVNFK RLQADVETLA HIGRRADYGL YRMAFSKGDQ AAREWFQERI HEAGLDLYID GAANIHARFN WNGERPSVMT GSHLDTVPGA GHLDGALGVL VGLECLRRFK ELDLSLRYAV EAIAFTDEEG RFGGLLGSQA ISGRLTPEAI HNARDLDGIS LSQAMTAQGL NPADILRARR KPESLIAFLE LHIEQGPILE RQGVSVGVVE GIVGLFKWEV TLKGTANHAG TTPMDMRQDA LQGLAEFAGE ITRVLEENGG PRSVATIGRV EVFPGAANVI PGSVKFSLDV RDTEAIILKD LTHAFRLALS AIARRRGLMF EFEVLSEIEP VKCDPGIMET IFNAARSLGV EPLQMPSGAA HDTQIMATLT RAGMIFVPSQ GGRSHSPAEW TPWEDIETGA NVALNTLYQL AH
|
| |