Gene Noc_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0781 
Symbol 
ID3707047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp853870 
End bp855108 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content57% 
IMG OID637737283 
Productallantoate amidohydrolase 
Protein accessionYP_342824 
Protein GI77164299 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACGC CTCTTAGGGT AAATTTTAAG CGGCTGCAAG CGGATGTGGA AACCCTGGCC 
CATATCGGCC GGCGGGCCGA TTACGGTCTT TATCGCATGG CCTTTAGCAA AGGCGATCAG
GCAGCCCGTG AATGGTTCCA AGAGCGCATT CATGAGGCAG GGCTGGATCT TTATATAGAC
GGCGCCGCCA ACATTCATGC CCGCTTCAAC TGGAACGGAG AACGCCCCAG CGTCATGACC
GGCTCCCATC TCGATACCGT CCCTGGTGCT GGCCACCTGG ATGGAGCCCT GGGCGTATTA
GTCGGGCTTG AATGTTTGCG CCGCTTCAAA GAACTCGACC TCTCTTTACG CTATGCAGTA
GAGGCGATTG CCTTTACCGA TGAGGAAGGA CGCTTTGGCG GCCTGTTGGG ATCCCAGGCT
ATCAGCGGCC GCCTCACCCC GGAAGCCATC CATAATGCCC GCGACTTGGA CGGAATCAGC
CTCTCCCAGG CCATGACCGC CCAGGGACTA AATCCCGCGG ACATCCTGCG AGCAAGGCGC
AAACCAGAAA GTCTCATCGC CTTTTTGGAA CTCCACATTG AACAAGGTCC CATCCTTGAG
CGGCAAGGCG TTAGCGTGGG AGTCGTCGAA GGAATCGTGG GCCTGTTCAA ATGGGAAGTC
ACCCTTAAGG GCACCGCCAA CCATGCCGGC ACCACACCTA TGGATATGCG CCAGGATGCC
TTGCAAGGTC TGGCCGAATT CGCAGGAGAA ATTACCCGAG TTCTGGAAGA AAATGGCGGT
CCCCGCAGCG TGGCCACTAT CGGCCGGGTA GAGGTTTTTC CTGGCGCTGC AAATGTAATC
CCAGGAAGCG TCAAGTTTTC TCTGGATGTG CGGGATACCG AGGCAATCAT TCTCAAGGAT
TTGACCCACG CCTTCCGCCT CGCCCTCTCG GCAATCGCCC GCCGCCGCGG GCTCATGTTC
GAATTTGAAG TGTTGAGCGA AATTGAACCG GTTAAGTGCG ATCCTGGCAT CATGGAGACC
ATCTTTAATG CGGCCCGGAG CCTCGGGGTA GAGCCTTTGC AAATGCCAAG CGGAGCCGCC
CATGACACCC AAATCATGGC AACCCTGACC CGGGCAGGCA TGATTTTCGT TCCTAGCCAA
GGAGGGCGCA GCCATTCTCC AGCGGAATGG ACTCCCTGGG AAGACATTGA AACGGGCGCA
AACGTGGCCT TGAATACGCT CTATCAATTA GCCCATTAA
 
Protein sequence
MKTPLRVNFK RLQADVETLA HIGRRADYGL YRMAFSKGDQ AAREWFQERI HEAGLDLYID 
GAANIHARFN WNGERPSVMT GSHLDTVPGA GHLDGALGVL VGLECLRRFK ELDLSLRYAV
EAIAFTDEEG RFGGLLGSQA ISGRLTPEAI HNARDLDGIS LSQAMTAQGL NPADILRARR
KPESLIAFLE LHIEQGPILE RQGVSVGVVE GIVGLFKWEV TLKGTANHAG TTPMDMRQDA
LQGLAEFAGE ITRVLEENGG PRSVATIGRV EVFPGAANVI PGSVKFSLDV RDTEAIILKD
LTHAFRLALS AIARRRGLMF EFEVLSEIEP VKCDPGIMET IFNAARSLGV EPLQMPSGAA
HDTQIMATLT RAGMIFVPSQ GGRSHSPAEW TPWEDIETGA NVALNTLYQL AH