Gene Noc_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0420 
Symbol 
ID3706591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp459864 
End bp461609 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content55% 
IMG OID637736931 
Productamidohydrolase-like 
Protein accessionYP_342475 
Protein GI77163950 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTA GTATGCGTAT ACCTAGCAAA CTCCTGCTAT CATTGATGGC GGTGACTCTA 
CTCATCACGC TGGCCTTTAG CTCCACCAAG GCAGCGCCTG ATACAGGGAG TAGCACCCTT
TATTATGGCG GCTCGATACT TACCATGGAG GGGAGTGCTC CGAGCTATGC GGAGGCCCTG
GTTGTTAAGG ATGGCCGCAT CCTTTTTGTT GGTACCAAAA CGCAGGCCGA ACATCTAGCA
GGTGCGGCGG CAAAAAAGGT CGATCTCGAC GGCAGGGCGC TGCTGCCGGG ATTCATCGAC
GCTCATGGCC ATGTGTTCAA CGCTGGTTTT CAGAAACTCG CAGCCAATTT GCTACCGCCT
CCAGATGGCG GCGGCAAGGA CGTAGCATCC CTCGTGGCGC TGCTCAAGGA ATGGCAAGAT
AAAAACGCTG CTGCTATTAA GAAATCGGGC TGGATCATTG GCTTCGGCTA TGATGATTCA
CGACTAGCGG AAGGGCGTCA TCCCACAGCA AAGGAACTGG ATCAGGTATC CACCGAGTTG
CCGGTGGTGA TCATCCATCA ATCCGGCCAC TTGGCCGCGA TGAACCATAA GGGACTGGAG
CTGGCTGGAA TCACGGCTGA AACCAATGAC CCGGTTGGTG GTGTGATTCG CCGCCAGGCC
GACGGCAAGA CGCCAAACGG AGTGCTTGAG GAGATGGCCT CCTTCGGGCC GATTTTCAAA
ATCCTGGGAG CTCTCGATTC GGAAGCTAAC GAGAAAATTG CCCTGGCTGG GGTAGATGCC
TACACCCAAC ACGGTTTCAC TACGGCACAG GAAGGTCGGG CCAACAAGGC AGCGACCGAG
ACTTGGCGAA AACTTGCCGA TGAGCGCAAA CTTAAAATCG ATGTCGCAGT CTATCTCGAC
CTCCAATCAG AAATCGAGTA CATCAAGCAA GTCGGAATCC ACGAAGACTA CACTCATCAT
TTTCGTGTGG CGGGTGTAAA GTTGAGCCTA GATGGCTCGC CGCAAGGCAA AACGGCCTGG
CTCAGCAAAC CCTATCTTAA TCCGCCTCCC CAAAAACCGG ATAGTTATAC AGGATACCCG
GCCATTCCGA GCGACAAGGA GAGACAGGCA CTGTTTAACC TTGCCTACCA GAAGAACTGG
CAACTGCTGG TGCATTGTAA CGGCGATGCA GCGGCTGAAG CGATGATTGA TGCTGTAGCC
GTGGCTAGCG AGAAATATGG CAAGGATGAC CGTCGTACCG TGATGATCCA TGCACAGACT
GTCCGCGAAA GCCAACTCGA GCGGATGAAA AAACTTGGCA TCCTGCCATC GTTTTTCTCC
ATGCATACTT ATTATTGGGG TGACTGGCAT CGGGATGAGA CCCTCGGCCC GGAACGAGCA
GCACGTATCT CACCAACCGC CTCGGCCCTG AAGCGCGGCA TGCGATTCAC CGAGCATCAT
GATGCGCCTG TCGCGCTGCC GAGCGCGATC ATGATTCTCC ACACCACCGT CAACCGGACC
AGCCGCAGTG GTGAGGTAAT CGGGCCGGAC CAGCGCGTTA GTCCCTATCA GGCGCTGAAG
TCAATCACCG ACTGGGCGGC CTGGCAATAT TTCGAGCAGG ATAGTAAGGG CACGCTTACG
AAGGGCAAGC TGGCCGATCT CGTGATATTG GACCAAGACC CGACACAGGT TGACCCGGCA
ACGATCATGA ACATACGTGT ATTGGAAACG ATCAAGGAGG GTCAGACCGT CTATAAGGCG
CAGTAA
 
Protein sequence
MKVSMRIPSK LLLSLMAVTL LITLAFSSTK AAPDTGSSTL YYGGSILTME GSAPSYAEAL 
VVKDGRILFV GTKTQAEHLA GAAAKKVDLD GRALLPGFID AHGHVFNAGF QKLAANLLPP
PDGGGKDVAS LVALLKEWQD KNAAAIKKSG WIIGFGYDDS RLAEGRHPTA KELDQVSTEL
PVVIIHQSGH LAAMNHKGLE LAGITAETND PVGGVIRRQA DGKTPNGVLE EMASFGPIFK
ILGALDSEAN EKIALAGVDA YTQHGFTTAQ EGRANKAATE TWRKLADERK LKIDVAVYLD
LQSEIEYIKQ VGIHEDYTHH FRVAGVKLSL DGSPQGKTAW LSKPYLNPPP QKPDSYTGYP
AIPSDKERQA LFNLAYQKNW QLLVHCNGDA AAEAMIDAVA VASEKYGKDD RRTVMIHAQT
VRESQLERMK KLGILPSFFS MHTYYWGDWH RDETLGPERA ARISPTASAL KRGMRFTEHH
DAPVALPSAI MILHTTVNRT SRSGEVIGPD QRVSPYQALK SITDWAAWQY FEQDSKGTLT
KGKLADLVIL DQDPTQVDPA TIMNIRVLET IKEGQTVYKA Q