Gene Noc_1048 details

Gene Information

Locus tag: Noc_1048
Symbol:
ID: 3707231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1154958
End bp: 1156526
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 50%
IMG OID: 637737553
Product: IMP cyclohydrolase
Protein accession: YP_343086
Protein GI: 77164561
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
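
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the protein length excludes the stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1154958
end_bp = 1156526

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should contain a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1569 0 522
```

Under these assumptions the computed values agree with the reported Gene Length (1569 bp) and Protein Length (522 aa).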


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.144916
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCCA TAGCCCGCGC CCTTATCAGT GTCTCCGACA AAACCGGCAT CGTTCCCTTT 
GCACACCGGC TCCAGGCGCG GGGTGTCAAA ATCTTATCCA CTGGTGGCAG CGCCCAATTA
CTCCAAAAAA ACAACATTGG AGCCACAGAA ATATCCACCT ATACCGGTTT CCCAGAAATG
ATGGGGGGAC GCATTAAAAC ACTCCACCCC AAAATCCATG GAGGCATTCT AGGACGAAGG
GAAACCGATG CCACCACTAT GGCGGAATAT AATATTGCCC CCATCGATCT GGTAGCCGTT
AATCTTTATC CCTTCGAACA AACGGTGGCA AAACCAGATT GCGACTTGGC TACCGCTATC
GAGAACATCG ATATTGGCGG ACCCACCCTG CTCCGAGCTG CGGCTAAAAA CCATGCGGCG
GTGACCGTAA TCGTCGATCC AGACGACTAC GAAAGAGTCC TCCGTGAAAT GGAAGCAAAC
GGCGGAGCCC TCTCATCCTC TACCCGTTTT GAATTGGCGG TGAAAAGCTT TGAACACACT
GCACGCTACG ATGCCACCAT CGCTAATTAC CTAGGAGCGT TGACTCCAAA TGGCGAAAAG
AGCGCGTTTC CCCGCAGCTA CAATATTCAA TTTGCCAAAA AGCAAGAAAT GCGTTACGGA
GAGAATCCTC ATCAGCGGGC GGCCTTCTAT GTAGAACAAC CCCCTCCAGC GGGAACCATT
GCCACCGCCC AACAATTACA AGGCAAGACA CTCTCCTTTA ACAATATTGC GGATACAGAT
GCCGCTTTAG CATGTGTTAA AGCTTTTCGC GAGGCTCCTA CCTGCGTCAT CGTAAAACAC
GCCAATCCTT GTGGAGTGGC TACAGGAATA AACCTACAAG AAGCTTATGA ACGAGCCTAC
GCCGCTGATC CAGTTTCCGC TTTTGGTGGC ATTATTGCTT TCAACGAGCC GTTAGATCCA
ACTACTGCAA AAACCATCAT CAAGCGCCAA TTCGCCGAAG TGATTATTGC CCCTGCAGTA
ACAACAACGG CCCAAGAAAT ACTCACATCC AAACCTAATA TACGAGTCTT GGCTTGTGGT
GAATGGTCAT CTCAAACTGC CGCCGGTTGG GACTATAAGC GAATTGTCGG CGGCTTACTA
CTCCAGGACC AGGATACCGA TACAGTGCCT TTAGAGGCGC TTCAAACTGT TACGGAACGC
TCCCCTACTC CCCAGGAATT AAAGGATCTC CTCTTTGCCT GGCAGGTGGT AAAATTCGTC
AAATCCAATG CTATCGTCTA TGCCAAGAAT GGACGGACCA TAGGCGTGGG CGCGGGGCAA
ACAAGCCGGG TGATGAGTAG TCAAATTGCG GAACTTAAAG CGAAAGAGGC AGGTTTTTCA
ACCCAAAATG CGGTCCTGGC CTCAGATGCT TTTTTTCCCT TTCGGGATGG CCTGGAAGCC
GCTGCTAAAG CAGGAATCTG TGCTGTTATC CAGCCTGGGG GTTCCAGGCG GGATAAGGAA
GTCATTGCCG CTGCAAATGA ATGGGATATG GCAATGCTCT TTACTGGAAT GCGCCATTTC
CGCCACTAA
 
Protein sequence
MKPIARALIS VSDKTGIVPF AHRLQARGVK ILSTGGSAQL LQKNNIGATE ISTYTGFPEM 
MGGRIKTLHP KIHGGILGRR ETDATTMAEY NIAPIDLVAV NLYPFEQTVA KPDCDLATAI
ENIDIGGPTL LRAAAKNHAA VTVIVDPDDY ERVLREMEAN GGALSSSTRF ELAVKSFEHT
ARYDATIANY LGALTPNGEK SAFPRSYNIQ FAKKQEMRYG ENPHQRAAFY VEQPPPAGTI
ATAQQLQGKT LSFNNIADTD AALACVKAFR EAPTCVIVKH ANPCGVATGI NLQEAYERAY
AADPVSAFGG IIAFNEPLDP TTAKTIIKRQ FAEVIIAPAV TTTAQEILTS KPNIRVLACG
EWSSQTAAGW DYKRIVGGLL LQDQDTDTVP LEALQTVTER SPTPQELKDL LFAWQVVKFV
KSNAIVYAKN GRTIGVGAGQ TSRVMSSQIA ELKAKEAGFS TQNAVLASDA FFPFRDGLEA
AAKAGICAVI QPGGSRRDKE VIAAANEWDM AMLFTGMRHF RH
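
The gene and protein sequences above are displayed in 10-character blocks. The following is a minimal sketch of how such a listing could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard genetic code, which assigns the same amino acids to codons as translation table 11; table 11's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Standard genetic code in NCBI order (T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Drop the spaces and line breaks used for 10-character display blocks."""
    return "".join(seq_text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First display line of the gene sequence in this record.
first_line = clean(
    "ATGAAACCCA TAGCCCGCGC CCTTATCAGT GTCTCCGACA AAACCGGCAT CGTTCCCTTT"
)
print(translate(first_line))  # MKPIARALISVSDKTGIVPF
```

The printed residues match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_content` would give a fraction to compare against the reported GC content.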