Gene Noc_1182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1182 
Symbol 
ID3706756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1289738 
End bp1290814 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content51% 
IMG OID637737685 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_343214 
Protein GI77164689 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.66248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AGAAACTGCC TCCCTCTTCT CATTCACCTT TAAGCTACCG CGACGCTGGG 
GTTGATATCG ATGCCGGTAA TCAACTTATC CAGCGGATAA AACCTGCTGC GAATAGAACC
ACCCGCCCTG GCGTATTAAC TAGCCTAGGA GGATTCGGCA GCTTATTCGA GCTACCCATC
CATCGTTACC AGCATCCCGT ACTGGTGGCG GGCACCGATG GCGTGGGTAC AAAACTCAAA
CTTGCTATTC AGCTCAATCG TCACGATAGC ATTGGGATCG ATCTGGTTGC CATGTGCGCC
AACGACATCA TTGTCCAGGG AGCCGAACCG TTATTTTTTC TGGATTATTA CGCTACAGGC
CGATTAGAAG TTGATATTGC CGCCGAAATC ATCGAAGGCA TTGCCCATGG CTGCGAATTG
GCCGGGGTAG CACTAGTGGG TGGAGAGACG GCGGAGATGC CAGGAATTTA CCAAGCAGGT
GATTATGACC TTGCCGGTTT TTGTGTCGGG GTAGTAGAAA AGGAACGTCT TATTGATGGG
AGTCAGGTAC AGGCTGGCGA TCATCTTATT GGAATCGCCT CTTCCGGGCC TCACGCCAAC
GGCTATTCTC TGATCCGCAA AATTCTGGAA CGCAGCCGCT ACTCACTAAA TAGCCCCTTG
GCGGACCAGA CCCTAGGCGA TGCCTTGTTG ACCCCTACCC GCATCTACGT CAAACCCCTG
CTACAGCTGC TGGAAGTTAT TGAAATACAT GCCTTAGCCC ATATCACTGG GGGCGGGTTG
CCAGAAAATT TACCCCGGGT GCTCCCCCCA ATGCTTAGCG CTGAAATCGA TACGTCCCGC
TGGCCACGCC TGCCTATTTT CGATTGGCTG CAAAGAGAAG GTAATCTCTC TGAGCAGGAG
CTCTATCGTA CTTTTAATTG CGGAATTGGC ATGGTGGTCT GCGTAGATCA AGCAGATACT
GAACAAGCCC TAGAATTTTT AAAAGACAGG GGCGAATCCG CCTGGTTGAT AGGACGAATT
GTTCCTCAAA CAAGTGATGG GCAGAGAGTA GCCTTTAACA CGGGGAACCC CTTGTGA
 
Protein sequence
MSKKKLPPSS HSPLSYRDAG VDIDAGNQLI QRIKPAANRT TRPGVLTSLG GFGSLFELPI 
HRYQHPVLVA GTDGVGTKLK LAIQLNRHDS IGIDLVAMCA NDIIVQGAEP LFFLDYYATG
RLEVDIAAEI IEGIAHGCEL AGVALVGGET AEMPGIYQAG DYDLAGFCVG VVEKERLIDG
SQVQAGDHLI GIASSGPHAN GYSLIRKILE RSRYSLNSPL ADQTLGDALL TPTRIYVKPL
LQLLEVIEIH ALAHITGGGL PENLPRVLPP MLSAEIDTSR WPRLPIFDWL QREGNLSEQE
LYRTFNCGIG MVVCVDQADT EQALEFLKDR GESAWLIGRI VPQTSDGQRV AFNTGNPL