Gene Noc_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1047 
Symbol 
ID3707230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1153766 
End bp1154797 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content53% 
IMG OID637737552 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_343085 
Protein GI77164560 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.545554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGAG CAGCTATCGT AGGTGGAACG GGATACACAG GCGTTGAACT ACTACGTCTA 
CTGGCCAATC ATCCCAATGT TGAAATCGTA GCCATCACCT CACGCACTGA GGCAGGCAGG
CCGGTGAGCA AGCTATTTCC AAACCTTCGA GGCTATTTGG ATATTTGCTT TACAGAACCT
GAGCCCGCCC AGCTAGCCGC CGAATGCGAC GTAGTTTTCT TTGCAACTCC CCATGGAGTC
GCCATGGATA TGGTGCCCGC CCTGCTCGCA CAAAATACTC GCGTCATTGA TTTATCTGCC
GATTTCCGCC TTGCTGATCC TACGATATGG GAGCAATGGT ATGGCCGTCC TCACGCTGCA
CCCCATTTAT TGGCTGAAGC GGTTTATGGA CTCCCTGAAA TCAATCGGGA AGCGATTCGC
CAAGCTCGCC TAATCGCCTG TCCGGGCTGC TACCCCACTG CGGTCCAGCT TGGATTTCTC
CCCTTGCTAG AGCATCAACT AGTTGACCCT AGCCGGCTTA TCGCCGATGC GAAATCAGGC
GCCAGCGGAG CAGGCCGCAA AGCAGCCTTA GGAACCCTCC TTTGCGAGGC AGGTGAAAAT
TTTAAAGCCT ATAGCGTCAG CGGACACCGG CATCTACCTG AAATCATTCA AGGACTTCAA
TGGGCCAGCC GATCCTCCGT AGACTTGACC TTTGTTCCCC ACCTTATCCC CATGATTCGG
GGAATTCATG CAACCCTCTA CGCCCAGCTT GAGCATGAGG TTGATCTCCA AGAACTTTAT
GAGCAACGTT ATGCCCCGGA GCCCTTTGTG GATGTATTGC CGCCGGGAAG CCACCCAGAA
ACCCGCAGCG TCCGGGGAAA CAATATGTGC CGCCTCGCTA TCCACCGCCC ATCGGCAGGC
AACACTGTAA TTGTGCTCTC AGTAACCGAT AACCTAATAA AAGGCGCCTC CGGCCAGGCA
ATACAAAATA TGAACCTTAT GTTCGGTCAA GAAGAAACAC GCGGGTTAAT GCACATTGCT
GTCATACCTT GA
 
Protein sequence
MIRAAIVGGT GYTGVELLRL LANHPNVEIV AITSRTEAGR PVSKLFPNLR GYLDICFTEP 
EPAQLAAECD VVFFATPHGV AMDMVPALLA QNTRVIDLSA DFRLADPTIW EQWYGRPHAA
PHLLAEAVYG LPEINREAIR QARLIACPGC YPTAVQLGFL PLLEHQLVDP SRLIADAKSG
ASGAGRKAAL GTLLCEAGEN FKAYSVSGHR HLPEIIQGLQ WASRSSVDLT FVPHLIPMIR
GIHATLYAQL EHEVDLQELY EQRYAPEPFV DVLPPGSHPE TRSVRGNNMC RLAIHRPSAG
NTVIVLSVTD NLIKGASGQA IQNMNLMFGQ EETRGLMHIA VIP