Gene Information |
Locus tag | Noc_0175 |
Symbol | |
ID | 3706208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 194683 |
End bp | 195795 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637736692 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_342238 |
Protein GI | 77163713 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
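
The start, end, gene length, and protein length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(194683, 195795)
print(length, protein_length(length))  # prints: 1113 370
```

Both results match the Gene Length and Protein Length fields of the record.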
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.645733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAA GCTTATTTTA TCAGCTTGCG GCGGCAGGGG TGCAAGGACT AACCCCCTAC CAGCCTGGAA AACCTATTGA GGAATTGGAG CGGGAGTATG GGGTGCGGGG TGCGGTTAAG CTGGCCTCTA ACGAAAATCC TCTAGGCCCC AGTCCCATGG CAATAGATGC GATTTACGGG GTGCTTGGAG AAAGCGGCCG TTATCCTGAT GGGAATGGCT TTGCGCTTAA AACTGCTCTT TCTCAATGCT TAGGCATTCC CGCGAATCAG ATTACGCTGG GTAATGGCTC CAGCGATTTG CTGGAGTTTG CGGCTCGGGT GCTAATTTCG CCTGAACATG AAGTGATTTA TTCTCAGTAT TGTTTCGCTC TCTATCCTTT ACTGATCCAG ATTTTGGGAG CTAAGGGCCA CGCCGTGCCG GCGAAGGGTT TTGGCCACGA TCTGGAGGCC ATGGTTAAAG CGGTGAATAG CCAGACCCGG CTGGTTTATA TCGCTAATCC CAATAATCCT ACGGGTACTT GGTTGCACTC TGATGAGCTA GAAGCTTTTT TGGCCGCTCT GCCAGAGCAC GTTTTGGTGG TATTGGATGA GGCTTATTAC GAGTATGTGA ACGAGGCTCA ATACCCTTAT TCTCTGGCCT GGATGAGTCG TTATCCTAAT CTGATGATCA CTCGCACCTT CTCTAAAATT TATGGCTTGG CCGGTTTACG CATAGGTTAT GGGGTGTCCC ATCCAGATTT AGCGGATTTG ATGAATAGAG TTCGCCCTCC CTTTAACGTC AATAGCTTAG CTTTGGCTGC AGCCACGGCT GCTTTGCAGG ATCACGACCA TTTACAGCGT AGTCGAAAGG TAAATCAGGC GGGAATGGCG CAGTTAACGA TGGCTTTTAC TGCCTTGGGC TTGGATTATA TTCCTTCGGT AGCCAACTTT GTGACTGTCG ATGTGAAGCA ATCGGGTGAC AAGGTTTATG AGAATTTGTT GCGGCATGGT GTGATTGTAA GACCAATGAC GGGATATGGG CTACCAAGGC ACGTGCGGGT TACCGTGGGA AGGGAGGAAG AAAATGCGCG TTTTATTCAG GTCCTTGAAA CTGTTCTCGA GGAATTTAGG TGA
|
Protein sequence | MAESLFYQLA AAGVQGLTPY QPGKPIEELE REYGVRGAVK LASNENPLGP SPMAIDAIYG VLGESGRYPD GNGFALKTAL SQCLGIPANQ ITLGNGSSDL LEFAARVLIS PEHEVIYSQY CFALYPLLIQ ILGAKGHAVP AKGFGHDLEA MVKAVNSQTR LVYIANPNNP TGTWLHSDEL EAFLAALPEH VLVVLDEAYY EYVNEAQYPY SLAWMSRYPN LMITRTFSKI YGLAGLRIGY GVSHPDLADL MNRVRPPFNV NSLALAAATA ALQDHDHLQR SRKVNQAGMA QLTMAFTALG LDYIPSVANF VTVDVKQSGD KVYENLLRHG VIVRPMTGYG LPRHVRVTVG REEENARFIQ VLETVLEEFR
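
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, the start-codon check only considers ATG, and the stop codons are those of translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (assumption), recognised stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence field, used only as a short demo input.
fragment = clean_sequence("ATGGCTGAAA GCTTATTTTA")
print(len(fragment), gc_percent(fragment))
```

Passing the full Gene sequence field through `clean_sequence` gives a string that can be compared against the Gene Length and GC content fields of this record.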