Gene Information |
Locus tag | Sare_4682 |
Symbol | |
ID | 5704309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5304567 |
End bp | 5305670 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641274080 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001539426 |
Protein GI | 159040173 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
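
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the database itself: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length_bp = gene_length(5304567, 5305670)
print(length_bp, protein_length(length_bp))  # prints: 1104 367
```

Both results agree with the Gene Length and Protein Length rows of this record.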
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.120317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.293547 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCGATA CCGGAAAGCA CCAGCCGGCA CCCCGACTCA CCCGCGCTGA TCTGGACGTA CTGCCCAGCT ACGTGCCTGG TCGCAGCCCG GCTGACCTGG CCCGTGAGCT GGGTCTGCCC GAGGCGATCA AGCTGGCCAG CAACGAGGTG CCGTACGGCC CGCTGCCCGG CGTGGTGGCG GCGGTGACGG AGGCCGCTAC GGGCGCCCAC CGGTACCCGG ACATGGGCGT GGTGGCGCTG CGCGACGCCC TTGCCGAGCG GTACGCGGTG GACGTCGCAC GGATCGCCAC CGGCTGCGGC TCGGTGGCGC TGGCCGACCA CCTGGTCCGG GCCACCTGCC TGCCCGGCGA CGAGTTGGTG TACTCCTGGC GTTCGTTCGA GGCGTACCCG ATCATCGCGG CGACCAACGG GGCGACCAGC GTACGGGTCA GCAACGATGC TGGACACGGG CACGACCTGA CCGCAATGGC CGCGGCCGTG ACCGACCGGA CCCGGATGGT CCTGATCTGC AATCCGAACA ATCCGACCGG CACCGCTGTG CGGCGAACCG AACTGGAACG CTTCCTCGAC GTGGTGCCCG ACGACGTGCT GGTGGTCATC GACGAGGCAT ACCGGGAGTT CGTGACCGAC CCGGAGGTGC CGGACGGCCT CGGCTACGCG GACCGGCCCA ACGTGGCGGT GCTGCGGACG CTGTCGAAGG CATGGGGCCT GGCCGGGCTG CGAATCGGCT TCCTGGTCGC CCCGCCGGAG GTGGCCGCGG CCATTCGGAA GGTAGTCACG CCGTTCTCCG CCAGCGTGGC CGCACAGGCC GGCGCGCTCG CCGCGCTCGC CCAGGCCGAC GAGGTGCGGC GACGCTGCGC CCTGGTCGTG GCCGAACGGG ACCGGGTCAC CGAGACGCTA CGCAAGCTGG TCCCGGATTT GCCGAGCAGC CAGGCCAATT TCGTCTGGCT GCCGCTGGGT GACCGGGCGG TGGACTTCGG CAGGGCCTGC GAGTCGCGCG GCGTGATCGT GCGGCCCTTC CCGGGCGACG GAGTCCGGGT CACCATCGGC ACCCCGGCCG AGAACGATGC CTTCCTCGCG GCGGCGGAAG CCGCGCTCGG CTGA
Protein sequence | MTDTGKHQPA PRLTRADLDV LPSYVPGRSP ADLARELGLP EAIKLASNEV PYGPLPGVVA AVTEAATGAH RYPDMGVVAL RDALAERYAV DVARIATGCG SVALADHLVR ATCLPGDELV YSWRSFEAYP IIAATNGATS VRVSNDAGHG HDLTAMAAAV TDRTRMVLIC NPNNPTGTAV RRTELERFLD VVPDDVLVVI DEAYREFVTD PEVPDGLGYA DRPNVAVLRT LSKAWGLAGL RIGFLVAPPE VAAAIRKVVT PFSASVAAQA GALAALAQAD EVRRRCALVV AERDRVTETL RKLVPDLPSS QANFVWLPLG DRAVDFGRAC ESRGVIVRPF PGDGVRVTIG TPAENDAFLA AAEAALG
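
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how that might be done, along with a GC-content calculation and a check for a terminal stop codon. The helper names (`clean_sequence`, `gc_percent`, `ends_with_stop`) are hypothetical, and the stop codon set TAA/TAG/TGA is the one used by translation table 11. How the database rounds its own GC content figure is not stated, so the percentage is returned unrounded.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Only the first two displayed blocks of the gene sequence are used here;
# paste the full "Gene sequence" field to analyse the whole CDS.
head = clean_sequence("ATGACCGATA CCGGAAAGCA")
print(len(head), head[:3], gc_percent(head))
```

Applied to the full gene sequence, `ends_with_stop` should return `True`, since the sequence is 1104 bp long and ends in TGA.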