Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4254 |
Symbol | |
ID | 3912067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4840376 |
End bp | 4841473 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637886159 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_487853 |
Protein GI | 86751357 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGTC CCGTGCCGAA GCCCGGCATT CTCGATATTG CGCCCTACAC CCCCGGCAAG AGCCCCGTGC CCGAAGCCGG CCGCAAGGTG TTCAAGCTCT CGGCCAACGA AACCCCGTTC GGCCCGTCGC CGCACGCGAT CGCGGCCTAT AAGAGCGCGG CGGATCATCT CGAGGATTAT CCGGAGGGCA CCTCGCGGGT GCTGCGCGAG GCGATCGGCC GTGCCTACGG CCTCGACCCC GACCGCATCA TCTGCGGCGC CGGCTCCGAC GAAATCCTGA ACCTGTTGGC GCACACTTAT CTCGGCCCCG GCGACGAGGC GATCTCGTCG CAGCACGGCT TCCTGGTCTA TCCGATCGCC ACGCTGGCGA ACGGCGCCAC CAACGTGGTC GCGCCGGAAA AGGACCTGAC GACCGACGTC GACGCGATCC TCAGCAAGGT CACGCCGAAC ACCAAGCTGG TGTGGCTCGC CAACCCGAAC AACCCGACCG GGACCTATAT TCCGTTCGAC GAGGTCAAGC GGCTGCGCGC CGGCCTGCCC TCGCACGTCG TGCTGGTCCT CGACGCCGCC TATGCCGACT ACGTCTCGAA GAACGACTAC GAGATCGGCA TCGAACTGGT CTCGACCACC GACAACACCG TGCTGACCCA CACCTTCTCC AAGGTGCACG GCCTCGCGAG CTTGCGGATC GGCTGGATGT TCGGCCCGGC GAATATTGTG GACGCGGTCA ACCGCATCCG CGGTCCGTTC AACACGTCGA TCCCGGCGCA GCTCGCCGCG GTCGCGGCGA TCCAGGACAC CGCGCATGTC GACATGTCGC GCGTCCACAC CGAGAAGTGG CGCGACCGAC TGACCGAGGA GTTCACCAAA CTCGGCCTGA CAGTGACGCC GAGCGTCTGC AATTTCGTGC TGATGCATTT CCCGACCACG GCGGGCAAGA CCGCGGCGGA TGCCGACGCG TTCCTGACCA AGCGCGGTCT CGTGCTGCGC GCGCTCGGCA ATTACAAGCT GCCGCACGCG CTGCGCATGA CCATCGGCAC CGACGAGGCC AACGAGCTGG TAATCGCGGC GCTGACCGAG TTCATGGCGA AGCCATGA
|
Protein sequence | MSRPVPKPGI LDIAPYTPGK SPVPEAGRKV FKLSANETPF GPSPHAIAAY KSAADHLEDY PEGTSRVLRE AIGRAYGLDP DRIICGAGSD EILNLLAHTY LGPGDEAISS QHGFLVYPIA TLANGATNVV APEKDLTTDV DAILSKVTPN TKLVWLANPN NPTGTYIPFD EVKRLRAGLP SHVVLVLDAA YADYVSKNDY EIGIELVSTT DNTVLTHTFS KVHGLASLRI GWMFGPANIV DAVNRIRGPF NTSIPAQLAA VAAIQDTAHV DMSRVHTEKW RDRLTEEFTK LGLTVTPSVC NFVLMHFPTT AGKTAADADA FLTKRGLVLR ALGNYKLPHA LRMTIGTDEA NELVIAALTE FMAKP
|
| |