Gene RPB_4254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4254 
Symbol 
ID3912067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4840376 
End bp4841473 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content66% 
IMG OID637886159 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_487853 
Protein GI86751357 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGTC CCGTGCCGAA GCCCGGCATT CTCGATATTG CGCCCTACAC CCCCGGCAAG 
AGCCCCGTGC CCGAAGCCGG CCGCAAGGTG TTCAAGCTCT CGGCCAACGA AACCCCGTTC
GGCCCGTCGC CGCACGCGAT CGCGGCCTAT AAGAGCGCGG CGGATCATCT CGAGGATTAT
CCGGAGGGCA CCTCGCGGGT GCTGCGCGAG GCGATCGGCC GTGCCTACGG CCTCGACCCC
GACCGCATCA TCTGCGGCGC CGGCTCCGAC GAAATCCTGA ACCTGTTGGC GCACACTTAT
CTCGGCCCCG GCGACGAGGC GATCTCGTCG CAGCACGGCT TCCTGGTCTA TCCGATCGCC
ACGCTGGCGA ACGGCGCCAC CAACGTGGTC GCGCCGGAAA AGGACCTGAC GACCGACGTC
GACGCGATCC TCAGCAAGGT CACGCCGAAC ACCAAGCTGG TGTGGCTCGC CAACCCGAAC
AACCCGACCG GGACCTATAT TCCGTTCGAC GAGGTCAAGC GGCTGCGCGC CGGCCTGCCC
TCGCACGTCG TGCTGGTCCT CGACGCCGCC TATGCCGACT ACGTCTCGAA GAACGACTAC
GAGATCGGCA TCGAACTGGT CTCGACCACC GACAACACCG TGCTGACCCA CACCTTCTCC
AAGGTGCACG GCCTCGCGAG CTTGCGGATC GGCTGGATGT TCGGCCCGGC GAATATTGTG
GACGCGGTCA ACCGCATCCG CGGTCCGTTC AACACGTCGA TCCCGGCGCA GCTCGCCGCG
GTCGCGGCGA TCCAGGACAC CGCGCATGTC GACATGTCGC GCGTCCACAC CGAGAAGTGG
CGCGACCGAC TGACCGAGGA GTTCACCAAA CTCGGCCTGA CAGTGACGCC GAGCGTCTGC
AATTTCGTGC TGATGCATTT CCCGACCACG GCGGGCAAGA CCGCGGCGGA TGCCGACGCG
TTCCTGACCA AGCGCGGTCT CGTGCTGCGC GCGCTCGGCA ATTACAAGCT GCCGCACGCG
CTGCGCATGA CCATCGGCAC CGACGAGGCC AACGAGCTGG TAATCGCGGC GCTGACCGAG
TTCATGGCGA AGCCATGA
 
Protein sequence
MSRPVPKPGI LDIAPYTPGK SPVPEAGRKV FKLSANETPF GPSPHAIAAY KSAADHLEDY 
PEGTSRVLRE AIGRAYGLDP DRIICGAGSD EILNLLAHTY LGPGDEAISS QHGFLVYPIA
TLANGATNVV APEKDLTTDV DAILSKVTPN TKLVWLANPN NPTGTYIPFD EVKRLRAGLP
SHVVLVLDAA YADYVSKNDY EIGIELVSTT DNTVLTHTFS KVHGLASLRI GWMFGPANIV
DAVNRIRGPF NTSIPAQLAA VAAIQDTAHV DMSRVHTEKW RDRLTEEFTK LGLTVTPSVC
NFVLMHFPTT AGKTAADADA FLTKRGLVLR ALGNYKLPHA LRMTIGTDEA NELVIAALTE
FMAKP