Gene RPD_4101 details


Gene Information

Locus tag: RPD_4101
Symbol:
ID: 4024623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4565160
End bp: 4566257
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 66%
IMG OID: 637964309
Product: histidinol-phosphate aminotransferase
Protein accession: YP_571221
Protein GI: 91978562
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
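
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be checked with a minimal Python sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; neither is stated in the record, though both agree with the listed values. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(4565160, 4566257)
print(gene_len)                  # 1098
print(protein_length(gene_len))  # 365
```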


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.156349
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGTC CCGTGCCGAA TCCCGGCATT CTCGATATTG CGCCCTACAC CCCCGGCAAG 
AGCCCAGTCG CCGAGCCCGG CCGCAAGGTG TTCAAGCTCT CCGCCAACGA AACGCCGTTC
GGCCCGTCGC CGCATGCGAT CGCGGCCTAT AAGAGCGCGG CGGATCATCT CGAGGATTAT
CCGGAAGGCA CCTCGCGGGT GCTGCGCGAG GCGATCGGCC GCGCCTACGG GCTCGACCCC
GACCGCATCA TCTGCGGCGC CGGCTCCGAC GAAATCCTCA ATCTGCTGGC GCACACTTAT
CTCGGGCCGG ACGACGAGGC GATCTCGACC ACCCACGGCT TCCTGGTCTA CCCGATCGCC
ACGCTGGCGA ACGGCGCCAG AAACGTGGTC GCCGAGGAAA AGGATCTGAC CTGCAACGTC
GACGCGATCC TCGCCAAGGT GTCGCCGAAG ACCAAGATCG TCTGGCTCGC CAACCCGAAC
AACCCGACCG GGACCTACAT TCCGTTCGAC GAGGTGAAGC GGCTGCGCGC GGGCCTGCCC
GGCCATGTCG TGCTGGTGCT GGACGCGGCC TATGCCGACT ACGTCTCGCG CAACGACTAC
GAGATCGGGA TCGAGCTGGT GGCGACCACC GACAATACGG TGATGACCCA CACCTTCTCC
AAGATCCACG GTCTGGCCGC GCTGCGGATC GGCTGGATGT TCGGCCCGGC CAACATTGTC
GACGCCGTCA ACCGCATCCG CGGGCCGTTC AACGTCTCGG TGCCGGCGCA GCTCGCTGCG
GTCGCCGCGA TCCAGGACAG CGCGCATGTC GAAAAGTCGC GCACCCACAC CGAGCAGTGG
CGCAACCGAC TGACCGAGGA ACTCACCAAG ATCGGCCTGA CGGTGACGCC GAGCGTCTGC
AACTTCGTGC TGATGCATTT CCCGACCACC AAGGGCAAGA CCGCGGCGGA AGCCGACGCG
TTCCTGACCA AGCGTGGGCT GGTGCTGCGT GCGCTCGGCA ACTACAATTT GCCGCACGCG
CTGCGCATGA CGATCGGCAC CGACGAGGCC AACGAGCTGG TGATCGAAGG GCTGCGCGAG
TTCATGGCGC AGCCATGA
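
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. The sketch below is one way to do it, assuming GC content means the percentage of G and C bases over all bases; `gc_percent` is a hypothetical helper, and only the first row of the gene sequence is used here for brevity.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and line breaks of the layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence; paste the full sequence to check the whole gene.
first_row = "ATGTCCCGTC CCGTGCCGAA TCCCGGCATT CTCGATATTG CGCCCTACAC CCCCGGCAAG"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```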
 
Protein sequence
MSRPVPNPGI LDIAPYTPGK SPVAEPGRKV FKLSANETPF GPSPHAIAAY KSAADHLEDY 
PEGTSRVLRE AIGRAYGLDP DRIICGAGSD EILNLLAHTY LGPDDEAIST THGFLVYPIA
TLANGARNVV AEEKDLTCNV DAILAKVSPK TKIVWLANPN NPTGTYIPFD EVKRLRAGLP
GHVVLVLDAA YADYVSRNDY EIGIELVATT DNTVMTHTFS KIHGLAALRI GWMFGPANIV
DAVNRIRGPF NVSVPAQLAA VAAIQDSAHV EKSRTHTEQW RNRLTEELTK IGLTVTPSVC
NFVLMHFPTT KGKTAAEADA FLTKRGLVLR ALGNYNLPHA LRMTIGTDEA NELVIEGLRE
FMAQP
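
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names are hypothetical, and only the first and last rows of the gene sequence are translated; both rows begin on a codon boundary because every full row holds 60 bases.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 until a stop codon; whitespace in the layout is ignored."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGTCCCGTC CCGTGCCGAA TCCCGGCATT CTCGATATTG CGCCCTACAC CCCCGGCAAG"
last_row = "TTCATGGCGC AGCCATGA"
print(translate(first_row))  # MSRPVPNPGILDIAPYTPGK
print(translate(last_row))   # FMAQP
```

The two outputs match the first 20 and last 5 residues of the protein sequence listed above.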