Gene RPD_1257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1257 
Symbol 
ID4021734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1419930 
End bp1420970 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID637961450 
Productaminotransferase, class I and II 
Protein accessionYP_568396 
Protein GI91975737 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.644499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGGCAG CGGCGGCTGC GACGGCGCTC CGCATTCACG GCGGCCGCGT CGATCTCGCA 
GCAAGCGCCT ATCCTGACGC GCCGCAGCCC TGGATCGATC TTTCGACCGG CATCAATCCG
ACCGCCTATC CGATCCCGAC GCTCGCAGCA GCGGCCTTTG CACGGCTGCC GCTGACGACG
GAACTCGACG AATTATGCGC GGCCGCGGCC GAAGCCTATG GGCTGCCCGG CGGCGCAGTG
GTGCTTCCCG CGCCGGGCAG CGAGATCGCA ATCCGGCTGG CGCCGCTCGT GCTGACGCCG
CCACAGCCAT CGATCGCGCA AGTCGGCATC CTGGGACCGA CCTATGGCTC GCACGCCGCC
GCCTGGCGCG CGGCAGGCGC GCAGGTGCAT GAGCTCGGCG CCCTGCCCGA TCCGCAGGCG
CACTTCGATG TCGTGGTGCT CGTCAACCCG AACAATCCGG ATGGACACCT GATCGCGCCG
GAGCCACTTG CGGACTTTGC CGAGTGCTGG ACCGCATCGG GCAAACGCCT GGTGATCGAC
GAAGCGTTCG GCGACCTCAG GCCGGAATTG TCGGTGCTTG GCGGCGCGGC GTTGCCGCGC
GGGGTGGTGG TGCTGCGGTC GCTCGGAAAA TTCTTCGGGC TGGCAGGACT GCGGCTCGGC
TTCGTGGTGG TCAATGCCGG CGATGCGCCG CGATGGCGTG ATCTGCTCGG CGACTGGGCC
GCGTCCGGAC CGGCCTGCAC GATCGCAACC GCTGCGCTGC GCGACAAGGC ATGGATCGCG
GCGACGCGCA GCCGGCTTGC CGCCGACCGC GGCCGGCTCG ACGCAACACT CGCCGCAGCG
CAGCTCGAAC CGCGCGGCGG AACCGATCTG TTCGGGTTCT ACGAAAGCGC GGACGACGGC
GATCTGGTCG ACCGGTTCGC GCGCGCCGGC ATCTTGATCC GCGGCTTCGA TCACAGCCCA
CGGCTTTATC GCTTCGGCCT GCCCGCGGAC GAGCCCGCCT GGCAACGGCT GCAGCGGATC
TGCGATACTG TTGCGGGCTA G
 
Protein sequence
MTAAAAATAL RIHGGRVDLA ASAYPDAPQP WIDLSTGINP TAYPIPTLAA AAFARLPLTT 
ELDELCAAAA EAYGLPGGAV VLPAPGSEIA IRLAPLVLTP PQPSIAQVGI LGPTYGSHAA
AWRAAGAQVH ELGALPDPQA HFDVVVLVNP NNPDGHLIAP EPLADFAECW TASGKRLVID
EAFGDLRPEL SVLGGAALPR GVVVLRSLGK FFGLAGLRLG FVVVNAGDAP RWRDLLGDWA
ASGPACTIAT AALRDKAWIA ATRSRLAADR GRLDATLAAA QLEPRGGTDL FGFYESADDG
DLVDRFARAG ILIRGFDHSP RLYRFGLPAD EPAWQRLQRI CDTVAG