Gene RPD_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0445 
Symbol 
ID4020911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp513953 
End bp515191 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content69% 
IMG OID637960630 
Productaminotransferase, class I and II 
Protein accessionYP_567584 
Protein GI91974925 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.335861 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGC CCGCCTCTTC CGCTTCGGCG CGCGCTGCCG GCAGCCCTTC GGCTGCCGGC 
CATAGCGAGC GCTCGCCATT CGCGCGTCTG ACCGAATTGC TGGCGCCGTA TCAGCCCGGC
GAGCGACTGA TCAATCTGTC GCTCGGCGAG CCGCAGCATC CGGTGCCGGA TTTCGTCGGT
CCGGTGCTGG CGCAGCACAT TGCCGATTTC GGCCGCTATC CGATGGCCAA GGGGATCGCG
CCGTTCCGCA ACGCGGCCGC GAGCTGGCTG GCCCAACGCT TCGCGTTGCC CCGCGCGCCC
GATCCGGAGA CCGAAGTGCT GGTGCTGAAC GGCAGCCGCG AAGGCCTGTT CCTCGCCGCG
CTCGCCGCCG CGCGCCATGT CGGCCCGCGC GATGGCACGC CGGCGATCCT GATGCCAAAT
CCGTTCTATC CGGCCTATGC GGCCGGCGCC CGCGCCGCCG GCTGCGAGGC GGTGTTCCTG
CCGACCAATC TGGCCAACGG CTTCCTGCCG GATCTGGAAT CGCTCGACGA GGCGACGCTG
AAGCGCACGG TGGCGATCTA CATCGCCTCG CCCGCGAACC CGCAGGGCTC GGTCGCGTCG
CGCGATTACT TCGCGCGGCT GAAAGCGCTC GCCGATCGCT ACGGCTTCCT GATCCTCGCC
GACGAGTGCT ACTCGGAGAT CTACACCCGC ACTGCGCCCG GCAGCGCGCT CGAGGCGGCC
GGTCCCGACT TCCGCAACGT CGCGGTGTTC CAGTCGCTGT CGAAGCGCTC GAATCTTCCG
GGCATGCGCG TCGGCTTCGT CGCCGGCGAC GCGGACTTCC TCAATGCGTT CCACGAACTA
CGCAATGTCG CCGCGCCGCA GGTGCCGGTG GCGCTGCAGC ACGTCGCGGT CGCCGCCTAT
GGCGACGAAG CCCATGTCGA GGAAAACCGC CGGCTGTATC GGCTGAAGTT CGATCTCGCC
GATCAGATCC TCGGTGAGCG TTTCGGCTAT CAGCGCCCCG CCGGCGGCTT CTGCCTGTGG
CTCGACGTCT CCGCTCATGG CGGCGACGAG GCCGCCACCG TGAAGCTGTT CAGGGAGGGA
GGCGTCCGCG TCATCCCAGG CAGTTATCTG GCGCGTCCGC AGCCCGACGG TAGCAATCCG
GGCGCCGGCT ACATCCGGCT GGCGATGGTT CAGGACAGTG AGAGTACCGC CGAAGCGCTG
CACCGGCTGG TGCGGATTCT CGATGAGTCG CGAGGCTGA
 
Protein sequence
MALPASSASA RAAGSPSAAG HSERSPFARL TELLAPYQPG ERLINLSLGE PQHPVPDFVG 
PVLAQHIADF GRYPMAKGIA PFRNAAASWL AQRFALPRAP DPETEVLVLN GSREGLFLAA
LAAARHVGPR DGTPAILMPN PFYPAYAAGA RAAGCEAVFL PTNLANGFLP DLESLDEATL
KRTVAIYIAS PANPQGSVAS RDYFARLKAL ADRYGFLILA DECYSEIYTR TAPGSALEAA
GPDFRNVAVF QSLSKRSNLP GMRVGFVAGD ADFLNAFHEL RNVAAPQVPV ALQHVAVAAY
GDEAHVEENR RLYRLKFDLA DQILGERFGY QRPAGGFCLW LDVSAHGGDE AATVKLFREG
GVRVIPGSYL ARPQPDGSNP GAGYIRLAMV QDSESTAEAL HRLVRILDES RG