Gene RPC_2817 details

Gene Information

Locus tag: RPC_2817
Symbol:
ID: 3970084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3059153
End bp: 3060373
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 65%
IMG OID: 637925929
Product: aminotransferase
Protein accession: YP_532684
Protein GI: 90424314
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
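
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. Neither is stated in the record, but both are consistent with the numbers shown. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3059153, 3060373)
print(length)                           # 1221
print(expected_protein_length(length))  # 406
```

Both results match the Gene Length (1221 bp) and Protein Length (406 aa) fields of this record.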


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.547863
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAAGAAT TTTACCGCAT CCGCCGTTTG CCGCCCTATG TGTTCGAACA GGTCAATCGG 
GCCAAGGCTG CTGCGCGCAA CGCCGGTGCG GACATCATCG ACCTCGGGAT GGGCAATCCC
GACCTGCCGG CGCCGGCGCA TGTCATTGAA AAGCTTAAAG AGACCCTCGG CAAACCGCGC
ACCGACCGGT ATTCGGCGTC GCGCGGCATC ACCGGCTTGC GCAAGGCCCA GGCCGCCTAT
TACGAGCGCC GCTTCGGCGT CAAGCTGAAC CCCGACACCC AGGTGGTGGC GACGCTCGGC
TCCAAGGAAG GCTTTGCCAA CGTCGCCCAA GCGATTACCG CTCCTGGCGA CGTCGTGCTG
TGCCCGAACC CGAGCTATCC GATCCACGCC TTCGGCTTCC TGATGGCCGG CGGTGTGATC
CGCTCGGTGC CGTCCGAGCC GACGCCGCAA TTCTTCGAGG CTTGCGAGCG CGCGATCATC
CATTCGATTC CGAAGCCGAT CGCGATGATC GTCTGCTATC CGTCGAACCC GACCGCCTAT
GTGGCGAGCC TGGATTTCTA CAAGGATCTG GTGGCGTTCG CGAAGAAGCA CGAGATCTAT
ATTCTGTCGG ATCTGGCCTA CGCCGAAGTG TATTTCGACG AGGCCAACCC GCCGCCCTCG
GTGCTGCAGG TGCCGGGCGC GATGGACGTC ACCGTCGAAT TCACCTCGAT GTCGAAGACC
TTCTCGATGG CTGGCTGGCG GATGGGCTTT GCGGTCGGCA ACGAGCGCAT CATCGCCGCT
TTGGCGCGGG TGAAGTCCTA TCTCGATTAC GGCGCCTTCA CCCCGGTGCA GGTCGCCGCC
ACCGCGGCGC TGAACGGCCC CGACGACTGC ATCAAGGAGA TGCGCGACAC CTACCGCAAG
CGCCGCGACG CGCTGGTCGA GAGTTTTGGC CGCGCCGGCT GGGAGATTCC GCCGCCGCAG
GCCTCGATGT TCGCCTGGGC GCGGCTGCCG CCGGCCTTCA AGGAGGTCGG CTCGATGCAA
TTCGCCACCT TGATGGTGGA GAAATCCGGC GTCGTGGTGT CGCCTGGCGT CGCCTTCGGC
GAGCACGGCG AGGGCTTCGT GCGCATCGCC ATGGTGGAAA ACGAGCAGCG GATCCGCCAG
GCCGCCCGCG GCGTGCGCCG CTTCCTTGAA ACCGGCATTG AAACGTTGCA CAACGTCGTT
CCACTCGCCA CCCGGCGATA G
 
Protein sequence
MEEFYRIRRL PPYVFEQVNR AKAAARNAGA DIIDLGMGNP DLPAPAHVIE KLKETLGKPR 
TDRYSASRGI TGLRKAQAAY YERRFGVKLN PDTQVVATLG SKEGFANVAQ AITAPGDVVL
CPNPSYPIHA FGFLMAGGVI RSVPSEPTPQ FFEACERAII HSIPKPIAMI VCYPSNPTAY
VASLDFYKDL VAFAKKHEIY ILSDLAYAEV YFDEANPPPS VLQVPGAMDV TVEFTSMSKT
FSMAGWRMGF AVGNERIIAA LARVKSYLDY GAFTPVQVAA TAALNGPDDC IKEMRDTYRK
RRDALVESFG RAGWEIPPPQ ASMFAWARLP PAFKEVGSMQ FATLMVEKSG VVVSPGVAFG
EHGEGFVRIA MVENEQRIRQ AARGVRRFLE TGIETLHNVV PLATRR
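
The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the following Python shows one way to strip that formatting, compute a GC fraction, and translate the gene. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons (this gene starts with ATG). The helper names are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 shares
# these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Join space- and newline-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in this record.
first_line = "ATGGAAGAAT TTTACCGCAT CCGCCGTTTG CCGCCCTATG TGTTCGAACA GGTCAATCGG"
dna = clean_sequence(first_line)
print(translate(dna))  # MEEFYRIRRLPPYVFEQVNR
```

The translated fragment matches the first twenty residues of the protein sequence shown in this record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.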