Gene RPD_1988 details


Gene Information

Locus tag: RPD_1988
Symbol:
ID: 4022470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2222512
End bp: 2223873
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 59%
IMG OID: 637962181
Product: Xaa-Pro aminopeptidase-like
Protein accession: YP_569124
Protein GI: 91976465
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
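
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2222512
end_bp = 2223873

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1362
print(protein_length)  # 453
```

Both results agree with the Gene Length (1362 bp) and Protein Length (453 aa) fields.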


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0384007
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.40781
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAC GAGAATTTTT GGGACTGTCC GCGGCAGCCA TGCTGAGCAA TCTGCCCGTG 
GCCGATGCCA ACAACACAAC TCCAGCTATG GATGCTACTT CATCGAACCT GCACCTAGAC
AAGGCGCCAC GTCTTTCGAT GACCGAACGC GACCGGCGCT GGAAGATGGC GCGCGACATT
ATGAAGGCGG CAGGTGTCGA GGGGCTCATT GTCTATGGAG ACAGGGAATC CTCCGCTCCG
GCCCCCTTCT CTCCCGACAG CTATTTTACC AATGATCGGT TGGGGGCCAT CGTCGTGTTT
AAAGGAGACG AACCACCGGT GGTGCTTGCG TTTGCCCCGA TGGCTGTCTC CGATCACATG
CAGGCCCGTC TGCGAGGCGA CCAACAGTGG CTCGAGCCAA AGCAGATCTT TGTCGGCAAA
AGCGGCGCTC ACCTAGTTTC CCTCCTCGAT CGCATTGGCC TGACAGGAAA GATCGGCGTG
ATCGGCCTGG AACCCTATCC ACCCTTCTAT TTCGACGGGG CCATTCCCTT CGGTATATGG
AATGCCGTCC GGGAAGCCTT TCCAAAGGCC GAGTTCAAGC CGGTATATCG CGAATACTTT
ATGTTGACCG CCGCGCACAG TGCGGAGGAA CTGGCTCTCA TCCGGCATGC CGCTGGCATC
GGCGAAAAGA TGTGCGAAGC GATGAGGGTA GCGACGAAGC CTGGCGCGTC CGAAGCCGAT
ATCGTGGCCG CGACTACAGC GGAATGCCTG CGCAATGCCG GCTTTACCGC GGAGGTGCTT
TTCGGATCCG GAAGGGAATT CATCGGCTGG GGTCCCGCCC CCTGGACCTA TCGCGCCCAA
CCGCCCAGAA TCATCGAAGA GGGAGACGTG GTGCTTGCAG AGGTTTTCGC CTTCTACGGC
ATGTACGAAA CCCAGCATCA ACCCGCCATA ACCGTGGGAA AGGCGCATCC GGAGTTTGAA
CGGGCTGCAG CCGTTGCGAG AGCCTGTTAC GAAGAGGGGC TGCAGGCTCT GCGTCCCGGA
CGCACGTTCG GCGAAGTGGT GGAGATCATG GAAGCCCCCC TGATCGCCTC TGGCGGTTGG
CATGTCCATC CCCTGATCCA CAGCATCAAT CCTTACGGTC CGATCGGCTT TGGTTCGGCT
CCCGGGCCGG AATCCCTGCC CGAAGCATCG GCATACGGCT CGATCGGACG CCTTCCCACG
ACCTGGCGCG ATCTGCCACT GCAGGAAGGC ATGTCTTTCG CCTTCGAAAC GAATTGTGCG
TTTGGCAGGC ATCTGGCAAA TCTGGGTGGT ACTGTTCTTG TCGGCTCGGA CGGGGGCATT
GAGCTCAACA GCAACTCGAC CAACGTGATG AGAGCAGGTT GA
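
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_fraction` are names introduced here, not part of the source database) strip the layout whitespace and compute the fraction of G and C bases, which is one common way a GC content figure such as the 59% reported for this gene is derived.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage with the first line of the gene sequence as printed above.
first_line = "ATGAAACGAC GAGAATTTTT GGGACTGTCC GCGGCAGCCA TGCTGAGCAA TCTGCCCGTG"
bases = clean_sequence(first_line)
print(len(bases))                    # 60
print(round(gc_fraction(bases), 2))  # GC fraction of this line only
```

Passing the full 1362 bp sequence through the same two functions gives the value for the whole gene.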
 
Protein sequence
MKRREFLGLS AAAMLSNLPV ADANNTTPAM DATSSNLHLD KAPRLSMTER DRRWKMARDI 
MKAAGVEGLI VYGDRESSAP APFSPDSYFT NDRLGAIVVF KGDEPPVVLA FAPMAVSDHM
QARLRGDQQW LEPKQIFVGK SGAHLVSLLD RIGLTGKIGV IGLEPYPPFY FDGAIPFGIW
NAVREAFPKA EFKPVYREYF MLTAAHSAEE LALIRHAAGI GEKMCEAMRV ATKPGASEAD
IVAATTAECL RNAGFTAEVL FGSGREFIGW GPAPWTYRAQ PPRIIEEGDV VLAEVFAFYG
MYETQHQPAI TVGKAHPEFE RAAAVARACY EEGLQALRPG RTFGEVVEIM EAPLIASGGW
HVHPLIHSIN PYGPIGFGSA PGPESLPEAS AYGSIGRLPT TWRDLPLQEG MSFAFETNCA
FGRHLANLGG TVLVGSDGGI ELNSNSTNVM RAG
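
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a hand-written translator, not the database's own tool. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as starts, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are introduced for illustration.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is ignored; characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence, in the block layout used above.
print(translate("ATGAAACGAC GAGAATTTTT G"))  # MKRREFL
```

The output matches the first seven residues of the protein sequence listed above.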