Gene RPD_2890 details


Gene Information

Locus tag: RPD_2890
Symbol:
ID: 4023389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3217294
End bp: 3218586
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 64%
IMG OID: 637963089
Product: peptidase T
Protein accession: YP_570019
Protein GI: 91977360
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01882] peptidase T
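
The coordinates and lengths in this record agree with each other, which a few lines of arithmetic can confirm. The following is a minimal sketch in Python, not part of the source record. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(3217294, 3218586)
print(length_bp)                  # 1293
print(protein_length(length_bp))  # 430
```

Both results match the Gene Length and Protein Length fields listed in this record.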


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.223061
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGTCTG GCGGTCCTGC AGTGAACTCT TCACCGATCC GCTCCAGCCA AATCGATTTC 
AGCCACGGCG TCATTGAGCG CTTTCTGCGC TATGTCGCGA TCGACACCCA GTCCGATCCC
GCCTCTTCGA CCTGCCCCTC GACCGCGAAG CAGAAGACCC TCGGCGCCTT GCTGGCGCAG
GAGCTGCGCG ATCTCGGGCT TTCGGACGCC CATCTCGACG AGCACGGCTA CGTCTACGCC
ACGATCCCGG CGACCACTGA CAAAAACGTC CCGGTGATCT GCTTCTGCGC GCATATGGAC
ACCTCACCCG ATTGTTCAGG CGAAGGCGTC AAGCCTCAGA TCGTGAAGAA CTATCAGGGT
GGCGACATCG TCCTGCCGGC GGACCCGACG CAGGTCATCC GCGCGACCGA GCATCCAGCG
CTGGCGCAGC AGATCGGCCA TGACATCGTC ACGACCGATG GCGTCACCTT GCTCGGGGCG
GACAACAAGG CCGGAATCGC GGAGATCATG GACGCCGCGG CATTCCTGAT CGCCAATCCG
CAGATCAGGC ATGGCACGCT CAAAGTCCTG TTCACGCCCG ACGAGGAGAT CGGCCGCGGC
GTCGACAAGG TCGACCTCGC CAAACTCGGC GCTGATTTCG CCTACACCAT GGACGGCGAG
ACCGCGGGCA ATATCGAGGA CGAAACCTTC TCCGCCGATT CGGCCGTCGT CACCATCACC
GGCGTGAGCG CCCATCCGGG CTTCGCCAAG GGCAAGATGG AGCACGCCAT CAAGATCGCT
GCGGCGATCG TGGAACGGCT TCCCAGGGAC GCCTGCTCGC CGGAAACCAC CGAGGGCCGC
GAGGGCTTCC TGCATCCGGT CGGCATCACC GGCGCGCTGG AGCAGACCAC GCTGAGTTTC
ATCGTCCGCG ACTTCACCCA GGCCGGACTG CAGCAGAAGG AAGCGCTGTT GCAGGGAATC
GTCGACGAGG TGATGCGCGA CTATCCGCGC TCGACCGCGA CGATCGAGAT CAAGCAGCAG
TATCGCAACA TGAAGCAGGT GCTCGACCGC CATCCCGAGC TGGTCGAGAA CGCCCGCGAG
GCGATTCGGC GCGCCGGCCT GACGCCGGTC ACCACCGCGA TTCGCGGCGG CACCGACGGA
TCGCGGCTGT CGTTCATGGG GCTGCCCTGC CCCAACATCT TCGCCGGCGA ACACGCCTTC
CATTCAAGGC TCGAATGGGT CAGCCGCCAG GATATGGAGG CCGCCGTTCG CACCATCGTG
CATCTGGCGA TGATCTTCGA GGAGCAGGCG TAA
 
Protein sequence
MRSGGPAVNS SPIRSSQIDF SHGVIERFLR YVAIDTQSDP ASSTCPSTAK QKTLGALLAQ 
ELRDLGLSDA HLDEHGYVYA TIPATTDKNV PVICFCAHMD TSPDCSGEGV KPQIVKNYQG
GDIVLPADPT QVIRATEHPA LAQQIGHDIV TTDGVTLLGA DNKAGIAEIM DAAAFLIANP
QIRHGTLKVL FTPDEEIGRG VDKVDLAKLG ADFAYTMDGE TAGNIEDETF SADSAVVTIT
GVSAHPGFAK GKMEHAIKIA AAIVERLPRD ACSPETTEGR EGFLHPVGIT GALEQTTLSF
IVRDFTQAGL QQKEALLQGI VDEVMRDYPR STATIEIKQQ YRNMKQVLDR HPELVENARE
AIRRAGLTPV TTAIRGGTDG SRLSFMGLPC PNIFAGEHAF HSRLEWVSRQ DMEAAVRTIV
HLAMIFEEQA
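
The sequences above are printed in space-separated blocks of 10 characters with line breaks, so the whitespace has to be removed before any analysis. Note that the gene starts with GTG while the protein starts with M: under translation table 11, GTG is an accepted initiation codon, and an initiation codon is translated as methionine. The following is a minimal sketch in Python of basic sanity checks on such a sequence. All helper names are hypothetical, and the start codon set is a deliberately incomplete subset of the table 11 initiation codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common table 11 initiation codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Length is a multiple of 3, with a start codon first and a stop codon last."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input: the first three codons of the gene followed by its stop codon.
toy = clean_sequence("GTGAGGTCT TAA")
print(toy)                  # GTGAGGTCTTAA
print(looks_like_cds(toy))  # True
```

Passing the full Gene sequence block through `clean_sequence` and then `gc_fraction` would allow the GC content field to be checked against the sequence itself.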