Gene RPD_1212 details


Gene Information

Locus tag: RPD_1212
Symbol:
ID: 4021688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1371082
End bp: 1372860
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 64%
IMG OID: 637961404
Product: tetratricopeptide TPR_2
Protein accession: YP_568351
Protein GI: 91975692
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
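
The start, end, gene length, and protein length fields above are consistent with one another, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming a single trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1371082, 1372860)
print(length, protein_length(length))  # 1779 592
```

Under these assumptions the 1779 bp gene holds 593 codons, of which 592 encode residues and the last is the stop codon.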


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCTTC ACCGACTCCG TCGCTCGATG TTTGTCGCCG TCACCGTCGC GGCACTGCCG 
ATCGCGGGTC AGGCGCTGGC GCAGACCCCG GACCATCCCG CTGACAATTC GGCGCAGTTT
CCGTCCGGCC AGGATTTGCG GTCGATGACG ACGGCCGGCA GCTATCTGGC CGCCCGTCAC
GCCAGCGTCG AGCGCGACGC CGCCGCGGCG GCGGCGTTCT ATCGTTCGGC GCTGCGCACC
GATCCGAAGA ACAACGAGCT GCTCGACCGC GCCTTCATCT CGTCGCTGGC CGAAGGCAAT
ATCGACGAGG CCGTCAAGCT CGCCGATCGC ATCCTCAAGA TCGACAAGAC CAACCGCGTC
GCCCGTCTCG TGCTCGGCGC TCGGGACCTC AAGACAAAGA AATACGCGTC CGCGATTCAG
AACGTGAATC TGTCGGTCCG CGGCCCGATC ACCGACCTGA TCGCGACCCT GCTCGCGAGC
TGGGCGATGG AGGGAGCCGG CGACGTCAAG GGCGCGGTGG CCAATATCGA CAAGCTCGCC
GGTCCGGAAT GGTATCCGAT CTTCAAGGAT CTCCATTCCG GCATGATCCT CGAACTCGCG
AACCGTCAGA AGGACGCCGG CGTTCGCTTC GAACGCGCCT ACAAGCTCGA CGATTCCGCG
CTTCGGGTGA TGGACGCCTA TGCGCGCTGG CTGTCGCGCA ACAAGGACGA CAAGTCCGCG
GTGGCCGTCT ATGAGAGCTT CGACAAGAAG CTGTCCCGTC ATCCGCTGGT TGTCGAAGGC
CTGAACGACA CCAAGGCCGG CAAGAAGCTG CCGCCGCTGG TCGACAGTCC GCAGGCCGGC
GCGGCCGAGG CGCTGTACGG CATCGGCGCG TCGCTGACCC GCCGCGGCGG CGAAGACCTC
GCCCTGGTCT ATCTGCAGCT CGCGCTGTAT CTGCAGCCCG ATCACGCGCT GGCGCTGCTG
GCGCTCGGCG ACCTCTACGA GTCGGTCAAG AAGCCGCAGA TGGCGGTGAA GGTCTACGAG
CGCGTGCCGG CGAATTCGCC GCTGAAGCGC AACGCTCAAA TCCAGCTCGC CACCGATCTC
GATGCGACCG ACCGCAGCGA GGAGGCGATC AAGATCCTGA AGACCGTGAT TGCCGAAGAC
GGCAAGGATC TGGAAGCGAT CATGGCGCTC GGCAATATCG AGCGCGGCCG CAAGAAGTTC
GCCGACTGCG CCGTGACCTA TTCGCAGGGC ATCGACGCGC TGTCCGGGGC CGAGAAGAAC
AGCTGGGTGT ATTACTACTT CCGCGGTATC TGCGAGGAGC GCTCCAAGCA GTGGGCCAAG
GCCGAGATCG ACATGAAGAA GGCGCTGCAG CTGCAGCCGG AGCAGCCGCA CGTTCTCAAT
TATCTCGGCT ATTCCTGGAT CGACCAGGGC ATCAATCTCG ACGAAGCGAT GAAGATGATC
AAGCGCGCCG TCGATCAGCG TCCCGACGAC GGCTACATCG TCGACTCGCT CGGCTGGGCC
TATTATCGCA TCGGCAATTA CGAGGATGCG GTGAAGACGC TGGAACGCGC GATCGATCTG
AAGCCGGAAG ATCCGACCAT CAACGATCAT CTCGGAGACG CTTATTGGCG CGTCGGTCGC
ACGCTGGAGG CGCGTTTCCA GTGGGCGCAC GCCCGCGACC TCAAGCCCGA TCCGGAAGAA
TTGCCGAAGA TCGAGGCCAA GCTCGCCAAC GGCCTGCCGG ACGACACCTC GTCGGCGGCT
TCGGCCGACA AGAAAAAAGA AGACGGCAAG GGCGGCTGA
 
Protein sequence
MLLHRLRRSM FVAVTVAALP IAGQALAQTP DHPADNSAQF PSGQDLRSMT TAGSYLAARH 
ASVERDAAAA AAFYRSALRT DPKNNELLDR AFISSLAEGN IDEAVKLADR ILKIDKTNRV
ARLVLGARDL KTKKYASAIQ NVNLSVRGPI TDLIATLLAS WAMEGAGDVK GAVANIDKLA
GPEWYPIFKD LHSGMILELA NRQKDAGVRF ERAYKLDDSA LRVMDAYARW LSRNKDDKSA
VAVYESFDKK LSRHPLVVEG LNDTKAGKKL PPLVDSPQAG AAEALYGIGA SLTRRGGEDL
ALVYLQLALY LQPDHALALL ALGDLYESVK KPQMAVKVYE RVPANSPLKR NAQIQLATDL
DATDRSEEAI KILKTVIAED GKDLEAIMAL GNIERGRKKF ADCAVTYSQG IDALSGAEKN
SWVYYYFRGI CEERSKQWAK AEIDMKKALQ LQPEQPHVLN YLGYSWIDQG INLDEAMKMI
KRAVDQRPDD GYIVDSLGWA YYRIGNYEDA VKTLERAIDL KPEDPTINDH LGDAYWRVGR
TLEARFQWAH ARDLKPDPEE LPKIEAKLAN GLPDDTSSAA SADKKKEDGK GG
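
The protein sequence corresponds to the gene sequence translated with translation table 11, which assigns amino acids to codons in the same way as the standard genetic code. The following is a minimal sketch of that translation and of a GC-fraction calculation, not the database's own pipeline. The names `translate`, `gc_fraction`, and `clean` are hypothetical helpers, and the example input is only the first row of the gene sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGCTTCTTC ACCGACTCCG TCGCTCGATG TTTGTCGCCG TCACCGTCGC GGCACTGCCG"
print(translate(first_row))  # MLLHRLRRSMFVAVTVAALP
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the protein sequence and the GC content field to be checked in the same way.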