Gene Rpal_1230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1230 
Symbol 
ID6408886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1301464 
End bp1303248 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content65% 
IMG OID642711128 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_001990245 
Protein GI192289640 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTCCC TTCGAATCCG TCGATCGATG GCCGTCGCTC TCACGCTGGT GGCGATGCCG 
ATCGCCGGAC AGGCGCTGGC GCAACCCCCC GATCATCCGG CCGACAACGC GGCGCAGTTT
CCGACCAGCC AGGATCTGCG GTCGATGACG ACCGCCGGCA GCTACCTGGC CGCCCGCCAC
GCCAGCATCG AGCGTGACGC CGCCTCCGCC GCCGCGTTCT ACCGCTCCGC GCTACGCACC
GATCCGAGCA ACAACGAGCT GCTCGACCGC GCCTTCATCT CGTCGCTCGC TGAAGGCGAC
ATCGACGAAG CGGTCAAGCT TGCCGATCGC GTGTTGAAGA TCGACAAGTC CAATCGCGTC
GCCCGGCTGG TGATCGGTAT TCGCGATCTG AAGACCAAGA AATACGCTGC CGCGGTTCGC
AACGTGAACC AGTCGGTGCG CGGCCCGATC ACCGATCTGA TCGCGACCCT GATCTCGAGC
TGGTCGCTGT ACGGCGCCGG TGACGTCAAG AGCGCGGTCG GCAACATCGA CAAGCTGGCC
GGTCCGGAGT GGTATCCGAT CTTCAAGGAT CTGCATTCCG GCATGATGCT GGAACTGGCC
AACCGCCAGA AGGACGCCGG CGAACGGCTG GAGCGCGCCT ACAAGCTCGA CGATTCCGCG
CTGCGCGTGG TCGAGTCCTA TGCGCGCTGG CTGTCGCGCA ACAAGAGCGA GGCCGAGGCG
CTTGCGGTCT ATCAGGCATT CGACAAGAAG CTGCCGCGTC ATCCGCTGAT CGAGGACGGC
ATTCGCGAGG TCAAGGCCGG CAAGAAGCTG TCGCCGCTGG TTGACAGCCC GCAGGCCGGC
GCCGCCGAGG CGCTGTACGG CATCGGCGCG TCGCTGACCC GCCGCGGCGG CGAAGATCTG
GCGCTGGTGT ACCTGCAGCT CGCGCTCTAC CTCGAACCCA ATCATGCACT CGCGCTGCTG
GCGCTCGGCG ATCTCTACGA GTCGGTGAAG AAGCCGCAGA TGGCGATCAA GGTGTATGAG
CGCGTGCCGG CGTCCTCGCC GCTGAAGCGC AACGCCCAGA TCCAGCTCGC CACCGACCTC
GACGCCTCCG ACCGCAGCGA GGAAGCGATC AAGATCCTCA AGGGCGTGAT CGCCGAGGAC
GGCAAGGATC TCGAAGCCAT CATGGCGCTC GGCAATATCG AGCGCGGCCG CAAGAAGTTC
GCCGACTGCG GCGAGACCTA CTCCAAGGGC ATCGATGCGC TCACCGGCGC CGAGAAGAAC
GCCTGGGTGT ATTACTACTT CCGCGGCATC TGCGAGGAGC GCTCCAAGCA GTGGGCCAAG
GCCGAGGTCG ACCTGAAGAA GGCGCTGCAA ATGCAGCCCG ACCAGCCGCA CGTGCTGAAC
TATCTCGGCT ATTCCTGGAT CGATCAGGGC ATCAACCTCG ACGAGGCGAT GACCATGATC
AAGCGCGCGG TCGATCAGCG CCCTGACGAC GGCTACATCG TCGACTCGCT CGGCTGGGCG
TACTACCGCA TCGGCGACTA CGAGAATGCC GTGAAGACGC TGGAGCGGGC GATCGAATTG
AAGCCCGAGG ACCCGACCAT CAACGATCAC CTCGGCGATG CCTATTGGCG CGTCGGCCGT
ACGCTCGAAG CCCGCTTCCA GTGGGCGCAC GCCCGCGACC TCAAGCCCGA TCCGGAGGAG
CTGCCGAAGA TCGAGGCCAA GATCGCCAAC GGTCTGCCCG AAGAGGACAA GGCGTCGGCG
GCGTCCGCGG ACAAGAAGAA AGAAGACGGC AAGGGCGGCG GCTGA
 
Protein sequence
MLSLRIRRSM AVALTLVAMP IAGQALAQPP DHPADNAAQF PTSQDLRSMT TAGSYLAARH 
ASIERDAASA AAFYRSALRT DPSNNELLDR AFISSLAEGD IDEAVKLADR VLKIDKSNRV
ARLVIGIRDL KTKKYAAAVR NVNQSVRGPI TDLIATLISS WSLYGAGDVK SAVGNIDKLA
GPEWYPIFKD LHSGMMLELA NRQKDAGERL ERAYKLDDSA LRVVESYARW LSRNKSEAEA
LAVYQAFDKK LPRHPLIEDG IREVKAGKKL SPLVDSPQAG AAEALYGIGA SLTRRGGEDL
ALVYLQLALY LEPNHALALL ALGDLYESVK KPQMAIKVYE RVPASSPLKR NAQIQLATDL
DASDRSEEAI KILKGVIAED GKDLEAIMAL GNIERGRKKF ADCGETYSKG IDALTGAEKN
AWVYYYFRGI CEERSKQWAK AEVDLKKALQ MQPDQPHVLN YLGYSWIDQG INLDEAMTMI
KRAVDQRPDD GYIVDSLGWA YYRIGDYENA VKTLERAIEL KPEDPTINDH LGDAYWRVGR
TLEARFQWAH ARDLKPDPEE LPKIEAKIAN GLPEEDKASA ASADKKKEDG KGGG