Gene Rpal_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0940 
Symbol 
ID6408594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp998752 
End bp1000842 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content68% 
IMG OID642710854 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_001989973 
Protein GI192289368 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.131573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAA GCTCCGGACC GATTGCCGCA CCCGCTGATG CCGGCAATCC GCTGCTGGCC 
GCCTGGACCA CCCCGTTCGA GACACCGCCG TTCGCCGAGA TCAGGCCCGA GCACTTCATG
CCGGCCTTCG AGCAGGCGTT CGCCGACCAC GCGGGCGAGA TCGCCGCGAT CGTCAACGAT
CCGACCGAGC CGGACTTCGA CAACACCGTC ACGGCGCTGG AGCGCTCAGG CAAGCTGCTG
AACCGGGTCG CCGCGGTGTT CTACGATCTG GTGTCGGCGC ATTCGAGCCC GGAGCTGCTG
AAGATCGACG AAGAGGTGTC GCTGCGGATG GCGCGGCACT GGAATCCGAT CATGATGAAC
GCGGTGCTGT TCGGCCGGAT CGCGGCGCTG CGCGACAAGG CGGCGCGGCT GAACCTGACG
CCGGAGCAGA GCCGGCTGCT GGCACGCAGC TACACCCGCT TCCACCGTGC CGGCGCCGGC
CTCGATGAGG CGTCCAAGGC GCGCATGGCG GCGATCAACG AACGCCTGGC GCAGCTCGGC
ACCAGCTTCA GCCATCATCT GCTCGGCGAC GAGCAGGAAT GGATGATGGA GCTCGGCGAG
GGCGATACCG AGGGGCTGCC CGACAGCTTC GTGGCGGCGG CGCGCGCTGC GGCGGAAGAG
CGCGAGCTGC CCGGCAAGGC CGTGGTGACG CTGTCGCGGT CGTCGGTCGA GCCGTTCCTG
AAGATGTCGA GCCGGCGCGA TCTGCGCGAG AAGGTGTACC GCGCCTTCAT CGCCCGCGGC
GACAACGGTA ACGATAACAA CAACAACGCG ATCATCGGCG AGATCCTGAG CCTGCGCGAA
GAAAGCGCCA AGCTGCTCGG CTATCCGACC TTTGCGGCCT ACCGTCTGGA AGACTCGATG
GCCAAGACGC CGGAGGCGGT GCGCGGCCTT TTGGAGCGGG TGTGGAAGCC GGCGCGCGCT
CGCGCGCTCG CCGACCGCGA CGCGCTGCAG GAGCTGGTCA CGGAAGAGGG CAGCAACTTC
AAGCTGGCGC CGTGGGACTG GCGCTACTAC GCCGAGAAGC TGCGCCAGCG CCGCGCCAAT
TTCGACGATG CGGCGATCAA GCCGTATCTG ACGCTCGACG GCATGATCGC CGCGGCGTTC
GACACCGCCA CGCGGCTGTT CGGCATCACC TTCCAGGAAC GCAAAGACGT GCCGGTGTGG
CACCCCGACG TTCGCGTTTG GGAAGTGAAG GACGCCGACG GCGCTCATCG CGGGCTGTTC
TACGGCGACT ACTATGCCCG GCCCTCGAAG CGTTCCGGCG CCTGGATGAC CTCGCTGCGC
GATCAGCAGA AGCTCGACAG CGCGGTGGCG CCGCTGATCA TCAACGTCTG CAACTTCGCC
AAGGGCGCCG GCGGCGAGCC GTCGCTGCTG TCGCCCGACG ACGCCCGCAC GCTGTTCCAC
GAATTCGGCC ACGGCCTGCA CGGCATGCTG TCGGACGTGA CCTACCCGTC GCTGTCCGGC
ACCAGCGTGT TCACCGACTT CGTCGAGCTG CCGTCGCAGC TTTACGAGCA TTGGCAGGAG
CAGCCGCAAG TGCTCCGGCA GTTCGCCCGT CACTATCAGA CCGGCGAGCC GCTGCCCGAC
GACCTGTTGC AGCGGTTCAT CGCCGCCCGC AAGTTCGGCC AGGGCTTCGC CACCGTCGAG
TTCGTCTCCT CGGCGCTGCT CGATCTCGAA TTCCACACCC AGCCCGCCGC CAGCATCGGC
GAGGTCCGCG CCTTCGAGCG CAAGGAACTC GACAAGATCG GCATGCCGGA AGAGATCGCG
CTGCGGCACC GGCCGACCCA GTTCGGCCAC ATCTTCTCCG GCGATCACTA CGCTTCGGGC
TACTACAGCT ACATGTGGAG CGAGGTAATG GACGCCGACG CGTTCGGCGC GTTCGAAGAG
GCCGGCGACA TCTTCGCGCC GGACGTGGCC AAGCGGCTGC GCGACGACAT CTATTCGTCC
GGCGGCTCGC GCGATCCGGA GGAAGCCTAT GTGGCGTTCC GCGGCCGCAA GCCGGAGCCC
GACGCGCTGC TGCGCCGCCG CGGCCTGCTC GACACGCCGG AGGCCGCGTA G
 
Protein sequence
MSESSGPIAA PADAGNPLLA AWTTPFETPP FAEIRPEHFM PAFEQAFADH AGEIAAIVND 
PTEPDFDNTV TALERSGKLL NRVAAVFYDL VSAHSSPELL KIDEEVSLRM ARHWNPIMMN
AVLFGRIAAL RDKAARLNLT PEQSRLLARS YTRFHRAGAG LDEASKARMA AINERLAQLG
TSFSHHLLGD EQEWMMELGE GDTEGLPDSF VAAARAAAEE RELPGKAVVT LSRSSVEPFL
KMSSRRDLRE KVYRAFIARG DNGNDNNNNA IIGEILSLRE ESAKLLGYPT FAAYRLEDSM
AKTPEAVRGL LERVWKPARA RALADRDALQ ELVTEEGSNF KLAPWDWRYY AEKLRQRRAN
FDDAAIKPYL TLDGMIAAAF DTATRLFGIT FQERKDVPVW HPDVRVWEVK DADGAHRGLF
YGDYYARPSK RSGAWMTSLR DQQKLDSAVA PLIINVCNFA KGAGGEPSLL SPDDARTLFH
EFGHGLHGML SDVTYPSLSG TSVFTDFVEL PSQLYEHWQE QPQVLRQFAR HYQTGEPLPD
DLLQRFIAAR KFGQGFATVE FVSSALLDLE FHTQPAASIG EVRAFERKEL DKIGMPEEIA
LRHRPTQFGH IFSGDHYASG YYSYMWSEVM DADAFGAFEE AGDIFAPDVA KRLRDDIYSS
GGSRDPEEAY VAFRGRKPEP DALLRRRGLL DTPEAA