Gene Rpal_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3303 
Symbol 
ID6410974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3552021 
End bp3553274 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content64% 
IMG OID642713180 
Productpeptidase T 
Protein accessionYP_001992280 
Protein GI192291675 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGCGA TTAACTTCGA CTTCACCGTG TTGGAGCGTT TTCTCCGCTA CGTCACCATC 
GACACCCAAT CGGACCCGCA TTCCGGATCC TGCCCGTCCA CAGAAAAGCA GAAGGACCTC
GGCGCGCTGC TGGCGCAGGA GCTGCGCGAA TTGGGCCTCG TCGATGCGCA TCTCGACCAG
CACGGCTACG TCTACGCGAC GATTCCCGCG ACCACCGCGA AACAGAACGT TCCGGTAATC
TGCTTCTGTG CCCACATGGA CACCTCGCCC GATTGTTCAG GAGCTGGCGT CAAACCGCAG
GTCTGGAAGG ACTATCAGGG CGGCGACATC GTCCTGCCGG GTGATAAGTC GCAGGTGATT
CGGCGCGCCG AGCACCCGGC GCTGTCGAAT CAGATCGGCC ACGACATCGT CACCAGCGAC
GGCACCACCC TGCTCGGCGC CGACAACAAA GCCGGCGTCG CCGAGATCAT GGATGCGGCG
CGGTTTCTGC TAGCGCACCC CGAGATTAAG CACGGCACGC TCAAGATCCT GTTCACCCCG
GACGAAGAGA TCGGCCGCGG TGTCGACAAG GTCGACCTCG CCAAGCTCGG CGCCGATTTC
GCCTTCACCA TGGACGGCGA AAGCGCCGGG CATATCGAGG ATGAGACGTT CTCGGCCGAC
AGCGCGGTGA TCACCATCGA GGGCGTCAGC GCCCATCCGG GATTCGCCAA GGGCAAGATC
GAGCACGCCA TCAAGATCGC CGCGGCGATC ATCGAGCGGC TGCCCAAGAC CGGATGCTCG
CCGGAGACCA CCGAAGGACG CGAAGGCTTC CTGCATCCGA TCGGAATCAC CGGCACGCTG
GAGAAGGCCA GCGTCAGCTT CATCGTCCGG GATTTCACAG AAGCCGGACT GAGGGACAAG
GAAACGCTGC TGCAGAGCAT CGTCGAAGAG GTGATGCTGG ATTATCCGCG CTCGCGCGCC
AAGATCGAGA TCCAGCCGCA ATATCGCAAC ATGAAGCAGG TGCTCGACCG CCATCCCGAG
CTGGTCGAGA ACGCCCGTGA AGCGATCCGC CGTGCCGGCC TCACGCCGGT CACCGCCGCG
ATCCGCGGCG GCACCGATGG CGCGCGGCTG TCGTTCATGG GCCTGCCCTG CCCCAACGTG
TTCGCCGGCG AGCACGCCTT CCACTCCCGT CTGGAATGGG TCAGCCGCCA GGACATGGAG
AAGGCGGTCG AGACCATCGT GCATCTGGCA ACGATCTTCG AAGAGCAGGC GTAA
 
Protein sequence
MTAINFDFTV LERFLRYVTI DTQSDPHSGS CPSTEKQKDL GALLAQELRE LGLVDAHLDQ 
HGYVYATIPA TTAKQNVPVI CFCAHMDTSP DCSGAGVKPQ VWKDYQGGDI VLPGDKSQVI
RRAEHPALSN QIGHDIVTSD GTTLLGADNK AGVAEIMDAA RFLLAHPEIK HGTLKILFTP
DEEIGRGVDK VDLAKLGADF AFTMDGESAG HIEDETFSAD SAVITIEGVS AHPGFAKGKI
EHAIKIAAAI IERLPKTGCS PETTEGREGF LHPIGITGTL EKASVSFIVR DFTEAGLRDK
ETLLQSIVEE VMLDYPRSRA KIEIQPQYRN MKQVLDRHPE LVENAREAIR RAGLTPVTAA
IRGGTDGARL SFMGLPCPNV FAGEHAFHSR LEWVSRQDME KAVETIVHLA TIFEEQA