Gene Paes_1653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1653 
Symbol 
ID6460301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1799988 
End bp1801388 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content48% 
IMG OID642725641 
ProductTPR repeat-containing protein 
Protein accessionYP_002016318 
Protein GI194334458 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCC TTGATTTTTT TGACGATGGC GCCGGCAATG AATCCAATGG TTTTTTCAAA 
AAGATACAGC TTGATGATCT CGATCTGGCC TCTCTCTATG ACACTGAAGA ACTGGTAGAA
ATCATCAATC AACTCAATGA GGAAGGCATG CACGCCGATG CGTTGACTGT TGCACGCCAC
CTTGCTACCA CCTGTCCATA CAATACCGAA TCCTGGTTTC ATCTCGGCAA CTGCCTGACG
GTGAACGGCT ATTTCGATGA TGCAAAAGAA GCGTTTCGAA AAGCGACCAT TCTCAGCCCG
GCTGACAGCG AGATGCAGCT TAATCTGGCG CTTTCCCATT TCAACATAGC AAACTACAGC
GCAGCTCTTG AAGAGCTCGA TAACATGGTG ACCGATTCCA CGCTGGAAAA AGAAACCTTT
TTTTACCGGG GCCTTATCCT TCAGAAACTC GAGCGCTACC CTGAAGCGGA AAAGAACCTC
GAGAAGTGCC TCGATATGGA ACCCGCCTTC ACCGAAGCCT GGTATGAGCT TGCTTACTGC
AAGGACCTTC TGGGTAAGCT CGAAGAGAGC GCAAAGTGCT ACCTGGAAGC AATTGATCAG
GACCCCTATA ATGTCAATGC CTGGTACAAC AGAGGTCTTG TCCTGAGTAA GCTGAAACGC
TATGATGAAG CTCTTGAGTG CTATGATATG GCGCTCGCCA TCGCTGATGA TTTCAGTTCA
GCATGGTACA ACAAGGCCAA TGTGCTGGCG ATAACCGGAA TGATCGAAGA CGCAGCAGAG
TGCTACAGGA AAACCATTGA ATTCGAACCC AACGACATCA ACGCGCTCTA CAATCTGGCC
ATCGCCTATG AAGAGCTCGA GCAATATGAT GAGGCAATCA GCCACTATAG CCGTTGCGTA
GAAATAAAAC CTGATTTTGC AGACGCCTGG TTTGCACTTG CCTGTTGTTT TGAGGCAAAC
GAAGAGTACG ATAAAGCACT CAAAGCGGTT AATCTGGCCC TTGACCATCT GCCGGGAACA
ATTGATTTTC TGCAACTGAA AGCAGAGATC CACTATAACA TGCAGGATTT TGAGTCCTCG
ATTACGACCT ATCGCTTGAT TCTCGACGAT GAAGGCGATG CGTCGCAGAT ATGGGTTGAC
TATGCCATGG TCCTTCGCGA ATCAGGCGAT TATGAAGAAT CGATCCTCGC CCTTGAAAAC
TCTATCCGGC TTCAGCCGCA GTCGGCCGAT GCTCATTTTG AAATTGCAGC GACCTACTTT
GCCCTTGGGG ATAACAACCT GACCATTCAG GCCCTGAGAA AAGCCTTTGC AATCGACCCC
GGCAAAAAAA AGCTGTTCAA ATCAACCTTT CCGGAGCTCT ACCAGCAGGA CACTATCCGC
CAGATGCTCG ATATTTCCTG A
 
Protein sequence
MSFLDFFDDG AGNESNGFFK KIQLDDLDLA SLYDTEELVE IINQLNEEGM HADALTVARH 
LATTCPYNTE SWFHLGNCLT VNGYFDDAKE AFRKATILSP ADSEMQLNLA LSHFNIANYS
AALEELDNMV TDSTLEKETF FYRGLILQKL ERYPEAEKNL EKCLDMEPAF TEAWYELAYC
KDLLGKLEES AKCYLEAIDQ DPYNVNAWYN RGLVLSKLKR YDEALECYDM ALAIADDFSS
AWYNKANVLA ITGMIEDAAE CYRKTIEFEP NDINALYNLA IAYEELEQYD EAISHYSRCV
EIKPDFADAW FALACCFEAN EEYDKALKAV NLALDHLPGT IDFLQLKAEI HYNMQDFESS
ITTYRLILDD EGDASQIWVD YAMVLRESGD YEESILALEN SIRLQPQSAD AHFEIAATYF
ALGDNNLTIQ ALRKAFAIDP GKKKLFKSTF PELYQQDTIR QMLDIS