Gene Paes_1565 details

Gene Information

Locus tag: Paes_1565
Symbol:
ID: 6458405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1703133
End bp: 1704641
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 51%
IMG OID: 642725553
Product: protease Do
Protein accession: YP_002016230
Protein GI: 194334370
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
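
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon (neither is stated in the record); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1703133, 1704641)
print(length, protein_length(length))  # 1509 502
```

Both values agree with the Gene Length and Protein Length fields of the record.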


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000901531
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00330598
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGAA AATATACCTA CTTGTTTCTT GTGCTCGCCG GTGTGCTGGT TGGCGCGCTT 
GCTTTTTCTC ATATTGATGT CACTGTACCG GTTGCCGAGA AAAAGATCGC GGTCACGAGT
TTTTCGAATG ATGCTGCTGC GTCGCCGGGG GTCGAGCATC AGCCTATCCG TTCGCTGAAG
GACCTGAACG AAGCATTTGT CCAGTTGGCT GAGTCTGCGA CACCTTCCGT TGTGACGATT
TTTACCGAAA AGACGGTCAA CAGAAAAACC ATTTCTCCGT TTGATCTTTT CGGCAGTCCC
TTTGACGATT TTTTCAATGT TCCACGTGAC AGGGGAGGGC AGAATGGATC CAAGGAGGTG
CTCAGAGGAC TCGGCTCAGG GGTGGTCGTC AGTGCGGATG GTTATATCCT TACCAACAAC
CATGTTGTCG ATAATGCCGA TGTGATCTAC ATTCGTACGT ATGAAAATAA TAAAGTGGCG
GCCAAAGTGG TCGGCAAGGA CCCGAAGACA GATCTGGCGG TTATCAAGGC TGACGTCAAA
GGATTGAAGC CTATCGCTAT CGGCGACAGT GACGCACTGA GGGTTGGTGA GTGGGTGATC
GCTATTGGCA GTCCGCTCGG AGAGAACCTT GCCCGCACAG TAACGCAGGG TATTGTCAGC
GCTAAAGGCC GCGCCAATGT CGGACTTGCT GATTATGAAG ATTTTATTCA GACAGACGCT
GCCATCAATC CCGGCAATTC CGGTGGTCCC CTGGTCAATA TCAACGGTGA ACTTGTCGGG
ATCAACACTG CTATTGCAAG TCGGACAGGT GGTTTTGAAG GGATCGGCTT TGCTGTTCCT
TCGAATATGG CAAAAAAAGT GATGCAGTCG CTCATCAGCA ACGGCAAGGT TACAAGAGGA
TGGCTGGGCG TCACCATTCA GGATGTCGAT GAAAATATCG CCAAGGGGCT GCAGCTTGAT
CCGCCAGAGG GTGTTCTTGT CGGAACGGTT GTTGACGATG GTCCTGCTGC ATCGGCAGGG
GTGAAGACGG GAGATGTGAT CATTGCCATC GATGGTAAAA AAGTGACCGA TACCATTGAA
CTTCGTAACG GCATTGCAGA GACACCTCCG GGGACGACTG TAAAACTCAG GGTGTGGCGA
AACGGGCAGG TAAAGCTCTT GAGCGTGCGG TTGAGCGAGC TTCCCGGAAA GGAGGAGGTT
GCTGTCGAAG AGAAGGCTGA AATCACTGAT CTCCTGGGTT TCAGTGTTTC GGAACTGACC
TCCGAGCTGG CTTCACGATA CAGGCTGAGT CGGGATAAGG GTGCTGTTCT TGTGACCGGT
ATCGATCCAT CGAGCAAGGC GTATCGAGCT GGTTTGCGAC AGGGAGATCT CATTCTTTCC
GTCAACAAGA AAAACGTAAG CACCTACAAG GAGTTTCTTG CGGTGGCCGG CTCAGTGAAA
AAAGGCGATC TCCTCTTTCT CTTGATCGAA CGTCAGGGCA GCAAAGTGTA TTTTGCCTTT
AACATTTAA
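
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as displayed (blocks of ten bases separated by spaces and line breaks); `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGAAAAGAA AATATACCTA CTTGTTTCTT GTGCTCGCCG GTGTGCTGGT TGGCGCGCTT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full 1509 bp sequence to `gc_percent` gives a value to compare against the reported 51%.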
 
Protein sequence
MKRKYTYLFL VLAGVLVGAL AFSHIDVTVP VAEKKIAVTS FSNDAAASPG VEHQPIRSLK 
DLNEAFVQLA ESATPSVVTI FTEKTVNRKT ISPFDLFGSP FDDFFNVPRD RGGQNGSKEV
LRGLGSGVVV SADGYILTNN HVVDNADVIY IRTYENNKVA AKVVGKDPKT DLAVIKADVK
GLKPIAIGDS DALRVGEWVI AIGSPLGENL ARTVTQGIVS AKGRANVGLA DYEDFIQTDA
AINPGNSGGP LVNINGELVG INTAIASRTG GFEGIGFAVP SNMAKKVMQS LISNGKVTRG
WLGVTIQDVD ENIAKGLQLD PPEGVLVGTV VDDGPAASAG VKTGDVIIAI DGKKVTDTIE
LRNGIAETPP GTTVKLRVWR NGQVKLLSVR LSELPGKEEV AVEEKAEITD LLGFSVSELT
SELASRYRLS RDKGAVLVTG IDPSSKAYRA GLRQGDLILS VNKKNVSTYK EFLAVAGSVK
KGDLLFLLIE RQGSKVYFAF NI
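
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the codon assignments of NCBI translation table 11, which match the standard code for elongation; it does not handle the alternative start codons of table 11 (not needed here, since the gene starts with ATG) and raises `KeyError` on ambiguous bases. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAAAGAA AATATACCTA CTTGTTTCTT"))  # MKRKYTYLFL
```

Translating the final nine bases of the gene, `AACATTTAA`, yields `NI`, which matches the last two residues listed above.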