Gene Paes_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1920 
Symbol 
ID6460047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2101909 
End bp2104335 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content52% 
IMG OID642725905 
Productpeptidase S16 lon domain protein 
Protein accessionYP_002016579 
Protein GI194334719 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTGC CCGCACCCCT GAACGCTGAC ACGCTCTATA AACACTGTGA TGCCGAACAG 
TTCAGTTTCA GCACAACGGA GGAACTCGAA GACAAGCCGC AGTCTGCCGG ACAGGAACGC
GCGATCGAAG CTATCCGTTT CAGTATAGGA ATGAAACATG ACGGTTTCAA TCTCTTTGCC
CTCGGCCCCG GAGGCACGGG AAAACAGACC GCTATCGAAC TCTATCTCAA AACAACGGCT
CCGGCTGAGA AGGTGCCTGA CGACTGGTGT TACGTCTATA ATTTCCTGAA ACCCCGTCAG
CCTGCGGCGA TATCGATGCC TCCGGGCAAA GCCTGTGCAT TGAGCAAAGA TATGGACCTC
CTGATCGAAG AGCTCATCAC CAGTATTCCG GCAGCCTTCA ACAGCGAAGA GTATCAGGAG
CAGGAAAAAG CGATCAAAGA GGAGTTCCAG GATAAAGAAT CCTCAGCCAT TGAAGCGCTT
GAAAAAAAAG CTGCCGATAA TAATATAGCC GTCATCCGAA CACCTTCAGG CTTTGCGTTC
GCTCCCATAC GCAAAGGAGA GGTTCTTAAA TCCGAGGAGT TCCTGCGCCT CAAGCCTGAT
GAAAAAGAAG ACATCGAACG TGAAATCGCT GTCCTGAAAG AGTCGATGCA ATCGATCATG
ATGCAGATAC CCAAATGGCA GCGGGAAGGC CAGGAAAAAC TCAAGGAGCT CAACCAGCAG
GTTGCCGGTC TGGCTATCAG GCCGGTCATC AGCGAGCTGA AAACCAAATA CAGAGAGATA
GCTGCGATTG CAACCTATCT CGAAGCGGCC GAAAAAGATA TTATCGATAA TTTTGAACAA
TTTCTGGCAA GAGAGGATCA GATGCAGGAA ATCCTGCCAG TGCCTGCAGC CATCAAAGGA
AGCCGAAAAA AGCAGTTCCA CCGCTACAGG GTCAATGTGA TTGTCGACAA CGGCGACACC
AATGGCGCTC CGGTGGTCTA CGAAGATAAA CCCTCCTGCC AGAACCTCCT GGGCGATATC
GAGCACATCT CCCAGATGGG AACGCTCGTG ACGGATTTCA CACTGATCAA ATCCGGAGCG
CTGCACAAAG CTAACGGAGG CTACCTGATC CTCGACGCAC GCCGCCTCCT GCTTGAACCA
CTGGCATACG AAGCCTTAAA AAAAGCCATT CGTACCCGGC AAATCCGCAT TGAATCGCTT
GCCCAGCTCT ACAGTCTGAT CAGCACGGTC TCACTTGAGC CGGAACCGCT TGCGCTTGAT
ATCAAGGTTA TCCTGCTTGG AGAACGGCAC CTCTACTACC TCTTGAGCGC CTACGATCCG
GATTTCAGGG AACTCTTCAA GGTTGCCGCC GATTTTGACG ACACGATGGA GCGAAGCGAA
CCGGCAGCCA ATTCCTATGC CAATATTCTT GCCGGCATCG TACGCAAAGG AAATCTCCGT
CATCTCGACC GCCACGCGGT CGCACGCGTC ATTGAATACG GCGCCCGGAT CTCGGGAGAT
GCCAACCGCC TCTCTACCCA TCTGCAGAGC CTTGCCGATC TGGTCCATGA AGCCGATTTT
TTCGCCGGAG AAGACAACTC CGCCCATATA GGCCGGAACC ACATTCAGAA AGCCATTGAT
GCCAAGCGCT ATCGAGCAGG CAGAATACCG GAAAGAATCA GAACATCGAT GATGGAAAAG
ACCATCCTGA TCGATACTGA TTCGGAAAAA ACCGGACAGA TCAATGGACT TGCCGTCTAC
CTGCTTGGCG ACCAGTCTTT CGGGCATCCG AGCAGAATAA CAGCACAGAT CAGAATGGGA
AAGGGAGAGG TCATCGACAT CGAACGGGAA GTTGAAATGG GCGGCCCCAT CCACTCGAAA
GGAGTCCTGA TCCTAACCGG TTTTCTTGGT GGGCGATTTG GTGTCGATCA GCCGCTTTCA
CTTTCGGCAA CACTGGTCTT TGAACAATCA TACAGCGGAA TTGAAGGCGA CAGCGCATCA
TCAGCGGAAC TCTACGCTCT GCTCTCGGCC CTTTCCGACA TCCCCATACG CCAGTGGCTC
GCGGTGACCG GCTCGGTCAA CCAGCATGGA GAGGTTCAGG CTATCGGTGG CGTCAATGAA
AAAATTGAGG GTTTTTTCGA CCTCTGCAAT GCACGGGGAC TTTCCGGCAA AGAGGGCGTG
CTCATCCCGG CTTCCAACAC GAGACACCTC ATGCTGCGGG AAGATGTCGT AGAAGCAGTC
CGGGACAACC GTTTCCACAT CTATCCAGTG ACTCACATCG ATGAAGGCAT CGAGATCCTC
ACCGGAAAAT CAGCAGGAAA AGCGGATGAA AACAACGTCT GGCAAGAGGG TTCCATCAAT
GCCCGAATTG TCGCCAGGCT TAAAGAGATA GCCGAAAAAC AAAAGGCTTT TTCAGCACTA
TCGCAAAAGC ACAGCAACAA CGAGTAA
 
Protein sequence
MSLPAPLNAD TLYKHCDAEQ FSFSTTEELE DKPQSAGQER AIEAIRFSIG MKHDGFNLFA 
LGPGGTGKQT AIELYLKTTA PAEKVPDDWC YVYNFLKPRQ PAAISMPPGK ACALSKDMDL
LIEELITSIP AAFNSEEYQE QEKAIKEEFQ DKESSAIEAL EKKAADNNIA VIRTPSGFAF
APIRKGEVLK SEEFLRLKPD EKEDIEREIA VLKESMQSIM MQIPKWQREG QEKLKELNQQ
VAGLAIRPVI SELKTKYREI AAIATYLEAA EKDIIDNFEQ FLAREDQMQE ILPVPAAIKG
SRKKQFHRYR VNVIVDNGDT NGAPVVYEDK PSCQNLLGDI EHISQMGTLV TDFTLIKSGA
LHKANGGYLI LDARRLLLEP LAYEALKKAI RTRQIRIESL AQLYSLISTV SLEPEPLALD
IKVILLGERH LYYLLSAYDP DFRELFKVAA DFDDTMERSE PAANSYANIL AGIVRKGNLR
HLDRHAVARV IEYGARISGD ANRLSTHLQS LADLVHEADF FAGEDNSAHI GRNHIQKAID
AKRYRAGRIP ERIRTSMMEK TILIDTDSEK TGQINGLAVY LLGDQSFGHP SRITAQIRMG
KGEVIDIERE VEMGGPIHSK GVLILTGFLG GRFGVDQPLS LSATLVFEQS YSGIEGDSAS
SAELYALLSA LSDIPIRQWL AVTGSVNQHG EVQAIGGVNE KIEGFFDLCN ARGLSGKEGV
LIPASNTRHL MLREDVVEAV RDNRFHIYPV THIDEGIEIL TGKSAGKADE NNVWQEGSIN
ARIVARLKEI AEKQKAFSAL SQKHSNNE