Gene Sare_2249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2249 
Symbol 
ID5705875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2584364 
End bp2586700 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content71% 
IMG OID641271729 
ProductATP-dependent protease La 
Protein accessionYP_001537100 
Protein GI159037847 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.622317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.359605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACTC TCCCGGTCCT TCCCCTGACC GACGCCGTGC TGCTACCCGG CATGGCGATC 
CCGGTGACCC TCGACCCGAC AACCCAGGCC GCCGTTGACG CGGCCCGCGC CACAGGCGAC
CAACGGCTGT TGGCAGTGCC CCGTCTCGAC GGCGAATACG GTCCGGTCGG CGTCGTCGCC
ACCATCGAGA AGGTCGGCCG GCTACCCAGT GGCGAACCGG CCGCCGTCGT CCGCGGCCTC
GCTCGTGCCC GCATCGGATC GGGTGTACCC GGTCCCGGTG CGGCGCTCTG GGTCGAGGCC
GCCGAACTGG CTGAGCCGGC CCCGGCCGGC AGGGCGCGGG AACTCGCCCG CGAGTACCGT
GCCCTGATGA CCTCGGTGCT CCAGCAGCGT GGCGCCTGGC AGGTCATCGA CGCGATCGAA
CGGATGACCG ACCTCTCCGA ACTGGCCGAC TCGGCCGGCT ACGTGTCCTG GCTCAGCCTG
GCCCAGAAGA CCGAACTACT CGCCGCACCG GATGTCACCA CCCGCCTGGA ACTGCTCGTC
GGCTGGGTCC GGGCACACCT GGCGGAGCAG GAGGTCGCAG AACAGATCAA CACCGACGTG
CGGGAAGGGC TGGAGAAGTC GCAGCGGGAG TTTCTGCTGC GGCAACAACT CGCCACCATC
CGCAAGGAAC TCGGTGAGGA CGAACCCGAG GGCTCGGCCG ACTACCGGGC GCGCGTCGAG
GCCGCCGACC TGCCGGCGCC GGTTCGCGAT GCCGCACTGC GCGAGGTCGG CAAGCTGGAA
CGGGCCAGCG ACGCCTCCCC GGAGGCCGGC TGGATCCGGA CCTGGCTCGA CACCGTACTC
GAGATGCCGT GGAACACGCG TACCGAGGAC AACACCGACC TCGTCGCCGC CCGCGCGGTT
CTCGACGCCG ACCACGCTGG CCTGACCGAC GTGAAGGACC GGATCCTCGA GTACCTGGCC
GTGCGGAACC GGCGTGTCGA GCGCAACCTC GGCGTGGTCG GCGGACGAGG ATCAGGCGCT
GTCCTTGCCC TCGCCGGCCC GCCCGGTGTC GGCAAGACCA GCCTGGGTGA GTCCGTCGCC
CGGGCACTCG GCCGCCGCTT CGTCCGGGTC TCGCTCGGCG GCGTCCGCGA CGAGGCGGAG
ATCCGCGGCC ACCGGCGGAC CTACGTCGGA GCACTGCCAG GCCGAATCGT GCGCGCGCTG
CGCGAGGCCG GCTCGATGAA CCCGGTGGTG CTTCTCGACG AGGTGGACAA GCTGGCTGTC
GGCTACTCCG GCGACCCGGC CGCCGCGCTG CTCGAGGTGC TCGATCCAGC GCAGAACCAC
ACCTTCCGGG ACCATTACCT GGAGGTCGAT CTCGACCTGT CAGATGTGCT GTTCCTAGCC
ACCGCCAACG TGGTGGAGGC CATTCCCAGC CCGCTACTGG ACCGGATGGA ACTGGTCACC
CTCGATGGCT ACACCGAGGA CGAGAAGGTC GCCATCGCCC GCGACCACCT GTTGCCACGG
CAGCGGGAGC GGGCCGGGCT GACCGCCGAT GAGGTCACCA TCAGTGATGG CGTCCTGGCC
CGGATCGCTG GCGAGTACAC CCGGGAGGCC GGTGTCCGGC AACTTGAGCG GTCGCTGGCG
AAGATCTTCC GCAAGGTCGC CGTGACAGCG ACCACCGACC CCGCGCCGGT GCACGTGGAC
ACCGGCAACC TCCACCGGTA CCTGGGCCGG CCAAAGTTCA GCCCAGAGTC GGCCGAGCGG
ACGGCGGTGC CCGGCGTGGC CACCGGTCTG GCCGTCACCG GTGCCGGCGG TGACGTCCTC
TTCGTCGAGG CGACCAGCAT GGCGGGCGAA CCCGGACTGA CCCTCACCGG GCAGCTCGGC
GACGTGATGA AGGAGTCGGC GCAGATTGCG CTCTCCTACC TGCGTTCCAA CGGGCGGCGG
CTTGGCTTGG ACCCGAATGC CCTGGCCGGG CGGCGGATTC ACCTGCACGT CCCGGCGGGA
GCCGTGCCCA AGGACGGACC GAGCGCCGGC ATCACCATGG TGACGGCACT GGCCTCGCTG
GTCAGCGGTC GGCCGGTACG CCCCGAATTC GGGATGACCG GCGAGGTGAC GCTCTCCGGA
CGGGCGCTGC CGATCGGTGG CGTGAAGCAG AAACTGCTCG CCGCCCATCG GGCCGGCCTG
ACCGAGGTGA TCATCCCTCA ACGCAACGAG CCGGACCTCG ACGACCTGCC GGCCGAAGTG
CGCGAGGCGT TGACGGTGCA CACCCTCGCG GATGTCGCCG ACGTGCTGGC CCTGGCACTA
CGCCCGGCCG ACCTCGACGC CGACTCGCTG GACGGCGAGG CACTCGCCAC CGCCTAG
 
Protein sequence
MATLPVLPLT DAVLLPGMAI PVTLDPTTQA AVDAARATGD QRLLAVPRLD GEYGPVGVVA 
TIEKVGRLPS GEPAAVVRGL ARARIGSGVP GPGAALWVEA AELAEPAPAG RARELAREYR
ALMTSVLQQR GAWQVIDAIE RMTDLSELAD SAGYVSWLSL AQKTELLAAP DVTTRLELLV
GWVRAHLAEQ EVAEQINTDV REGLEKSQRE FLLRQQLATI RKELGEDEPE GSADYRARVE
AADLPAPVRD AALREVGKLE RASDASPEAG WIRTWLDTVL EMPWNTRTED NTDLVAARAV
LDADHAGLTD VKDRILEYLA VRNRRVERNL GVVGGRGSGA VLALAGPPGV GKTSLGESVA
RALGRRFVRV SLGGVRDEAE IRGHRRTYVG ALPGRIVRAL REAGSMNPVV LLDEVDKLAV
GYSGDPAAAL LEVLDPAQNH TFRDHYLEVD LDLSDVLFLA TANVVEAIPS PLLDRMELVT
LDGYTEDEKV AIARDHLLPR QRERAGLTAD EVTISDGVLA RIAGEYTREA GVRQLERSLA
KIFRKVAVTA TTDPAPVHVD TGNLHRYLGR PKFSPESAER TAVPGVATGL AVTGAGGDVL
FVEATSMAGE PGLTLTGQLG DVMKESAQIA LSYLRSNGRR LGLDPNALAG RRIHLHVPAG
AVPKDGPSAG ITMVTALASL VSGRPVRPEF GMTGEVTLSG RALPIGGVKQ KLLAAHRAGL
TEVIIPQRNE PDLDDLPAEV REALTVHTLA DVADVLALAL RPADLDADSL DGEALATA