Gene Sama_1226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1226 
Symbol 
ID4603478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1508471 
End bp1510828 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content54% 
IMG OID639780576 
Productendopeptidase La 
Protein accessionYP_927103 
Protein GI119774363 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000223975 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAG AGCGTGAAGC GCATTTCGAA TTGCCGGTAT TACCCCTGCG AGATGTGGTG 
GTGTACCCCC ATATGGTCAT TCCGTTATTT GTGGGTCGTG AAAAGTCTAT TCGCTGCCTG
GAAACGGCCA TGGCCCAAGA TAAGCAGATC ATGCTTGTCG CTCAGCGAGA TGCCGATCTG
GATGAGCCCG GTGCTGACGA CATTTTTGAG GTGGGGACCA TCGCCTCCAT TCTGCAGCTG
CTCAAACTGC CTGACGGCAC AGTTAAAGTG CTGGTTGAGG GGGGGCGCCG TGCCCGCGTT
GCCCGTTACA CCCAGGAAGA GCCTTTCTTC ATCGGCCGCA TCGAAGAGCT GCCTTCGGCG
CCGTTGGAAG ACAAGGAAGA AGAGGTGTTG GTGCGCAGTG CCATCGCCCA GTTTGAAGGC
TACATCAAGC TCAATAAAAA AATTCCACCC GAAGTGCTCA CCTCCATGTC CGGCATCGAC
GAGGCTGCTC GCCTTGCCGA TACCATGGCG GCCCATATGC CACTCAAGCT CGAAGACAAA
CAGTCTGTGC TTGAAATGGT GAATGTGGGT GAGCGCCTCG AGTACCTGAT GGCCATGATG
GAAGGCGAAA TCGATTTGTT GCAGGTGGAG AAGCGCATCC GCTCCCGCGT GAAAAAGCAA
ATGGAAAAGA GCCAACGCGA GTACTACCTG AATGAGCAAA TGAAAGCCAT TCAGAAGGAG
CTTGGCGATC TTGATGAAGG CCATGATGAA TTTGAGACTC TCAATCGTAA GATTGAAGAG
GCCAAGATGC CGGCTGACGC CAAAGACAAG GCCCAGGCTG AGCTGAACAA GTTACGTATG
ATGTCTCCCA TGTCAGCAGA GGCCACTGTG GTGCGCTCCT ATGTTGATTG GATGACTTCA
GTGCCTTGGT TTGAGCGCTC CAAAATCAAG CGTGACCTCT CCAAGGCCGA GCAGGTACTT
AATGCAGACC ACTATGGTCT GGAAAAAGTC AAAGAGCGTA TCCTTGAGTA TTTGGCGGTT
CAGACCCGGG TGAAACAGCT TAAAGGCCCA ATCCTGTGTC TGGTAGGCCC ACCAGGTGTG
GGTAAAACCT CGCTCGGTCA GTCAATTGCC AAGGCAACCG GCCGTAAATA TGTGCGGGTG
GCGCTCGGTG GTGTGCGCGA TGAAGCGGAA ATCCGCGGCC ATCGCCGTAC TTACATCGGC
TCTATGCCGG GCAAAATCAT TCAGAAGATG GCCAAGGTTG GGGTGAAAAA CCCGCTGTTC
CTGTTGGATG AAATCGACAA GATGAGTTCC GACATGCGCG GCGACCCGGC TTCAGCCCTG
CTGGAAGTTT TGGACCCAGA GCAGAACGCC ACCTTTAACG ATCACTATCT GGAAGTCGAT
TATGACCTGT CCGATGTGAT GTTTGTGGCC ACCAGTAACT CCATGGATAT TCCAGGCCCG
CTGTTGGACC GTATGGAAGT GATTCGTCTG TCGGGTTACA CCGAAGACGA AAAGCTCAAC
ATCGCCAAGC AGCACTTACT GCCAAAGCAG GTAGAGCGCA ACGGCCTTAA GCCACAGGAA
ATCAGCGTCG ATGATGGTGC AATTCTCGGC ATTATCCGTT ACTACACCCG TGAGGCAGGC
GTCCGTGCCC TTGAGCGTGA GCTGTCGAAG ATTTGCCGTA AAGTCGTGAA GCAAATACTG
CTGGATAAAG CGGTGAAACA CGTGGATGTG ACAGGTGAAA ACCTTAAATC ATTCCTGGGT
GTCCAGCGTC ATGACTATGG TAAGGCTGAG TCCAACAATC AGGTAGGTCA GGTAACAGGT
CTTGCCTGGA CTCAGGTTGG CGGCGATCTC CTGACCATTG AAGCCACTTC CGTGGCGGGT
AAGGGCAAGC TGACATACAC GGGCTCTCTG GGTGATGTGA TGCAGGAGTC GATTCAGGCC
GCGATGACAG TGGTTCGGGC GCGCGCCGAG CAGCTGGGTA TCAACCCTGA CTTCTATGAA
AAGCGCGATA TCCATGTGCA CGTGCCTGAA GGTGCGACGC CAAAGGATGG TCCATCAGCC
GGTGCCGCCA TGTGTACCGC ACTGGTTTCA ACCTTAACGG GTAATCCTGT TCGCGCTGAC
GTGGCCATGA CAGGTGAGAT TACTTTGCGC GGTGAAGTGC TGCCCATCGG TGGTCTCAAG
GAGAAACTGC TGGCGGCTCA CCGTGGTGGC ATCAAGCATG TGCTTATCCC CAAGGAAAAC
GAACGTGACC TTGAAGAAAT CCCCGCTAAC GTGGTGGCAG ATCTGCAAAT TCACCCGGTT
CGTTGGGTGG ACGAAGTGCT GAAATTGGCG CTGGAGCGCC CGGTTGAAGG CTTCGAAGTG
GTCAAAAACG CCGGATAA
 
Protein sequence
MSQEREAHFE LPVLPLRDVV VYPHMVIPLF VGREKSIRCL ETAMAQDKQI MLVAQRDADL 
DEPGADDIFE VGTIASILQL LKLPDGTVKV LVEGGRRARV ARYTQEEPFF IGRIEELPSA
PLEDKEEEVL VRSAIAQFEG YIKLNKKIPP EVLTSMSGID EAARLADTMA AHMPLKLEDK
QSVLEMVNVG ERLEYLMAMM EGEIDLLQVE KRIRSRVKKQ MEKSQREYYL NEQMKAIQKE
LGDLDEGHDE FETLNRKIEE AKMPADAKDK AQAELNKLRM MSPMSAEATV VRSYVDWMTS
VPWFERSKIK RDLSKAEQVL NADHYGLEKV KERILEYLAV QTRVKQLKGP ILCLVGPPGV
GKTSLGQSIA KATGRKYVRV ALGGVRDEAE IRGHRRTYIG SMPGKIIQKM AKVGVKNPLF
LLDEIDKMSS DMRGDPASAL LEVLDPEQNA TFNDHYLEVD YDLSDVMFVA TSNSMDIPGP
LLDRMEVIRL SGYTEDEKLN IAKQHLLPKQ VERNGLKPQE ISVDDGAILG IIRYYTREAG
VRALERELSK ICRKVVKQIL LDKAVKHVDV TGENLKSFLG VQRHDYGKAE SNNQVGQVTG
LAWTQVGGDL LTIEATSVAG KGKLTYTGSL GDVMQESIQA AMTVVRARAE QLGINPDFYE
KRDIHVHVPE GATPKDGPSA GAAMCTALVS TLTGNPVRAD VAMTGEITLR GEVLPIGGLK
EKLLAAHRGG IKHVLIPKEN ERDLEEIPAN VVADLQIHPV RWVDEVLKLA LERPVEGFEV
VKNAG