Gene SO_0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_0047 
Symbol 
ID1167946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp53199 
End bp54407 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content46% 
IMG OID637342061 
Productcarboxyl-terminal protease, putative 
Protein accessionNP_715689 
Protein GI24371647 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATC TACTCCGCAA TATCGCCAGT CTTGGACTCG GCCTAAGCTT AGGCCTGTCT 
ATTAGCCTAT CCAGTCAAGA GAATACCAAG TCGTATCGAA GTGACTTTGA TTACCCGCTA
TTGCAGGATG TGCTCGAAAC GGTCGAAACC TATTACGTTA AAACTGTGAC TAAGGATGAG
CTTGTTCAAG CGGCAATTAA AGGCATCTTT GAGCATTTAG ATCCCTATTC AAGCTTTCTA
AATCACCAAG AATTACTCGA TCTAAAAGAT TCAAATCGGG GTGAGTACTT CGGCTTTGGC
TTTGAAGTCG CCAGCGAAAA AGACCATATC AGCATCATTG CCCCCTTTGC GAACTCCCCA
GCCGAACAGG CTGGGATTCA AGCCGGTGAC ATCATTATCA AGCTGAATAA CACCCCCACG
ACAGAAACTA ACCTTGCGGA TATTCTTAAC CAAATCAAGC AACACAGTTT GAGTCATCAA
AGTATTCGCC TCACGCTAAA ACACCGTAAT GACGAAGCAG AATTTGAGGT GATGTTAAAA
CCTAGCACAA TCACAATTCA GTCGGTCGCG AGCAAATTAT TGGATGGGAA CATTGGCTAC
GTAAGGCTCA GCAGCTTTCA AGAAGACTCT ACCGAAGATA TGGTACGCAC CCTGAGCCAA
TGGCAAGGCA CTCAGTTAAC GGGCTTGATA TTGGACCTAC GCAATAATCC CGGCGGCCTG
CTCGATCAGG CAATTAATAT TGCCGACCTC TTTTTGGCAA AAGGGCGAAT CGTCTCCACC
TCTGGCCGTT TTTTTGATGC CAATTCAGAC TATTACGCCT CACCGCAAAC CATGCTCGCC
AACGTACCCA TGCTAGTGCT AATCAATAAA GGCTCCGCAT CAGCATCAGA AGTACTGGCC
GCCGCATTGC AAGAAAATGG CCGGGCAAAA CTCCTAGGCG AAACCAGCTT TGGTAAAGGA
ACAGTGCAAA GCCTTATTCC TATTCTTAAC AACGGCAATG CGGTCAAACT GACCATAGCC
CAGTACAACA CGCCTAAAGG GGAGAATATC CACGACATAG GGATTAAGCC CGACATCAAA
GTAGTCTCCG AAACTGGCTC CAATCAAAAG AATATGCCTA TAATCGACGC TATCTCTGCA
CGAACCGATG TCAGCCAAGA CACGATTGTC ACTTCAGCTA TCACTTGGAT GCAACATCAT
GACGAATAA
 
Protein sequence
MKHLLRNIAS LGLGLSLGLS ISLSSQENTK SYRSDFDYPL LQDVLETVET YYVKTVTKDE 
LVQAAIKGIF EHLDPYSSFL NHQELLDLKD SNRGEYFGFG FEVASEKDHI SIIAPFANSP
AEQAGIQAGD IIIKLNNTPT TETNLADILN QIKQHSLSHQ SIRLTLKHRN DEAEFEVMLK
PSTITIQSVA SKLLDGNIGY VRLSSFQEDS TEDMVRTLSQ WQGTQLTGLI LDLRNNPGGL
LDQAINIADL FLAKGRIVST SGRFFDANSD YYASPQTMLA NVPMLVLINK GSASASEVLA
AALQENGRAK LLGETSFGKG TVQSLIPILN NGNAVKLTIA QYNTPKGENI HDIGIKPDIK
VVSETGSNQK NMPIIDAISA RTDVSQDTIV TSAITWMQHH DE