Gene Sama_3073 details


Gene Information

Locus tag: Sama_3073
Symbol:
ID: 4605320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 3651391
End bp: 3652740
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 55%
IMG OID: 639782489
Product: serine protease
Protein accession: YP_928945
Protein GI: 119776205
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
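
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes inclusive, 1-based coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3651391, 3652740)
print(length, protein_length(length))  # prints: 1350 449
```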


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0102381
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.200966
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAA AACTCTCCCT CGTATCCGCC GCTATTTTAG GTGCAACCCT GACCCTTGGC 
ACCCTGCCCG CTTATGCCTC TTTGCCGGTA GCCGTTGACG GACAGCAGCT GCCAAGCCTT
GCGCCCATGT TGGAAAAAAC CACTCCGGCT GTGGTCTCGG TTGCTGTGTC GGGTACCCAT
GTCTCCAAGC AGCGGGTCCC CGATGTATTC CGCTATTTTT TTGGTCCCAA CGCGCCTCAG
GAGCAGGTTC GTGAACGCCC CTTTCGCGGA CTGGGCTCGG GCGTCATTAT CGATGCAGAC
AAGGGCTATA TAGTCACCAA CAACCACGTT ATTGATGGCG CAGACACCAT ACAGATTGGT
TTGCTCGATG GGCGCGAATT TGAAGCCAAG CTCATCGGCA GCGACAGTGA ATCCGATATT
GCGTTGCTGC AAATCAAGGC GGATAAACTG ACCGAGATTA AGTCGGCCGA CTCAGATGCC
ATCCACGTGG GCGACTTCGC CGTAGCCATA GGCAACCCCT TTGGTCTGGG CCAAACAGTG
ACCTCAGGCA TAGTCTCTGC CTTGGGCCGC AGTGGTCTGG GTATCGAGAT GCTGGAAAAC
TTTATCCAAA CCGACGCGGC TATCAATAGC GGTAACTCAG GCGGCGCACT GGTGAATCTT
CGTGGCGAGC TGATTGGTAT CAATACCGCC ATCGTTGCTC CCGGCGGCGG CAACGTGGGT
ATAGGTTTTG CCATTCCCGC CAACATGATG CATTCACTGG TGGATCAGAT TATCGAACAT
GGTGAAGTGC GCCGCGGTGT ACTCGGCATC TCCGGACGTG AGCTCGACAG TAAACTGGCC
GAAGGCTTTG GTCTGGACTC CCAGCACGGT GCCTTTGTGA ATGAAGTCAT GCCAGACAGC
GCAGCCGACG ACGCTGGCAT CAAAGCCGGT GACATCATCA TCAGCGTTGA TGGCCGTAAG
ATTAAGAGCT TCCAGGAACT GCGCGCCAAA ATAGGCACTC TGGGCGCCGG TGCCAAGGTG
GAACTGGGCA TCATCCGCGA CGGCAAAAAC AAGACAGTGA AGGTCACGTT GGGCGAAGCG
TCCAATCAAA CAGCCTCCGC TGATGAGTTG CACCCTCAGC TGGCCGGTGC AAATCTCGAA
AGCACCTCCA AAGGGGTTGA AATCATGGAA GTACAGGAAG GCTCCCCTGC CGCTCTGAGC
GGTCTGCGCA AGGGTGATAT CATAGTGGGT GTCAATCGCA CTGCCGTGAA AGACCTGAAA
GAGCTCAGAG AGCAACTGAA AGAACAGGAT GGCGCTGCCG CCCTGAAAGT ACTGCGAGGA
AAGAGCATCC GTTATCTCGT ACTGAGATAA
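
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks of ten bases, as it is here. The following is a minimal sketch; `gc_content` is an illustrative helper, not a database function, and the reported 55% is presumably a rounded value. The example call uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only; paste the full sequence to
# recompute the whole-gene value.
first_line = "ATGAAAGCAA AACTCTCCCT CGTATCCGCC GCTATTTTAG GTGCAACCCT GACCCTTGGC"
print(f"{gc_content(first_line):.1%}")
```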
 
Protein sequence
MKAKLSLVSA AILGATLTLG TLPAYASLPV AVDGQQLPSL APMLEKTTPA VVSVAVSGTH 
VSKQRVPDVF RYFFGPNAPQ EQVRERPFRG LGSGVIIDAD KGYIVTNNHV IDGADTIQIG
LLDGREFEAK LIGSDSESDI ALLQIKADKL TEIKSADSDA IHVGDFAVAI GNPFGLGQTV
TSGIVSALGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL RGELIGINTA IVAPGGGNVG
IGFAIPANMM HSLVDQIIEH GEVRRGVLGI SGRELDSKLA EGFGLDSQHG AFVNEVMPDS
AADDAGIKAG DIIISVDGRK IKSFQELRAK IGTLGAGAKV ELGIIRDGKN KTVKVTLGEA
SNQTASADEL HPQLAGANLE STSKGVEIME VQEGSPAALS GLRKGDIIVG VNRTAVKDLK
ELREQLKEQD GAAALKVLRG KSIRYLVLR
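
The protein sequence can be derived from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI base ordering (TCAG). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so this simplified version does not handle alternative start codons. That limitation does not matter here, because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid using the NCBI TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 bases of the gene sequence.
print(translate("ATGAAAGCAA AACTCTCCCT CGTATCCGCC"))  # prints: MKAKLSLVSA
print(translate("AAGAGCATCC GTTATCTCGT ACTGAGATAA"))  # prints: KSIRYLVLR
```

The two fragments reproduce the first ten and last nine residues of the protein sequence listed above.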