Gene Sama_3400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_3400 
Symbol 
ID4605647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp4026959 
End bp4028179 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content55% 
IMG OID639782820 
Producttrypsin-like serine protease 
Protein accessionYP_929272 
Protein GI119776532 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTTC TTTGTCTGGC ACTGGCGGTT GCGCTTTGTG CCAACCTTTC TGGGTGTGCC 
AACACCCGAC TTAATGCCAA TAAATTGCCA AAAGGTGTGG CGGTCAACAA GTCCATCACC
CTGCCGGTAG ATGTCTCTGT GGCTTACTAC GTGCCCAAAA TTCAAATGGA CAGTCGTTTC
TATCTGGAGA AGTGGAATAT CTGGGTAGAG CCGGGCTCCG CCTTGTCGGA CGGTGTGCGT
GATGCCTTCA ATGCTTACTT CAGCAATGCC GTCCTGCTGG ACCGTGCAAG CGATGACGCC
GTTGGCCTGG TGATAGATCT GGATCCCGAA TGGGAGTTCG TCTCTGGCAA AGCCGTGATG
ACCTTGAGCT ACAAGGTGCT GAATGGTGGT GATAAACCGT TGAAAGAAGG TAAGAAAACC
TTTAAGGCCG ACATTGGCTA TGTGGGCGAC AACTCTGGCA TGTACAACGC GGCGATGCGG
GCAACCCAAC TGGTCATTGT AGACGTGCTC AATAGCCTCA AACCCACAGC GGCCGGGTAT
CCGGCGCAGC TGGCCATGAA GGCGACCAAC CCGCTACAAT TGGCCAACAT GGAGAAGCCA
GTGAGCAGTG GTACCGGGTT TTATATCAAT GAAACCGGAC AGTTACTGAC GGCTGCCCAT
GTGCTGCGGG AATGCATGGT CACCAAGGTG CAGACACCAA CTCAGTCTTA CGATGCCACT
GTGGATGCCA GCTCCACCCT TTTGGATCTG GCTGTGGTGA GTACAGGTGC GCCAGCTGAG
CGCTACTTGC CGCTGCGTAA GGGCAGTGAA ATCTTCCTCG GTGAAGCCGT GACCAACGTG
GGTTACCCCC TGCAGGGCCT GCTTGCGGCG TCGCCAAACC TGACCCGAGG TAACGTCAGC
TCCATGAATG CCCTCAAAGG TTCCGTTGGC CAGTTCCAGT TCTCGGCACC TATTCAGCCC
GGCTCCAGCG GGGGGCCCGT GGTATCAGAT GGTGGTGAAC TGCTGGGAGT GACAGTTGCC
ACGCTCAACG CAGCCAAGCT GATTGAATCC GGCGCCCTGC CACAGAATGT GAACTTTGCC
CTCGATGCCA GGCACGTTGC CCGATTCCTC GACAAGCATC AGGTGGCGTT TCATCAGGTA
GAGCCGAATC TGAAAGGGGA TATTCGTATC AGCAACGATG CGGCTTTGTC GTCTGTAGTG
CAGTTGGCCT GTTATCAATA A
 
Protein sequence
MRFLCLALAV ALCANLSGCA NTRLNANKLP KGVAVNKSIT LPVDVSVAYY VPKIQMDSRF 
YLEKWNIWVE PGSALSDGVR DAFNAYFSNA VLLDRASDDA VGLVIDLDPE WEFVSGKAVM
TLSYKVLNGG DKPLKEGKKT FKADIGYVGD NSGMYNAAMR ATQLVIVDVL NSLKPTAAGY
PAQLAMKATN PLQLANMEKP VSSGTGFYIN ETGQLLTAAH VLRECMVTKV QTPTQSYDAT
VDASSTLLDL AVVSTGAPAE RYLPLRKGSE IFLGEAVTNV GYPLQGLLAA SPNLTRGNVS
SMNALKGSVG QFQFSAPIQP GSSGGPVVSD GGELLGVTVA TLNAAKLIES GALPQNVNFA
LDARHVARFL DKHQVAFHQV EPNLKGDIRI SNDAALSSVV QLACYQ