Gene Sama_3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_3074 
Symbol 
ID4605321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3652836 
End bp3653906 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content56% 
IMG OID639782490 
Productprotease DegS 
Protein accessionYP_928946 
Protein GI119776206 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.940129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.195971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGCG CTTTACTCTA TTTAGGCAAA GCCGTGCTCT TTGGCTTAGT GATGGCCGCC 
GTGTTTCTTC TCGCGCAAAA CTACCTGGGT AACGACCATG GCAACCTGTT CCGGGATCGT
GGCAATGGTG GCAGCGAACT TTCTTTTGCC AAGGCGGTCA GACGTGCCGC ACCCGCGGTC
GCCAACATAT ACAACATCAG CATAGATCAG CGCCAACCGC TCAACAGTGG CGCACTCCAG
GGGCTTGGCT CGGGGGTGAT CATCAGCAAG GAAGGCTACA TACTCACCAA CTACCACGTT
ATCAAGAAGG CAGATGAAAT TGTCATCGCC CTGCAGGATG GCCGGCGTTT CAGCGCCGAA
GTGGTTGGCT CGGATCCGGA AACAGACCTG AGCGTACTCA AAATTGAAGG GGATAAGCTG
CCTGTGGTGC CACTTAACCT CACCACCCCG CCCCAGGTAG GTGACGTGGT GCTGGCCATC
GGCAATCCCT ATAACCTGGG TCAAACCATT ACCCAGGGCA TTATCAGTGC CACAGGTCGT
AACGGTTTGA GCTCCGGTTA TCAGGACTTC CTGCAGACCG ATGCGGCCAT TAACGCGGGC
AACTCGGGTG GTGCCCTGAT AGATACCAAT GGCGATGTGA TAGGCATCAA CACCGCCGCC
TACCAGATAG GTGATGAAGG CGGTGGACAC GGCATCAGCT TTGCGATTCC CATCCGCCTC
GCCTACTCCA TCATGGGTAA ACTCATCGAA CACGGCCGGG TGATCCGCGG TGCCCTTGGG
ATTTCCGGCG AGGCGCTGAC ACCCTTTATG GCGCAGGTGC TAAAAATGCA GGAAGTTCGC
GGCGTACTGG TGACAGGCGT GGATCGCAAC GGACCGGCAG CCGATGCCCA GATGCAACCC
AGGGATGTGA TTATTGAATA TGGCGGTGAA AGTGTGGTCA GTGCTGAAAT GCTGATGGAC
CGCATTGCAG AAACCAAACC CGGTACCGAA GTGCAGATGA CCATCATTCG CCAGGGCAAG
GCTTACACCT TACCCGTGAT TATCGGCGAG AAGGCAACCG AGTATAACTA A
 
Protein sequence
MKSALLYLGK AVLFGLVMAA VFLLAQNYLG NDHGNLFRDR GNGGSELSFA KAVRRAAPAV 
ANIYNISIDQ RQPLNSGALQ GLGSGVIISK EGYILTNYHV IKKADEIVIA LQDGRRFSAE
VVGSDPETDL SVLKIEGDKL PVVPLNLTTP PQVGDVVLAI GNPYNLGQTI TQGIISATGR
NGLSSGYQDF LQTDAAINAG NSGGALIDTN GDVIGINTAA YQIGDEGGGH GISFAIPIRL
AYSIMGKLIE HGRVIRGALG ISGEALTPFM AQVLKMQEVR GVLVTGVDRN GPAADAQMQP
RDVIIEYGGE SVVSAEMLMD RIAETKPGTE VQMTIIRQGK AYTLPVIIGE KATEYN