Gene Sama_1447 details


Gene Information

Locus tag: Sama_1447
Symbol:
ID: 4603699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 1757126
End bp: 1758889
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 53%
IMG OID: 639780797
Product: protease, putative
Protein accession: YP_927324
Protein GI: 119774584
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
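
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 1757126
END_BP = 1758889
GENE_LENGTH_BP = 1764
PROTEIN_LENGTH_AA = 587


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # 1764
print(coding_length_aa(GENE_LENGTH_BP))   # 587
```

Under these assumptions, 1758889 - 1757126 + 1 gives the reported 1764 bp, and 1764 / 3 = 588 codons, one of which is the stop codon, gives the reported 587 aa.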


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.747842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGTT ACAGCATTAT TCCAAGTGAT CCCAAAGCTC ATCTGTTTGA GGTCAATCTG 
GAGATCCCAA CCCCAAGTGT TAACCAATTG CTTTCGTTGC CAGCCTGGAT CCCGGGCTCG
TATATGGTGC GTGACTTTGC CCGCAATCTG ATTGGAATCA AAGCCTGGCA AGCGAATAAC
CCCGTCGCTC TGACTCAACT CGATAAACAA CGTTGGCAAG TGGAAAACCG CTCAACAGAG
CCGCTGCGCC TGTCATATCA GGTGTACGCA TTCGACCTGT CGGTACGCAG TGCACATCTG
GATATGACCC ATGGTTTTTT CAATGGCTCC AGTGTGTTTC TGGCGGTAGA AGGCCAAACC
GACGGACCGC TGTGGGTTGA GATTCAGACG CCGACCGAGA TTGCCCTCAA TCATTGGCGA
GTGGCGACAA CCCTGCCAAG ACGCAGCGGC GATGCCTGGC AGTTTGGCGC CTTTGAAGCC
GAAAACTATG ACGCCCTTAT CGACCACCCG GTAGAAATGG GTGACTTTAC CCAGGCAAGT
TTTGAGGCCT GCGGCGTGCC CCACCACATA GTGCTGACCG GCCGCCATCG CTGCGATATG
GAGCGCCTTT GCCAGGATTT AAAAGCCATC TGCGAAACGC AAATACAGCT GTTTGGTGCG
CCAGCCCCCT TTGATGAATA CCTGTTTATG ACCATGGTAT TGGACAATGG CTTTGGCGGA
TTGGAACACC GCAGCTCAAC GGCACTGGTG TGCTCACGTA AAGATTTGCC GCTGTCCATG
GACGACAAAG TCAACAAGGA CTATCGCACT TATTTATCCC TGTGCAGCCA TGAGTATTTC
CACAGCTGGA ACGTTAAGCG GATTAAACCC GCTACCTTTA TCCCCTATTC GCTGGAGAGT
GAGAGCTACA CCCGGCAATT GTGGGCTTAC GAAGGACTGA CGTCTTACTA TGACGACCTG
CTGACGCTTA AAGCGGGCAA AGTCGATACC GAAACCTACC TGGACATGCT CAGTGAAACC
ATGACACGGG TATACCGGGG TGTGGGCCGT TTCAAACAAA GCCTCAGGGA TTCGAGCTTT
AATGCCTGGA CCAAGTTTTA CAAACAGGAT GAAAATGCCC AAAACGCCAT CGTAAGCTAT
TACACCAAGG GCGCACTCTT CGGCCTGTAT CTCGATCTTG TCATCCGCAA GGAAACCGCA
GGTAAGGAGA GCCTGGACTC CCTGATGCGC CTGTTATGGG AGCGTTTTGG ACTGACCGGA
CAGGGAACTG ACGAACACAG TCATCAGGCT CTGGTAGCAG AGCTGCTTGG CCGCGATGTG
AGCGATATTT TCGCCTATTT GGATAACACG GACGACTTAC CCCTCGCCCC GCTGCTTGCC
GAGTTTGGTG TTCACATGGC GCTCAGGGCC GCAAGCGGTC AAAACGACAC CGGGGGCGGA
AAACCTTCGC CACTGACACT GTCTCTTGGT GCCAAATTCA AGGCAGAGCC ATTGGGGGTA
AGAATTCAAA CGGTGGCAGA ATCCAGCAGT GCACATCTGG CCGGTTTATC GGCTGGTGAC
TTACTGATTG CCATCGATGG GCTTCAGGCC ACGGCCAATC TTGAAAACGT GCTTAACAGT
TACAATGAAG GCGATGAGCT GGCGCTGCAT TTCTTCCGCC GCGACGAATT GATGACTGCC
AAATTACGGC TTCAGGCAGC CCCTCTGGAT ACAGTGTCGC TGCATCTGGA AGATGCCACT
GCGATAGCGG CCTGGCTCGG GTAA
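
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as GC content can be recomputed. The following is a minimal sketch, not the method used to produce the GC content value reported on this page. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the listing are used for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T) in seq."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no unambiguous bases in sequence")
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


# First and last lines of the gene sequence above, for illustration only;
# paste the full listing to compare against the reported GC content.
first_line = "ATGATCCGTT ACAGCATTAT TCCAAGTGAT CCCAAAGCTC ATCTGTTTGA GGTCAATCTG"
last_line = "GCGATAGCGG CCTGGCTCGG GTAA"

seq = clean_sequence(first_line + "\n" + last_line)
print(len(seq))                   # 84 bases in this two-line excerpt
print(f"{gc_fraction(seq):.0%}")  # GC content of the excerpt only
```

Applied to the full 1764-base listing, the result can be compared with the reported GC content. A small difference may appear depending on how the page rounds the value.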
 
Protein sequence
MIRYSIIPSD PKAHLFEVNL EIPTPSVNQL LSLPAWIPGS YMVRDFARNL IGIKAWQANN 
PVALTQLDKQ RWQVENRSTE PLRLSYQVYA FDLSVRSAHL DMTHGFFNGS SVFLAVEGQT
DGPLWVEIQT PTEIALNHWR VATTLPRRSG DAWQFGAFEA ENYDALIDHP VEMGDFTQAS
FEACGVPHHI VLTGRHRCDM ERLCQDLKAI CETQIQLFGA PAPFDEYLFM TMVLDNGFGG
LEHRSSTALV CSRKDLPLSM DDKVNKDYRT YLSLCSHEYF HSWNVKRIKP ATFIPYSLES
ESYTRQLWAY EGLTSYYDDL LTLKAGKVDT ETYLDMLSET MTRVYRGVGR FKQSLRDSSF
NAWTKFYKQD ENAQNAIVSY YTKGALFGLY LDLVIRKETA GKESLDSLMR LLWERFGLTG
QGTDEHSHQA LVAELLGRDV SDIFAYLDNT DDLPLAPLLA EFGVHMALRA ASGQNDTGGG
KPSPLTLSLG AKFKAEPLGV RIQTVAESSS AHLAGLSAGD LLIAIDGLQA TANLENVLNS
YNEGDELALH FFRRDELMTA KLRLQAAPLD TVSLHLEDAT AIAAWLG