Gene Sama_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0474 
Symbol 
ID4602729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp583184 
End bp584632 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content58% 
IMG OID639779810 
ProductTldD protein 
Protein accessionYP_926354 
Protein GI119773614 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.862683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.185697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTTT TAACCCAAGT TGAACAGAGC CTGCTCGCCA ACAACCTGAG CCTGGATAAC 
CTGGCCGAGT ACCTCAAACT CATCCACAAC CATAATGTGG ACTTTTCTGA TCTCTATTTT
CAGGGCAGCA GGCATGAAAC CTGGGTATTG GAAGACGGCA TCATCAAAGA AGGCTCCTTC
CACATCGAAC GCGGTGTGGG AGTGCGTGCC ATCAGTGGTG AGAAAACCGG ATTCGCCTAC
GCCGATGAAA TCACCCCGGC TGCCCTCGAA GCGGCAGCGC TGGCAGCACG CGGTATAGCG
CAGGTCGGCA GCGATTATCA GGTCAAAGCC TGGAAACGTC AGGGCGCCAA GACCCTGTAT
GACAGCGCCG ATCCTATTGC CGCCATGGAA GAAGCCACCA AAATCGCCAT GTTGCGTCAG
GCCGACGCCT ATATCCGCAG CCTCGATTCC CGCATCATCC AGGTGGTGGT AAGCCTCTCT
GCTGTGCACG AAGAAATCCT GGTAGCCGCA AGTGATGGCA CCCTGGCGGC CGACATTCGT
CCGTTGGTGC GGTTCAACTG TTCAGTGATT ATGGAAGAAA ACGGCAAGCG GGAGCGCGGC
GGTGCCGGTG GCGGTGGCCG CCACGATTTT GCCCCCCTCT TTGCCAATGA TGACACCGGC
ATGCCGGTGT GCTTTGCCTT TGCCCGTGAA GCCGTGCGTC AGGCGCAGGT GAACCTTAAC
GCTGTGGACG CGCCTGCCGG TGAAATGCCT GTGGTACTTG GACCCGGCTG GCCGGGCGTG
CTGCTGCACG AAGCCGTTGG CCATGGTCTG GAAGGCGACT TTAACCGCAA AGGATCCAGC
GCCTTCAGTG GCAAGGTGGG TCAGCAGGTA GCTTCCAAGC TGGTGACAGT GGTGGACGAC
GGCACCCTGC CGGGCCGCCG CGGCTCCCTC ACCATCGATG ATGAAGGCGT ACCGGCCCAC
CGAACCGTAT TGATTGAAAA CGGTATCCTG AAAGGCTATA TGCAAGACAA GCTCAACGCC
CGTTTGATGG GAGTGGCGCC CACAGGTAAC GGTCGCCGTG AGTCCTACGC CCATCTGCCT
ATGCCACGTA TGACCAACAC CTATATGACA GCCGGTAACG ACAACCCAGC CGATATTATC
AAGTCGGTGG AAAAGGGCAT TTACGCACCT AATTTCGGCG GTGGTCAGGT GGACATCACG
TCGGGCAAAT TCGTGTTCTC GGCCTCTGAA GCCTACCTGA TTGAAAAGGG CGAGATCACC
CGCGCCATCA AGGGGGCGAC CCTGATTGGC AACGGCCCTG AGGCCATGAG TCAAATCTCT
ATGGTGGGTA ACGATCTCGA ACTCGACAAG GGGGTGGGAG TGTGTGGTAA AGATGGCCAG
AGCGTGCCTG TGGGTGTGGG CCAGCCAACC CTGAGACTCG ATCGCTTAAC TGTGGGCGGC
ACCGCCTGA
 
Protein sequence
MPFLTQVEQS LLANNLSLDN LAEYLKLIHN HNVDFSDLYF QGSRHETWVL EDGIIKEGSF 
HIERGVGVRA ISGEKTGFAY ADEITPAALE AAALAARGIA QVGSDYQVKA WKRQGAKTLY
DSADPIAAME EATKIAMLRQ ADAYIRSLDS RIIQVVVSLS AVHEEILVAA SDGTLAADIR
PLVRFNCSVI MEENGKRERG GAGGGGRHDF APLFANDDTG MPVCFAFARE AVRQAQVNLN
AVDAPAGEMP VVLGPGWPGV LLHEAVGHGL EGDFNRKGSS AFSGKVGQQV ASKLVTVVDD
GTLPGRRGSL TIDDEGVPAH RTVLIENGIL KGYMQDKLNA RLMGVAPTGN GRRESYAHLP
MPRMTNTYMT AGNDNPADII KSVEKGIYAP NFGGGQVDIT SGKFVFSASE AYLIEKGEIT
RAIKGATLIG NGPEAMSQIS MVGNDLELDK GVGVCGKDGQ SVPVGVGQPT LRLDRLTVGG
TA