Gene Sama_0456 details

Gene Information

Locus tag: Sama_0456
Symbol:
ID: 4602711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 562057
End bp: 563256
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 56%
IMG OID: 639779792
Product: MSHA biogenesis protein MshN, putative
Protein accession: YP_926336
Protein GI: 119773596
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
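
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm that they agree. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 562057
end_bp = 563256
gene_length_bp = 1200
protein_length_aa = 399


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```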


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTCA TCAATCAAAT GCTGAAAGAT TTGGATAAGC GTGCCGAGCC GCATCAGCTG 
CAGCAGCTGC CATCCACTGT GGCCATTCCA GTGAACAAAT CTCCCGGGCT GCCCTGGCGT
TGGCTTATCC TGCTGTCGCT GCTCCTTGTT GCCATCGCGC TGGTGCTTTG GAACCTGAAA
GCATCCAATG CCAAAGACGC TATGGTATTA AGCCCTATGG TTTCAGGACA GCAAGCTGAG
TTAAACACAG CAAGTGCTGC GCCGGTAAAA ATGGAACCTG CGAGCAATGC TCAGGTGTCA
CATGTTCAGG AGCCCGCAGA TGCCAGCAGC GACACAGCGT CGGGCTTGGC GAAAGAGCCC
GCGGGCGAGT CAGATAAATC AATCAAAGAC ACGACGGGGA ATGGCGAGTC TGAAACCTTA
GCCGAGTCAG TGGCAGTATC ATTCGCAGGA TCAGCAAACG CGCCTGCCCC GTCAGAAACG
TCAGTTGCAG GCGAGGCAAC TGTACTGGCT TTGCAAGTCA ATACCTCGAG CGAGCCGCCG
GTGATTGACT CTTCCAGCGT CGCCATGAAT ACCAATACGC CATCCGCAGG CAGCATGGCT
GTCACCGAAG TGGTGCTGCC CCCGGCAGAG CAAGCCGATC GCGCCATGCT AAAGGCCAAT
GGAGCTAGGG ATGCAGGCAA GCTGGATGAG GCGATGCGCC AATACGCCAT GGCGCTTTCT
TATGAGCCCG CGCGCCATGA AGCAAGACGC CAATTGGCGG CGCTTCACTA TGGTCAGGGG
CAAGCAGGCG AAGCCATCAA ACTGCTCGAG CGTGGGTTGG TGCAATTCCC TGAGCAGTCG
TCTTTTGCAC TACTTCTGGG CCGTTTGTGG CGGGAACAGG GCAATAAATC GCAGGCGCTG
GCAGCCCTTG ATGTCATTGG GGATACAGAT TCTCTTTCTC GGGATAAGTG GCTATTGGTG
GCCGATATTG CCCGTGAACA AGACGATCAC GCGCTGGCCG AAGCCGCCTA CCAAAAGTTG
TTGGGCACCG GTATGGAAAA AGCACAATGG TGGTTGGGGC TTGCCTACGC ACAGGATGCA
CAGGGCAAGA TGGCAGATGC CCGTTATCAC TATCAGCGTG CCCTTGGCAC TGCCGGGTTA
TCATCCGACG CCCGCGCCTA CATAGAAAAC AGATTGATGC AGTTGGGAGA TAACCAATGA
 
Protein sequence
MSVINQMLKD LDKRAEPHQL QQLPSTVAIP VNKSPGLPWR WLILLSLLLV AIALVLWNLK 
ASNAKDAMVL SPMVSGQQAE LNTASAAPVK MEPASNAQVS HVQEPADASS DTASGLAKEP
AGESDKSIKD TTGNGESETL AESVAVSFAG SANAPAPSET SVAGEATVLA LQVNTSSEPP
VIDSSSVAMN TNTPSAGSMA VTEVVLPPAE QADRAMLKAN GARDAGKLDE AMRQYAMALS
YEPARHEARR QLAALHYGQG QAGEAIKLLE RGLVQFPEQS SFALLLGRLW REQGNKSQAL
AALDVIGDTD SLSRDKWLLV ADIAREQDDH ALAEAAYQKL LGTGMEKAQW WLGLAYAQDA
QGKMADARYH YQRALGTAGL SSDARAYIEN RLMQLGDNQ
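
The gene and protein sequences in this record can be cross-checked by translating the gene sequence and by recomputing its GC content. The following is a minimal sketch, not the database's own method. It builds the codon table from the amino acid assignments of NCBI translation table 11, which match the standard code, and it ignores the table's alternative start codons because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGAGCGTCA TCAATCAAAT GCTGAAAGAT TTGGATAAGC GTGCCGAGCC GCATCAGCTG"
print(translate(first_line))  # MSVINQMLKDLDKRAEPHQL
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the Protein Length and GC content fields.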