Gene Sama_0144 details

Gene Information

Locus tag: Sama_0144
Symbol:
ID: 4602401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 165836
End bp: 167947
Gene Length: 2112 bp
Protein Length: 703 aa
Translation table: 11
GC content: 56%
IMG OID: 639779456
Product: oligopeptidase B
Protein accession: YP_926026
Protein GI: 119773286
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1770] Protease II
TIGRFAM ID:
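
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 165836
end_bp = 167947
gene_length_bp = 2112
protein_length_aa = 703

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is the stop codon and adds no amino acid.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 2112 0 703
```

Under these assumptions the three fields agree: 2112 bp is 704 codons, and 704 minus the stop codon gives 703 aa.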


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGTCCG CATGCAGTCA CAATATCGAT AAACATTCAG GAGCCCTTGC CCGGGAAGAT 
GCGGCGCCTG TGGCCCCTGT CGCGGCCAAG GTGCCCTATG AAATGACTCA CCATGGTGTC
ACCCGGGTGG ATGACTACTA TTGGCTGCGG GACGATGACC GCGCCGACGC GGATGTGATA
GCCCATCTCA ATGCCGAAAA TGCTTACGCC GAAGCCCGAA TGGCACCCTG GAAAGGGTTA
AAAGAAAAAC TCTTTGCCGA AATGACCGCC CGCATGGTGA AGGACGACAG CTCAGTGCCT
TATCTGTGGC ATGGCAAGTA CTATTACCGC CGGGTGGAAG GCGACAAAGA GTACCCCATC
ATCGCGCGCA AAGCGTCCCT GAACGGGACT GAAGAAGTGT TGCTGGATGC CAATGAACGT
GCTGCCGGGC AAGATTTTTA TTCCCTTGGC AATGTCAGCG TCAGTGAAGA TGGCAAGCTT
ATGGCCTTCG GGGAAGACCT GCTGAGCCGC CGGATTTACA AGATCTATTT CAAGGATTTG
GCCAGCGGTA ACTTGTTGGC CGATGTGCTG GACAACACCG AAGGCGAAGC TGTGTGGGCC
AACGATGGCA AGCATGTGTT TTACATCGCC AAAGACCCTC AGACGCTGCT TGGCTATCAG
GTGTATCGCC ACCGTCTTGG GACGCCTCAA AGCGAAGATG TGCTGGTGTA TGAAGAGGCC
GACGACAGCT TCTATATCGG CCTTGGCAAG ACCCTTGATG GCAGCCGGAT TTTACTCTTC
CATGAAAGTA CCACGACCAC CGAAGTGCAG CTGTTGGATG CCAACCAGCC GCTGGGCAAT
TTCACCTCTT TTCTGCCGAG GGAAGAGGGT CATGAATATG CCATCACCAA GCTGGGGGAT
GAGTACTATG TGCTTACCAA CTGGCAGGCC ACCAATTTCC GTCTGATGAA AGTCAATGGC
ACTGACACCG CCGACAAATC CCGCTGGCAG GAAGTTGTGC CCTATAACGA GCAGGTGCGT
ATTGAAGATA TGCTGCTGCT CAAAGACGCC CTGGTGCTGC AAACCCGGGA AGCGGGTGCT
ACCCACATAC GGGTCTACGA TCCGGATGGC AGCCAGAGCC GGGAAATCCA TTTCGATGAA
GCCGCGTACG TGACCTGGCT TGGCACCAAC CGAGACCAGA GCGCGAACAC AGTACGTTTT
GGCTATTCCA GCCTGACCAC CCCAGATGCT ACCTACGAAT ACGACATTCA CACCGGTGAG
CGTAAGTTGC TTAAGCAGCT TAAGGTGCCG GGGTTTGAAG CAGGCGCCTA TCAGAGCGAA
CGTTTGATGC TGCCCGCCCG GGATGGCGTG TTGGTGCCCG TGTCTGTGGT GTATCGCAAA
GATAAATTCA AAAAAGATGG CACCAATCCG CTCTACCAAT ATGGCTATGG TGCCTATGGC
AGCGTGGTCG ATCCGGACTT TTCCCTGTCG GCGCTGAGCC TGCTGGACAG AGGCGTGGTG
TATGCCATCG CCCACGTACG GGGCGGCGAG ATGCTGGGCA GGCCCTGGTA TGATGCCGGT
CGCATGTTCG ACAAGAAGAA CTCCTTCACC GACTTTATTG ATGTGACCGA CGCACTGGTC
AATGCCGGTT ACGGTGCCAG AGACAAGGTT GTGGCCAGTG GTGCCAGTGC CGGCGGACTG
CTTGTGGGCG CCGTGGCCAA TATGGCGCCC GACAGGTACC TGGCCATCCA TGCCGGTGTG
CCCTTTGTGG ATGTGGTCAC CACTATGCTG GATGAGTCGA TTCCACTGAC CACCAACGAG
TACGACGAGT GGGGCAACCC CAACGAGAAG CCGAGCTTTG ACTATATGCT CAGCTACTCG
CCCTATGACA ACGTGACCCG TCAGCCATAT CCCCATCTGT TGGTCACCAC AGGCCTGCAC
GACTCACAGG TTCAGTACTT TGAGCCGGCC AAGTGGGTCG CTAAATTGCG TGAGCTTAAA
ACCGATAACA ACGAACTGCT GCTGGTGACG GATATGGAAG CAGGCCATGG TGGCAAGTCG
GGCCGCTATA GCCAGTTTGA GGATATGGCG TTGGAATACG CATTTTTCCT GCACCTGTGG
GGCATAGAGT GA
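
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as the GC content field can be recomputed. The following is a minimal sketch: `clean_sequence` and `gc_fraction` are hypothetical helper names, only the first sequence line is used as input for illustration, and how the record rounds its 56% figure is not stated, so the sketch makes no claim about reproducing it exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as an illustration only.
first_line = "ATGGTGTCCG CATGCAGTCA CAATATCGAT AAACATTCAG GAGCCCTTGC CCGGGAAGAT"
seq = clean_sequence(first_line)

# Report the cleaned length and the GC percentage of this fragment.
print(len(seq), round(100 * gc_fraction(seq), 1))
```

To check the whole gene, pass all sequence lines (joined with newlines) to `clean_sequence`; the cleaned length should then equal the 2112 bp given in the Gene Length field.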
 
Protein sequence
MVSACSHNID KHSGALARED AAPVAPVAAK VPYEMTHHGV TRVDDYYWLR DDDRADADVI 
AHLNAENAYA EARMAPWKGL KEKLFAEMTA RMVKDDSSVP YLWHGKYYYR RVEGDKEYPI
IARKASLNGT EEVLLDANER AAGQDFYSLG NVSVSEDGKL MAFGEDLLSR RIYKIYFKDL
ASGNLLADVL DNTEGEAVWA NDGKHVFYIA KDPQTLLGYQ VYRHRLGTPQ SEDVLVYEEA
DDSFYIGLGK TLDGSRILLF HESTTTTEVQ LLDANQPLGN FTSFLPREEG HEYAITKLGD
EYYVLTNWQA TNFRLMKVNG TDTADKSRWQ EVVPYNEQVR IEDMLLLKDA LVLQTREAGA
THIRVYDPDG SQSREIHFDE AAYVTWLGTN RDQSANTVRF GYSSLTTPDA TYEYDIHTGE
RKLLKQLKVP GFEAGAYQSE RLMLPARDGV LVPVSVVYRK DKFKKDGTNP LYQYGYGAYG
SVVDPDFSLS ALSLLDRGVV YAIAHVRGGE MLGRPWYDAG RMFDKKNSFT DFIDVTDALV
NAGYGARDKV VASGASAGGL LVGAVANMAP DRYLAIHAGV PFVDVVTTML DESIPLTTNE
YDEWGNPNEK PSFDYMLSYS PYDNVTRQPY PHLLVTTGLH DSQVQYFEPA KWVAKLRELK
TDNNELLLVT DMEAGHGGKS GRYSQFEDMA LEYAFFLHLW GIE