Gene Sbal223_4217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4217 
Symbol 
ID7089158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp5012478 
End bp5014517 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content49% 
IMG OID643463091 
ProductOligopeptidase A 
Protein accessionYP_002360106 
Protein GI217975355 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACC CTTTGCTTAG CGGCGCAAAA TTACCGCTTT TTTCACAGAT TAAACCGGAA 
CATATTCAAG TCGCCGTTGA GCATGCCATT GCTCAGTGCC GCACTAAGAT TGATCAAGTG
CTGCAGAATC AAGGTCCTTA CACTTGGGAC AACTTGGTCG CGCCACTAGA ACAGGTTGAC
GATGAACTGA GCCAAATTTG GTCGCCCGTT TCTCACATGA ATTCAGTGAC CAGCACCGAT
GAATGGCGCG CTGCTCATGA TGCTTGTTTG CCATTATTGT CGGAATACGG CACCTATGTT
GGTCAACACC AAGGTTTGTA TCAAGCCTAT AAAGCCCTGC GTGCATCGAA TGAATTTACA
CAATTAAGCC AAGCCCAGCA GAGGGTGATT GAACACAGTC TGCGCGATTT TGAACTATCG
GGTATTGGCT TAGATGACGC GCAAAAAGTG CGTTATGGCG AAATGGTAAA ACGTCTTTCT
GAACTCACCA GTGGCTTTTC GAATCAATTA CTGGATGCGA CCCAAGCTTG GACTAAGTTA
ATCACAGATG AAGCGGAACT TGCTGGCCTG CCAGAGTCAG CCAAAGCCGC CGCTAAGGCG
ATGGCCGATG CGCGTGAGTT AGACGGTTGG TTGTTTACCT TAGATTTCCC TTCTTATTTA
CCTGTCATGA CTTACAGCGA AAACCGCAGT CTTCGCGAAG AGTGTTACCG CGCTTTCGTT
ACGCGCGCCT CGGATCAAGG TCCTAATGCT GGCGAGTTTG ATAACAGCCC GCTGATGGAC
GAAATCCTTG CTTTGCGCCA CGAACTCGCA TTGTTACTCG GTTTTGACAG TTTTGCCGAC
AAATCCCTAG CGACGAAAAT GGCTGAAACG CCGGCGCAGG TGATGGCATT TTTAAATGAG
TTAGCGCTGC GCTCAAAGGA TCAAGCTAAG GCCGAAGTGG CCGAGCTGAG AGCGTTTGCT
GAGGCACAAT ACGGCGTGAG CGAAATGGCA TCTTGGGACT TAAGTTTCTA TGCCGAGAAG
TTGCAGCAGC ATAAGTATGA GATTTCACAG GAAATCCTGC GTCCTTACTT CCCTGAAGAT
AAAGTGCTCT CGGGTTTGTT CTACACAGTC TCGCGTTTGT TCGGCCTTAA AATCGTTGAG
CAAAAAGACC TTGATCGTTG GCATAAAGAC GTGCGTTTCT TCGATATTTT CGACGAAACT
GACGAACACA GAGGCAGTTT TTACCTCGAC TTATATGCCC GCACAGGTAA ACGTGGCGGG
GCTTGGATGG ATGACTGCCG TGTGCGCCGT CAAACCGCCG ATGGATTGCA AAAGCCAGTG
GCCTATTTGA CCTGTAACTT TAATGGTCCG GTCGATGGCA AACCTGCGCT ATTCACCCAC
GATGAAGTGA CGACCCTGTT CCACGAATTT GGCCATGGTA TCCACCACAT GCTGACCAAA
ATCGATGTTG CAGGCGTGTC GGGTATCAAT GGTGTGCCTT GGGATGCGGT CGAGTTACCG
AGCCAATTTA TGGAAAACTG GTGCTGGCAG GAAGAAGCCT TAGCCGAAAT TTCAGGTCAC
TTCGAAACGG GCGAGCCATT ACCTAAAGCC TTGTTAGATA AGATGCTGGC GGCGAAAAAC
TTCCAATCTG GCATGATGAT GCTGCGCCAA CTTGAGTTCT CGCTATTCGA TTTTAGAATG
CACCACGAAT ACGACCCCGC TAAAGGCGCG CGCATTCAAG AAACCTTAGA CGAAGTGCGC
CGTCAGGTGG CCGTATTAAC GCCGCCAGAC TTTAACCGTT TCCAACATGG TTTTGCCCAC
ATTTTTGCGG GCGGTTATGC CGCGGGTTAT TACAGTTATA AATGGGCAGA AGTGCTATCG
GCGGATGCGT TTTCCCGCTT CGAAGAGGAA GGGATTTTCA ATCCAGAAAC GGGTCGTCGC
TTCCTGCACA ATATCCTCGA AATGGGTGGC TCAGCCGAAC CTATGGACTT GTTCAAACAG
TTTATGGGAC GCGAGCCGAA TATCGATGCC CTGTTAAGAC ATTCAGGCAT TGCTGCATAA
 
Protein sequence
MSNPLLSGAK LPLFSQIKPE HIQVAVEHAI AQCRTKIDQV LQNQGPYTWD NLVAPLEQVD 
DELSQIWSPV SHMNSVTSTD EWRAAHDACL PLLSEYGTYV GQHQGLYQAY KALRASNEFT
QLSQAQQRVI EHSLRDFELS GIGLDDAQKV RYGEMVKRLS ELTSGFSNQL LDATQAWTKL
ITDEAELAGL PESAKAAAKA MADARELDGW LFTLDFPSYL PVMTYSENRS LREECYRAFV
TRASDQGPNA GEFDNSPLMD EILALRHELA LLLGFDSFAD KSLATKMAET PAQVMAFLNE
LALRSKDQAK AEVAELRAFA EAQYGVSEMA SWDLSFYAEK LQQHKYEISQ EILRPYFPED
KVLSGLFYTV SRLFGLKIVE QKDLDRWHKD VRFFDIFDET DEHRGSFYLD LYARTGKRGG
AWMDDCRVRR QTADGLQKPV AYLTCNFNGP VDGKPALFTH DEVTTLFHEF GHGIHHMLTK
IDVAGVSGIN GVPWDAVELP SQFMENWCWQ EEALAEISGH FETGEPLPKA LLDKMLAAKN
FQSGMMMLRQ LEFSLFDFRM HHEYDPAKGA RIQETLDEVR RQVAVLTPPD FNRFQHGFAH
IFAGGYAAGY YSYKWAEVLS ADAFSRFEEE GIFNPETGRR FLHNILEMGG SAEPMDLFKQ
FMGREPNIDA LLRHSGIAA