Gene Sbal223_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4454 
Symbol 
ID7094371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011668 
Strand
Start bp38622 
End bp39842 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content40% 
IMG OID643467317 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002364275 
Protein GI217980299 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones74 
Plasmid unclonability p-value0.859615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones97 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGG TTTTATATTC ATTGCCTGAG GGGTGGCACT TAGAAACTAT TGGTGAAGTA 
GCATCTAAAT TGGTTACAGG TAAGACACCA TCAACAAAAA AGGCAGAATA TTACTCTTCT
AGCGAAGTGG ATTGGTTTAC TCCATCAGAT TTTGGCTCCA CTGCTGTGTT GAATAATTCT
CGACGGAAGC TTAGTTCTTT AGCAATTGAA GACGGTACGA TAAAGAAAAT GCCAAAGGAC
TCTATTCTTT TGGTAGCTAT TGGCGCAACT ATTGGAAAAG TTGGCCTAGC GGAAGATGAA
TCTTGCTTCA ATCAACAAGT TACAGGGATA CACTTTAAAG AAAAAATTCA TCCTAAGTAT
GCGTATTATT GGTTAAGTTA TATAAAACCA GAAATAATCA CGAAATCTTC ACAGGCCACA
CTCCCGATAA TTAATCAGAC GGGCATTAAA GGTCTTTCAT TCTTGTATCC TGAAAAAGAA
GAACAAAAAT GCATCGTCGA AAAACTCGAT GCACTGCTTA CCCGCATCGA CACCGCCATC
GAGCATTTGC AGGAAAGCAT CACACTGAAA AATAGCCTTC TTCAATCAGC ACTCGATGGT
CAGTTTTCAG CTATCACTGA GAGAATGACG ATTGAATCGC TAGCTGAGGT AAAAGGTGGT
AAGCGCTTAC CAAAAGGCGA AAAGCTAAGT GATGAAGAGA CTGAGCACCC ATATATCCGA
GTGGCAGATT TTACAGATAA AGGAACGATA GATTTATCTG GTATCAAGTA CATATCGAAA
GAAATCCACG AACAAATCAA ACGCTATGTG ATTTCCAAGG ACGACCTTTA TATCAGTATT
GCTGGGACTA TCGGTAAGAC AGGTTTCGTC CCTTCAGAAT TGGACGGTGC AAACCTGACA
GAAAACGCCG CTAAGTTGGT TATCAAAGAT AAACAACAAC TAGATTTAAG CTACCTGTAT
CTATTTACAT TGACCTCTGA CTTCTCTGCT CAGGCAGGAT TAGCGACGAA GACCGTTGCA
CAGCCAAAGT TGGCGCTAAC TCGTTTAAGC AAGATTGAAA TACCAATATG TTCCTTGGAA
GAACAGAAAT CATTGGTATC TACAATTGAA GCGCTGAAAA GTAAAATTCA CGATGCGGAA
GCAGTCCTTC TAGGGAAAAT AGAAGACTTG AAAAGTTTAA AAGCATCAAT TCTCGATTCT
GCCTTCAAAG GCGAGCTCTA A
 
Protein sequence
MEQVLYSLPE GWHLETIGEV ASKLVTGKTP STKKAEYYSS SEVDWFTPSD FGSTAVLNNS 
RRKLSSLAIE DGTIKKMPKD SILLVAIGAT IGKVGLAEDE SCFNQQVTGI HFKEKIHPKY
AYYWLSYIKP EIITKSSQAT LPIINQTGIK GLSFLYPEKE EQKCIVEKLD ALLTRIDTAI
EHLQESITLK NSLLQSALDG QFSAITERMT IESLAEVKGG KRLPKGEKLS DEETEHPYIR
VADFTDKGTI DLSGIKYISK EIHEQIKRYV ISKDDLYISI AGTIGKTGFV PSELDGANLT
ENAAKLVIKD KQQLDLSYLY LFTLTSDFSA QAGLATKTVA QPKLALTRLS KIEIPICSLE
EQKSLVSTIE ALKSKIHDAE AVLLGKIEDL KSLKASILDS AFKGEL