Gene Sbal195_2137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_2137 
Symbol 
ID5753887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp2542652 
End bp2543971 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content54% 
IMG OID641288423 
ProductSPP1 family phage head morphogenesis protein 
Protein accessionYP_001554566 
Protein GI160875250 
COG category[S] Function unknown 
COG ID[COG2369] Uncharacterized protein, homolog of phage Mu protein gp30 
TIGRFAM ID[TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.232277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAC CAAGAGTGCC TAAAACCGTT GATTTAAGCA TTGCCATTAA CCAAGCCCCC 
GCCGATGCGG TGGCCTATTT CCGTGCCAAA GGCTTTGCTA TAAGTGACGA TTGGCAAGAC
GTGTGGACCC GCGCCCACGC CCGTGCGTTT ACTGTGGCCA AGGCGGCACA GATGGATGTG
CTCACGGCGA TCCGTAATCA AGTGGATGCA GCCCTAAGCC AAGGGTTAAC CGCTAAGCAG
TTTCAGGCAA ACCTTAAGCC TCAACTCGAA AAGCTCGGAT GGTGGGGTAA AAAGGAAGTC
GATGGCCGCG AGGTACAGCT GGGGAGTCCC TACCGCTTAA ACACTATCTA TCGTCAAAAC
CTGCAAACCG CTTACATGGC TGGGCGCTAT CGGCGCATGT TATCGCGCAC CAAAACCCAT
CCCTATTGGC AGTATGTGGC GATAGATGAC GGCCAAACAC GGCCAGCCCA TGCGCGGCTT
AGGGGTAAAG TGTTCCGCTT TGACGATCCA ATATGGGACA TCATTTATCC TCCCAATGGC
TGGGGCTGTC GTTGCCGCGT TCGGGCGCTC ACCGAGGCGC AAGTGAAGGC GATGGGGATC
ACTGTGGAAA ATGGCGAAGG TTATATCCAG CGCTTTGACA CTGAGACAGT CGCGCGCGGA
ACGGGTGAAG TGTTAACCGT GCCCCATGCG CGTATCGATC TGCCCGATGG CAGCAGCATG
AGCCCCGATT TAGGCTGGGC CTATAGTCCA GGCGAAGCCG CCTTTGGTAC CGACGTGGCC
GTCGCTAAAA AGCTGGGCAC TATTCAATCC CTCGATACCC GCGCGCAGTT TATTCAGGCA
CTCAATAATA GCCCACTGCG CCACGCCCAG TTTGCCCAAT GGACCGATGA AGTCCTTGCC
GCCAATCCAG GGCAGAAACG AAGACCAGGC TTAGGCGTAC AGGCTTTAGG TTTTATGACG
CCATCAATTC AAGCTTCGGT AACGGCGCGT TTAGGCCGCG AGCCTAGCGC ATTGCTGGCT
ATAACTGAGC GGCAACTCAG CACAGCGCAA GCCTCGGTGA CGCCTAACAA GTTACAACAA
TTACCGCTGA TGCTGGCTAA GCCCGAGGCC GTTCTATGGG ATAGCGATAA TCAAACCTTG
CTTTATGTCT ATTCGACGGA ATGGCAAGCG GATGATGAAC GAGCTAGCAA ATTACTTATC
GCGAGCCATT GGTCATTACA GCGCACACCT AATGAAGGGC AAGTGGAAGG GTTTAAGTTA
TCGCTAGCAG AATTACAGCA AGCCCAATAC CAAGTGCTTG AAGGCCAGTT AAAAGGGTAG
 
Protein sequence
MPKPRVPKTV DLSIAINQAP ADAVAYFRAK GFAISDDWQD VWTRAHARAF TVAKAAQMDV 
LTAIRNQVDA ALSQGLTAKQ FQANLKPQLE KLGWWGKKEV DGREVQLGSP YRLNTIYRQN
LQTAYMAGRY RRMLSRTKTH PYWQYVAIDD GQTRPAHARL RGKVFRFDDP IWDIIYPPNG
WGCRCRVRAL TEAQVKAMGI TVENGEGYIQ RFDTETVARG TGEVLTVPHA RIDLPDGSSM
SPDLGWAYSP GEAAFGTDVA VAKKLGTIQS LDTRAQFIQA LNNSPLRHAQ FAQWTDEVLA
ANPGQKRRPG LGVQALGFMT PSIQASVTAR LGREPSALLA ITERQLSTAQ ASVTPNKLQQ
LPLMLAKPEA VLWDSDNQTL LYVYSTEWQA DDERASKLLI ASHWSLQRTP NEGQVEGFKL
SLAELQQAQY QVLEGQLKG