Gene Sbal_3601 details


Gene Information

Locus tag: Sbal_3601
Symbol:
ID: 4844173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS155
Kingdom: Bacteria
Replicon accession: NC_009052
Strand:
Start bp: 4229447
End bp: 4231444
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 50%
IMG OID: 640120869
Product: phage terminase GpA
Protein accession: YP_001051945
Protein GI: 126175796
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
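
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4229447
END_BP = 4231444
GENE_LENGTH_BP = 1998
PROTEIN_LENGTH_AA = 665


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp):
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4231444 - 4229447 + 1 gives 1998 bp, and 1998 / 3 - 1 gives 665 aa, matching the listed values.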


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATAT CGGCCGCGCA GATTAAAAAT CTGAAAGCCG CCGTTGCTGC TGGGCTGCGT 
TCGTTCTATC GCCCACCCAT GCTCACCTGT TCTGAATATG CCGACGAGCA CTTTTACATG
TCGTCGGAGT CCTCTTACAC CGAGGGTAAG TGGGAAAGTT TACCGTTTCA AATTGGCATT
CTTAATGCCA TGGGTAACGA CCAAATCAGC ACGCTTAACT TAATGAAGTC AGCGCGTGTC
GGTTACACCA AAATGCTGAT GGCTAACGCT GCTTACAAGA TTGAACACAA AAAGCGCAAC
GTGTTGATCT ATCAGCCTCG TGATGGTCAA GCCAAAACCT TCATGAAAAA GCACGTTGAA
ACGGCGATTC GTGACATCCC TGTTTGGCGC GCGCTTGCAC CTTGGATGGG TCGCAAACAT
AAAGACAGCA CGCTAGAAGA TAAAATATTC ACCAATGGCA AAACGCTGAT GGTGCGCGGT
GGTACCGCTG CAGCTAACTA TCGCGAAATC TCCACCGATG ATGTGATCTA CGATGAGCTA
GCGGGTTTTG ATGAATCCAT CGAGCACGAA GGTAACGCCA CATCGCTCGG TGATACTCGT
ATCGAACTGT CGATGTTTCC TAAGTCGATT CGCGGTTCAA CGCCTAAAGT GCTCGGTACC
TGCCAGATTG AAAAAGCCTG CAGCGAATCA CCGCACTATT TTAGGTTCAA CTTACCTTGC
CCACACTGCG ACGAACTGCA GGATTTAAAG TGGGGCGGCC CCGAAGAAGC CTTTGGGATT
AAGTGGCATA AAAATGCCAA GGGAGAACAT GACCCAAGCA CAGCCTATTA TCTGTGTGAG
CACTGTGGTT GCTGTATCGA AAACAATCAG CTCGATGATA TGGAGCTGCA CCCAAGCGCA
GTTTGGATAT GCGAAAACAC CGGCATCCGC ACTAAAGACT TTTTAGATTT TTATGATGCC
GACGGAAACG ACATCACCAC GCCGCCCAAT ATCTCGATTC ATATCTGGTC GGCCTATAAC
TCTCTCAACA GCTGGGCGAA ACTGGTTACT GAGTTCTTTA AAGCCAAAGG CGATAAAGAA
AAGCTGCAGA CTTTCGTCAA CACTAAGTTA GGCCAGCCAT GGGATAACGA CAACGGCGAA
CGCTTAGAGT GGGAAGAATT AGCGAAGCGC CGCGAAATGT ACCCGAGTGG CAAAGTACCT
AACTGGGTGG TGTATTTAAC GGCAGGTATC GACACCCAAG ATAACCGTTA CGAAGGCCGT
GTTTGGGGCT GGGGTGCGGG TAAAGAGGCG GCGCTAATTG ACCGCTTTAT TCTCCATGGT
GATCCCGCTG ATCAAGTGCT TAAAGACAAA GTGGCTGAGC GTATTGCGCA AAGCTATGCC
CGTGCTGATG GCGTTGTGCT CAATATCGGC GTAGTGGGTT GGGATTCAGG CGGCCACTAC
ACCGATGACG TTTACGCCAT GAGTAAAAAG CTTGGGCTAA TGCGGGTTAT ACCCATTAGA
GGTGCCAACG TTTACGGCAA GCCGATCGCC AACTTCCCCC GTAAGCGAAC CGCCAAAGGT
GTTTACTTAA CGGAGGTTGG TACCGACAAC GCCAAAGAGC TGTTGATGTC GATGTTGCGA
ATTGCCCCTG ATGTTGATGT GCGTAAGCCT GGTGCAATTC ACTTCCCGTT AAACGAAGCG
GTATGTGATG ACGTTGAGCT GCAACAGCTC ACCAGCGAAC GCAAGGTGCC GGTGCGCCAA
AACGGGCGGA TCATCTATAA ATGGGACAAC CAAAAGCGCC GCAACGAGGC GTTAGATTGT
TTCGTTTACG CCTTGGCCGC GTTGTATATC GCGATAGAAA AATTCGGCAT CAATCTCGAC
AAGCTTTCAC AAGTTACCCC GATCGCCATA TCAAGCGACC AACCCAAAGA ACCAAAACCC
AAAGCCGTAA AACAGGCCAA TGCAAATGCT GCTTACCTAA ACGGTGGCGG TGGTGGCAGT
TCTGGCGGTT GGCTGTAG
 
Protein sequence
MSISAAQIKN LKAAVAAGLR SFYRPPMLTC SEYADEHFYM SSESSYTEGK WESLPFQIGI 
LNAMGNDQIS TLNLMKSARV GYTKMLMANA AYKIEHKKRN VLIYQPRDGQ AKTFMKKHVE
TAIRDIPVWR ALAPWMGRKH KDSTLEDKIF TNGKTLMVRG GTAAANYREI STDDVIYDEL
AGFDESIEHE GNATSLGDTR IELSMFPKSI RGSTPKVLGT CQIEKACSES PHYFRFNLPC
PHCDELQDLK WGGPEEAFGI KWHKNAKGEH DPSTAYYLCE HCGCCIENNQ LDDMELHPSA
VWICENTGIR TKDFLDFYDA DGNDITTPPN ISIHIWSAYN SLNSWAKLVT EFFKAKGDKE
KLQTFVNTKL GQPWDNDNGE RLEWEELAKR REMYPSGKVP NWVVYLTAGI DTQDNRYEGR
VWGWGAGKEA ALIDRFILHG DPADQVLKDK VAERIAQSYA RADGVVLNIG VVGWDSGGHY
TDDVYAMSKK LGLMRVIPIR GANVYGKPIA NFPRKRTAKG VYLTEVGTDN AKELLMSMLR
IAPDVDVRKP GAIHFPLNEA VCDDVELQQL TSERKVPVRQ NGRIIYKWDN QKRRNEALDC
FVYALAALYI AIEKFGINLD KLSQVTPIAI SSDQPKEPKP KAVKQANANA AYLNGGGGGS
SGGWL
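
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the sequence is pasted as shown, in space-separated blocks of ten bases, and it uses the standard codon-to-amino-acid assignments. Translation table 11 shares those assignments and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAGTATAT CGGCCGCGCA GATTAAAAAT"))  # MSISAAQIKN
# The final blocks end in the stop codon TAG.
print(translate("TCTGGCGGTT GGCTGTAG"))  # SGGWL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.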