Gene Sbal223_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3937 
Symbol 
ID7086700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4689931 
End bp4691247 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content47% 
IMG OID643462813 
Productprotein of unknown function DUF21 
Protein accessionYP_002359834 
Protein GI217975083 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTCT TTGATAACGT GATGATTATA TTGTGCCTAA TCGGCGCGAG TTGTTTCTTC 
TCTATGTCAG AAATTGCACT TGCCGCTTCA CGTAAAATCC GTTTACGTCA GTTGGCCGAT
GAGGGCGATG CCCGTGCAGA AAAAGTGCTG CAGTTACAAG CTGTTCCCGG CAGTTTTTTT
ACTGTGGTAC AAATCGGTCT TAATGCCGTT GCCATCATGG GCGGTATCGT CGGTGAGTCG
GCATTTCGGC CTTATTTCTA TGAGTTACTG TCGCCTTTGC TGACCGATCC TTGGTTGAGC
CAAATGAGCT TTGTTTTATC CTTTATTGTC GTCACCAGTG CCTTTATTCT GGTGGCCGAT
TTGATGCCAA AACGCATTGC CATGGCGATG CCAGAGCCCG TCGCGTTGGC CGTTGTTGGG
CCTATGTCTT TCTGTATCGT GCTACTGCGT CCATTAGTGT GGTTTTTCAA TGGCATGGCG
AGCGGCATCT TTAAACTGCT GCAAATCCCA ACCGTGCGTA ACGATGCCAT CACCTCTGAC
GACATTTATG CCGTGATGAA CGCGGGCGCC GAAGCAGGGG TATTAGATCG CGGTGATCAA
CAGATGATGG AAAATGTGTT TGAAATGCAA ACCGTTTCTG TGACTTCGGC CATGACGGCC
CGTGAAAGCT TAGTGTACTT TTTACTGCAA GATAGCGAAG AAGATATTAA GCGTAAGATT
TCTGAAGATC CCCACACTAA GTTCCTCGTC TGCGATGGTC AGTTAGATAT GATCAAAGGT
TTTGTGGATG CAAAAGAGCT GCTGATCCGA GTGATTAACG GTGAGAATAT TACTCTAAAA
GGCAGTAACT TAGTCCACAC TTCGCTGATC ATTCCTGATA CTTTGAGCCT ATCAGAGGCA
ATGGAATACT TTAAGAATAG CCGCGCCGAT TTTGCCGTGG TCATGAACGA ATATGCGTTA
GTGGTGGGGA TTGTTACGAC CAACGATTTG CAGCGCGCGG TAATGGGTGC TTGGTCATTG
CACGAGAGCG AAGAGCAGAT CATCGCCCGT GATAGCAACT CTTGGTTGGT TGATGGTGTA
ACGCCGATTA CTGATGTGAT GCGCGCCTTC GGCATCGAAG AATTCCCGCA TAATCAGAAC
TACGAAACCA TTGCCGGTTT TATGATGTAT ATGCTGCGTA AAATCCCTAA GCGTACCGAT
TTCGTGAACT ATGCGGGCTA TAAATTTGAG GTGGTCGATA TCGATTCTTA CAAGGTCGAT
CAGTTGTTGG TGACCCGTAT CGATCCTATC GATAAGCTCA ACTCGCCAGA TGTGTAA
 
Protein sequence
MSLFDNVMII LCLIGASCFF SMSEIALAAS RKIRLRQLAD EGDARAEKVL QLQAVPGSFF 
TVVQIGLNAV AIMGGIVGES AFRPYFYELL SPLLTDPWLS QMSFVLSFIV VTSAFILVAD
LMPKRIAMAM PEPVALAVVG PMSFCIVLLR PLVWFFNGMA SGIFKLLQIP TVRNDAITSD
DIYAVMNAGA EAGVLDRGDQ QMMENVFEMQ TVSVTSAMTA RESLVYFLLQ DSEEDIKRKI
SEDPHTKFLV CDGQLDMIKG FVDAKELLIR VINGENITLK GSNLVHTSLI IPDTLSLSEA
MEYFKNSRAD FAVVMNEYAL VVGIVTTNDL QRAVMGAWSL HESEEQIIAR DSNSWLVDGV
TPITDVMRAF GIEEFPHNQN YETIAGFMMY MLRKIPKRTD FVNYAGYKFE VVDIDSYKVD
QLLVTRIDPI DKLNSPDV