Gene Sbal195_4237 details


Gene Information

Locus tag: Sbal195_4237
Symbol:
ID: 5756068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 5014115
End bp: 5015506
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 50%
IMG OID: 641290593
Product: Sel1 domain-containing protein
Protein accession: YP_001556655
Protein GI: 160877339
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
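
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5014115
end_bp = 5015506
gene_length_bp = 1392
protein_length_aa = 463

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # coordinate span vs. listed gene length
print(gene_length_bp % 3 == 0)             # CDS length should be a multiple of 3
print(coding_codons == protein_length_aa)  # codons minus stop vs. protein length
```

Under these assumptions all three checks agree with the listed values: 1392 bp, 464 codons, 463 residues.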


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000679734
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAGATAT TCACGATTGC GACTCTCCTA TTACTGCTGA CCCTAAGCTT TGCAGCAACA 
GCGGAAAAAT CCGCGCTGGA ACAGTTATTT GAGCAACAAC AATACAATGC TTTCTTACAA
CAAGCGCAGC AGCAAGCCGC AGAAAACAAT GCTGATGCGT TATTTCTCCT GGGTAAAGCC
TACCATCTCG GCCGTGGCGT CACGCAAGAC AATGCCACCG CCAGCCAATA CTATGAACAA
GCCAGAGCCT TAGGTTCGGC CCGTGCCAGC CATAATCTAG GCAGTATGGC GCTGGATAAT
GATCGAAAAA CGCAGGCCAT CCCCTTTTTA GAAGAAGCAC TGGCGCGGGG TCTTAAGCTA
CCGACCTTAT ATAACCTAGG ACGAGCACAC AGTCCGGCAG ATCCCAGCTC AAGGTTCTAT
CTAACGAAAG CAATCGCAGC GGCGCAGCGA GCGGGGAGTT ACTTTGGACA AGCCTTTGAA
CTTCAACCCG ATAACCTCTA TCTGGATAAT GCCTCACGCG AGTATCTGCG CGCTTACTTA
ATAGCGATGC AAAGTGTCGG CAGCGAAAGA GAAAGTCTCG ACCTCCCGCA GTTACGCCAG
CAAGCCATAA AGTGGCTGGA ATTGGGTATG GCAGCAGATG ATGGTACGGC GTGGACAAAC
TACGGCGCCC TACTGTTGAA TGAAAATGAC TACAGCGGCG CAAAAGCCGC ATTTTTACAA
GGTGCTAAAC GTCAAGTTGC CATCGCTAAC TATCATTTAG GCAGCATGGA AGAGCGTGGT
TTGGGCGCAG AGGCCAATAA AGTGCAAGCA CTGGTATACT ACGAGAAAGC AGCGTTAGCG
GGAATGGAAG AAGCGAAGGC TCCCGCTTAT GAACTGCTTA AGGCACAACT GGAATATGAA
GACGAACTCA CTAAATTGGA ACAAGGCATT GCCCGCTTTA ATGCACTAAA ACAGCAAGAA
CAGTATGTTC CCATCTCACT GGACTCTGTT ATAAATCGGC TGGCGTGGGG CACGTTTCTG
GCACAGCAGC GGCAACTGAC TCTTTCGCTA CCCACAGCGG CCAAAATCCA ATTAAGCTTG
CAAGCCTGTG GTTTAGGCTA CAACCAATTG CATGGCAGCA CATACAATAT TGGAGAAAAC
TCGCACTGGC GTATGGCGGC GTATCTGACC TTAGCCGACA AAATCGCTTT ACCACTTGAA
GGCAGAGTCG ACGAGCATGG CTGTGCCAGC TTAACGATTG CCATGACAGA TCAGCTGTAT
CGCTTATTAC AACAAGGTGC CGTATTCGGA TTGCGCTTTC CTAACTACAC ACTGCCACTG
GCATTAAGCC AGCAAGAAAA TATCTTGCAG CTGACACTTA TGCCCGTCGA GACCCCATTA
CCGGTTGATT AG
 
Protein sequence
MKIFTIATLL LLLTLSFAAT AEKSALEQLF EQQQYNAFLQ QAQQQAAENN ADALFLLGKA 
YHLGRGVTQD NATASQYYEQ ARALGSARAS HNLGSMALDN DRKTQAIPFL EEALARGLKL
PTLYNLGRAH SPADPSSRFY LTKAIAAAQR AGSYFGQAFE LQPDNLYLDN ASREYLRAYL
IAMQSVGSER ESLDLPQLRQ QAIKWLELGM AADDGTAWTN YGALLLNEND YSGAKAAFLQ
GAKRQVAIAN YHLGSMEERG LGAEANKVQA LVYYEKAALA GMEEAKAPAY ELLKAQLEYE
DELTKLEQGI ARFNALKQQE QYVPISLDSV INRLAWGTFL AQQRQLTLSL PTAAKIQLSL
QACGLGYNQL HGSTYNIGEN SHWRMAAYLT LADKIALPLE GRVDEHGCAS LTIAMTDQLY
RLLQQGAVFG LRFPNYTLPL ALSQQENILQ LTLMPVETPL PVD
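
The gene and protein sequences above are printed in space-separated blocks of ten. The following Python sketch shows one way to strip that layout, compute GC content, and translate the nucleotide sequence. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments (it differs from table 1 in which start codons are permitted); the function names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table built in TCAG order; assumed valid for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence above.
first_blocks = "ATGAAGATAT TCACGATTGC GACTCTCCTA"
last_blocks = "CCGGTTGATT AG"

print(translate(first_blocks))  # MKIFTIATLL
print(translate(last_blocks))   # PVD
print(gc_percent(first_blocks))
```

The two translated fragments match the start (MKIFTIATLL) and end (PVD) of the listed protein sequence. Passing the full gene sequence to the same functions would allow the listed GC content and protein length to be checked as well.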