Gene Sbal195_4217 details

Gene Information

Locus tag: Sbal195_4217
Symbol:
ID: 5756048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 4991555
End bp: 4992460
Gene Length: 906 bp
Protein Length: 301 aa
Translation table: 11
GC content: 50%
IMG OID: 641290573
Product: DNA binding domain-containing protein
Protein accession: YP_001556635
Protein GI: 160877319
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1910] Periplasmic molybdate-binding protein/domain
TIGRFAM ID: [TIGR01764] DNA binding domain, excisionase family
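
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; both are assumptions about how this record was built, not something the record states. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4991555, 4992460)
print(length)                  # 906
print(protein_length(length))  # 301
```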


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00203903
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.426085
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCTG CCAGTGAATT GGTTTACATG AGCGCGAAGC AAGTGGCCGA GTATTTAGAT 
CTTAACGAGA AAAAAGTCTA CGCCATGGCC AACGACAGAA TTCTCCCCGC CACTAAAATC
ACCGGTAAAT GGCTATTCCC GAAAGTGCTA ATCGACCGTT GGGTGATGGA TTCGTGTCAC
AGTGGCATGC TTACCGACCG TTTGTTGATC ACCGGTAGTG ACGATCCACT CTTATCTATG
CTGGTGGCGC GCTTGATGGC ACAAGTCGGT AGTCGTGAGT TGATAAGCTA CAGCGCGACA
GGTTCACGCT TAGGATTAGA GTTACTCGCT AAAGGTTATG CCGATGTGTG TACCTTACAC
TGGGGCAGCA TGGAGGATCG CAATATCCGT CATCCAGCCT TACTTAAAGG GTATAACAAT
CATCAACAAT GGATCATGGT GCACGGTTAC TCCCGTCAAC AAGGGTTGAT CATGCGTGCC
GATATGCACC ACAGATGCCA AGAGGAAGAT AAAGTCGTGA ACTTACCTTG GCGTTGGGTG
AGTCGTCAGG GCGGCGCGGG TAGCCAGCAA CATTTAGAAC ATTGGTTGTT AAAGCAAGGC
GCTCGCTTAG ATCAGCTAAA TGTCGTGCTG ACGGCCTATA GTGAACGCGA GCTGGCAGGT
TATATCGCCC GTGGTGATGC CGATATAGGT TTTGGCTGTC AATCTGTGGC ATTGGAGAGT
GGTTTGAGTT TCGTGCCACT GATTAAAGAG TCCTTCGATT TCGTTATGCC GCAAAGCATT
TACTTCCGTC GTCAGCTTCA ACAACTCTTT ACTATGTTGG CGAGCGGCCA CTCGAGGCAA
ATGGCGGCGC TACTGGGTGG CTATGATCTT ACCGACTGCG GACAATTACT CTGGAGTGCG
AGCTAA
 
Protein sequence
MTSASELVYM SAKQVAEYLD LNEKKVYAMA NDRILPATKI TGKWLFPKVL IDRWVMDSCH 
SGMLTDRLLI TGSDDPLLSM LVARLMAQVG SRELISYSAT GSRLGLELLA KGYADVCTLH
WGSMEDRNIR HPALLKGYNN HQQWIMVHGY SRQQGLIMRA DMHHRCQEED KVVNLPWRWV
SRQGGAGSQQ HLEHWLLKQG ARLDQLNVVL TAYSERELAG YIARGDADIG FGCQSVALES
GLSFVPLIKE SFDFVMPQSI YFRRQLQQLF TMLASGHSRQ MAALLGGYDL TDCGQLLWSA
S
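
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline used to produce this record. It builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The helper name `translate` is hypothetical, and only the first 30 bases and the final 6 bases of the gene sequence are used as examples.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACGTCTG CCAGTGAATT GGTTTACATG"))  # MTSASELVYM
# Final 6 bases: one serine codon followed by the TAA stop codon.
print(translate("AGCTAA"))  # S
```

Passing the full 906-base gene sequence to `translate` in the same way should yield the 301-residue protein listed above.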