Gene Sbal195_1704 details


Gene Information

Locus tag: Sbal195_1704
Symbol:
ID: 5753440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 2050223
End bp: 2052574
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 48%
IMG OID: 641287980
Product: DNA internalization-related competence protein ComEC/Rec2
Protein accession: YP_001554137
Protein GI: 160874821
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
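The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2050223, 2052574)
print(length)                     # 2352
print(protein_length_aa(length))  # 783
```

Both results match the Gene Length and Protein Length fields reported in the record.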


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0286912
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGAT TCATGTTTGG CTTTAGTGCC ATCTTGTTAT CAGCGATGTT GTGGCCTTCG 
CTGCCGCCAG TCAGTTATAT ACCCTACCTT GTTGTTGGGG CGTTAATTTT ATACCGAAAA
ATCCCTGTCT GCGCAGGTGG ACTCTTTGCA ATGGCATGGC TAACTGGTTT TTGCTTAGGA
TTATCTAGGC AGGATCTTCC AGTGTTACAA CAACCTATAC AGGTTAGGGG CGAAATCATA
TCACTAGTTA GTCGAAACAG CGACTGGCTA AGTTTGGATA TATCGGTCAT TAAACCAAAT
TTAATCTTAG GGCCAAATGC AAAATTGCGA CTCACGTGGA AAGATCCGCC AGAAGTCGAT
GTCGGTCAGG TTTGGCAGTT TACCTTGATG CCTAAGCGTA TTGCTAGCGT GTTAAATCAA
GGTGGCTACA ACGAGCAGAA ACAATTAATC AGCCAACATG TTGTCGGTAA AGGGCGAGTG
ATAGAGGCGC AGTTACTTGC GTTTTCACCT TCACTGCGTA ACAAGCTGAT AAGCACATTG
ACGCCTGAAT TAGTCTCTTT GCCGCAAGGT GATATTCTAC TCGCCTTACT TTTGGGTGAT
AAGCAGTTGA TCTCAAAAGT GCGTTGGCAA GCATTACGAC AGACAGGTAC GGGTCATTTA
GTGGCGATAT CTGGCTTGCA TCTTTCGGTC ATAGCAGCAT GGATTTACAC ATACCTTTTG
TTTGGCTTAA GTCGCTTAGT GCCACATCAG AGTCGGCGTA ATATCACCCT TGCCTTAGTC
GCAGCGGGCA TAGGCGCCGT ATTTTATGCC TATCTTGCTG GCTTTGGCAT TTCGACGCAG
CGTGCATTAG TGATGATTCT GCTGTTAATG CTGTTGAGTC TATTAAAGCG CTTTTCTACG
GCGTGGGAGC GCTTATTATT CGCCCTGTTC ATTGTATTGC TGCTTGATCC GCTCGCTTGT
TTAAGTGCGG GGTTTTGGCT GTCGTTTTGT GCTTTAGGCA TCATCTTATA CACCTTAGAA
ATCCAACCAA GGGAATTTAC CCCCGCGTCG ACTCGAAGGG CGCGTTTACG CACAGGGATG
ATGCAGTTTT GGGCTATCCA ATGGCGCTTA AGTCTCGGGC TTGGATTATT GCAAGCGGCA
CTCTTTGGCG GCGTGAGCGT ACACAGTCTG TGGATGAACA TCTTGGCCGT GCCTTGGTTT
AGCTTTGTCG TGATCCCCTT AGCCATGGCG GGATTTGTCT GCTGGTGGCT TGGGACGGCT
TTGGGATTAT CTCACTTCGG ACTCTCTTCG CTTGGAGTAC TGCGTCTCAG TGATTGGAGT
TTATCGCCCT ATGCGCAGTT GCTCGACATC AGCCGGCAAT TACCCGCCCA TTGGTTAGCA
CTCTCTGACA GCCTATTAGC CTTAGGATTC TGCGCGCTCG TTGGTGGTGT GTTATGGCGC
TATGTGCCTA AGCATAAACA CTATATTGCT TGGTTAAGCC TCTTGAGTTT GCTGTTTATG
CCTGCGTTAC TTTTTTGTAT GACGCTATGG TCTCCCGTTC AAACCCATCG ATGGACCATG
CATTTACTTG ATGTGGGCCA AGGTTTAGCC GTGGTGATTG AAAAGAATGG CAAGGGGTTT
ATCTATGACA CGGGCGCCGC CTTTGGTGAT GATTTCAGTT ATGCGGAGCG GGTGATATTG
CCCTTTTTAA AAGCCAAAGG CATCCAAGAG ATTAATTATA TTGTTGTCAG CCATAGCGAT
AACGATCATG CTGGCGGTGC GCCTGTCTTG ATGGAGGCTT ATCCCAAGGC ATTGGTGATC
ACTGATGTGG CAGGCTTTAG CGGCCAAGAT TGCCGCCCAA GGCAGATTCA ATGGCAAGGA
TTGCGCCTTA ACTTACTCTC GCCGCCTCAA GTGCTGGCGG GCAATAATGG CTCTTGTGTG
GTGCGCATTG ATGATGGCCT GCAGAGTCTG CTGCTCACCG GGGACATTGA AAAACAAACC
GAAGCGGCAC TATTACGTAG TGAGTTAAGT GTGAGCGGTG ATCTAAGTGA CCTAAACGAG
TTACAAAGCG ACGTGCTGGT AGCGCCGCAC CATGGCAGTA AAACCTCGTC GACTGAGGAC
TTTATCGATG CCGTTGCCCC TAAGTTAGTG CTGTTTCCGG CGGGTTTTGC TAATCGCTAT
GGTTTTCCAA AAACCACAGT GGTTGAGCGG TATCAGCGAC GAGAGATTAG GAGCCTGACC
ACAGGGACAG AAGGGCAGAT TAGTGTGATT TTTCAGCAGT CTGAGTTAGA GGTAAAGACC
TATCGTGGTG ATTTAGCGCC ATTTTGGTAC AACTCTCTGT TTAGATTTGG TGACTTGATT
AATCCAGAGT AG
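The gene sequence above is laid out in space-separated groups of 10 bases, so it has to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute GC content; the helper names are hypothetical, and only the first line of the sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a grouped sequence block and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence block, copied as displayed.
first_line = "ATGAATCGAT TCATGTTTGG CTTTAGTGCC ATCTTGTTAT CAGCGATGTT GTGGCCTTCG "
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
```

Applying `clean_sequence` to the full block and passing the result to `gc_percent` would allow the reported GC content field to be compared against the sequence itself.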
 
Protein sequence
MNRFMFGFSA ILLSAMLWPS LPPVSYIPYL VVGALILYRK IPVCAGGLFA MAWLTGFCLG 
LSRQDLPVLQ QPIQVRGEII SLVSRNSDWL SLDISVIKPN LILGPNAKLR LTWKDPPEVD
VGQVWQFTLM PKRIASVLNQ GGYNEQKQLI SQHVVGKGRV IEAQLLAFSP SLRNKLISTL
TPELVSLPQG DILLALLLGD KQLISKVRWQ ALRQTGTGHL VAISGLHLSV IAAWIYTYLL
FGLSRLVPHQ SRRNITLALV AAGIGAVFYA YLAGFGISTQ RALVMILLLM LLSLLKRFST
AWERLLFALF IVLLLDPLAC LSAGFWLSFC ALGIILYTLE IQPREFTPAS TRRARLRTGM
MQFWAIQWRL SLGLGLLQAA LFGGVSVHSL WMNILAVPWF SFVVIPLAMA GFVCWWLGTA
LGLSHFGLSS LGVLRLSDWS LSPYAQLLDI SRQLPAHWLA LSDSLLALGF CALVGGVLWR
YVPKHKHYIA WLSLLSLLFM PALLFCMTLW SPVQTHRWTM HLLDVGQGLA VVIEKNGKGF
IYDTGAAFGD DFSYAERVIL PFLKAKGIQE INYIVVSHSD NDHAGGAPVL MEAYPKALVI
TDVAGFSGQD CRPRQIQWQG LRLNLLSPPQ VLAGNNGSCV VRIDDGLQSL LLTGDIEKQT
EAALLRSELS VSGDLSDLNE LQSDVLVAPH HGSKTSSTED FIDAVAPKLV LFPAGFANRY
GFPKTTVVER YQRREIRSLT TGTEGQISVI FQQSELEVKT YRGDLAPFWY NSLFRFGDLI
NPE