Gene Sbal_1682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal_1682 
Symbol 
ID4841778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS155 
KingdomBacteria 
Replicon accessionNC_009052 
Strand
Start bp1959966 
End bp1962317 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content48% 
IMG OID640118899 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_001050062 
Protein GI126173913 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGAT TCATGTTTGG CTTTAGTGCC ATCTTGTTAT CAGCGATGTT GTGGCCTTCG 
CTGCCGCCAG TCAGTTATAT ACCCTACCTT GCTGTTGGGG CGTTAATTTT ATACCGAAAA
ATCCCTGTCT GCGCAGGTGG ACTCTTTGCA ATGGCATGGC TAACTGGTTT TTGCTTAGGA
TTATCTAGGC AGGATCTTCC AGTGTTACAA CAACCTATAC AGGTTAGGGG TGAAATCATA
TCACTAGTTA GTCGAAACAG CGACTGGCTA AGTTTGGATA TATCGGTCAT TAAACCAAAT
TTAATCTTAG GGCCAAATGC AAAATTGCGA CTCACGTGGA AAGATCCGCC AGAAGTCGAT
GTCGGTCAGG TTTGGCAGTT TACCTTGATG CCTAAGAGTA TTGCTAGCGT GTTAAATCAA
GGTGGTTACA ACGAGCAGAA ACAATTAATC AGCCAACATG TTGTCGGTAA AGGGCGAGTG
ATAGAGGCAC ATTTACTTGC GGTTTCACCT TCACTGCGTA ACGAGCTGAT AAGCGCATTG
ACGCCTGAAT TAGCCTCTTT GCCGCAAGGT GATATTCTAC TCGCCTTACT TTTGGGTGAT
AAGCAGTTGA TCTCAAAAGT GCGTTGGCAA GCATTACGAC AGACAGGTAC GGGTCATTTA
GTGGCGATAT CCGGCTTGCA TCTTTCGGTC ATAGCAGCAT GGATTTACAC ATGTCTTTTG
TTTGGCTTAA GTCGCTTAGT GCCACATCAG AGTCGGCGTA ATATCACCAT TGCCTTAGTC
GCAGCGGGCA TAGGCGCCGC ATTTTATGCT TATCTTGCGG GCTTTGGCAT TTCGACGCAG
CGTGCATTAG TGATGATTCT GCTGTTAATG CTGTTGAGTT TATTAAAGCG CTTTTCTACG
GCGTGGGAGC GCTTATTATT CGCCCTGTTC ATTGTATTAC TGCTTGATCC GCTCGCTTGT
TTAAGTGCGG GATTTTGGCT GTCGTTTTGT GCCTTAGGCA TCATCTTATA CACCTTAGAA
ATCCAACCAA GGGCATTTAC CCCTGCGTCG ACTCGAAGGG CGCGTTTACG CACTGGGATG
ATGCAGTTTT GGGCTATCCA ATGGCGCTTG AGTCTCGGGC TTGGATTACT GCAAGCGGCA
CTCTTTGGCG GCGTGAGCGT ACACAGTCTG TGGATGAACA TCTTGGCCGT GCCCTGGTTT
AGCTTTGTAG TGATCCCCTT AGCCATGGCG GGATTTGTCT GCTGGTGGCT TGGGACGGCT
TTGGGATTAT CTCACTTCGG ACTCTCTTCG CTTGGAGTAC TGCGCCTCAG TGATTGGAGC
TTATCGCCCT ATGCGCAGTT GCTCGACATC AGCCAGCAAT TACCCGCCCA TTGGTTAGCA
CTCTCTGACA GCTTATTAGC CTTAGGATTC TGCGCGCTCG TTGGTGGTGT GTTATGGCGC
TATGTGCCTA AGCATAAACA CTATATTGCT TGGTTAAGCC TATTAAGTTT GTTGTTTATT
CCCGCATTAC TTTTTTGTAT GACGCTATGG TCTCCCGTGC AAACCCATCG ATGGACCATG
CATTTACTCG ATGTGGGCCA AGGTTTAACC GTGGTGATTG AAAAGAATGG CAGGGGTTTC
ATCTATGACA CGGGCGCCGC CTTTGGTGAT GATTTCAGTT ATGCGGAGCG GGTGATATTG
CCCTTTTTAA AAGCCAAAGG CATCCAAGAG ATTGATTATA TTGTTATCAG CCATAGCGAT
AACGATCATG CAGGCGGTGC GCCTGTCTTG ATTGAGGCTT ATCCCAAGGC TTTGGTGATC
ACAGATGTGG CTGGCTTTAG CGGCCAAGAT TGCCGTCCAA GGCAGATTCA ATGGCAAGGA
TTGCGCCTTA ACTTACTCTC GCCGCCTCAA GTGCTGGCGG GCAATAATGG CTCCTGTGTG
GTGCGCATTG ATGATGGCCT GCAAAGTCTG CTACTCACCG GGGACATTGA AAAACAAACC
GAAGCGGTAC TATTACGTAG TGAGTTAAGC GTGAACGGTG ATCTAAGTGA CCTAAACGAG
TTACAAAGCG ACGTGCTGGT GGCGCCGCAC CATGGCAGTA AAACCTCGTC GACGGAGGAC
TTTATCGATG CCGTTGCCCC TAAGTTAGTG CTGTTTCCGG CGGGTTTTGC TAATCGCTAT
GGTTTTCCTA AATACACAGT GGTTGAACGT TATCAGCGGC GAGAGATTAG GAGCCTGACC
ACAGGGTCAG AAGGGCAGAT TAGTGTGATT TTTCAGCAGT CTGAGTTAGA GGTAAAGACC
TATCGTGGTG ATTTAGCGCC ATTTTGGTAC AACTCTCTGT TTAGATTTGG TGACTTGATT
AATCCAGAGT AG
 
Protein sequence
MNRFMFGFSA ILLSAMLWPS LPPVSYIPYL AVGALILYRK IPVCAGGLFA MAWLTGFCLG 
LSRQDLPVLQ QPIQVRGEII SLVSRNSDWL SLDISVIKPN LILGPNAKLR LTWKDPPEVD
VGQVWQFTLM PKSIASVLNQ GGYNEQKQLI SQHVVGKGRV IEAHLLAVSP SLRNELISAL
TPELASLPQG DILLALLLGD KQLISKVRWQ ALRQTGTGHL VAISGLHLSV IAAWIYTCLL
FGLSRLVPHQ SRRNITIALV AAGIGAAFYA YLAGFGISTQ RALVMILLLM LLSLLKRFST
AWERLLFALF IVLLLDPLAC LSAGFWLSFC ALGIILYTLE IQPRAFTPAS TRRARLRTGM
MQFWAIQWRL SLGLGLLQAA LFGGVSVHSL WMNILAVPWF SFVVIPLAMA GFVCWWLGTA
LGLSHFGLSS LGVLRLSDWS LSPYAQLLDI SQQLPAHWLA LSDSLLALGF CALVGGVLWR
YVPKHKHYIA WLSLLSLLFI PALLFCMTLW SPVQTHRWTM HLLDVGQGLT VVIEKNGRGF
IYDTGAAFGD DFSYAERVIL PFLKAKGIQE IDYIVISHSD NDHAGGAPVL IEAYPKALVI
TDVAGFSGQD CRPRQIQWQG LRLNLLSPPQ VLAGNNGSCV VRIDDGLQSL LLTGDIEKQT
EAVLLRSELS VNGDLSDLNE LQSDVLVAPH HGSKTSSTED FIDAVAPKLV LFPAGFANRY
GFPKYTVVER YQRREIRSLT TGSEGQISVI FQQSELEVKT YRGDLAPFWY NSLFRFGDLI
NPE