Gene Sbal223_2642 details


Gene Information

Locus tag: Sbal223_2642
Symbol:
ID: 7089927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3111498
End bp: 3113165
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 51%
IMG OID: 643461531
Product: C-5 cytosine-specific DNA methylase
Protein accession: YP_002358555
Protein GI: 217973804
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID:
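
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function name `cds_lengths` is made up for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (both are assumptions).
    """
    gene_len = end_bp - start_bp + 1  # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(3111498, 3113165)
print(gene_len, protein_len)  # 1668 555
```

Under these assumptions the result agrees with the listed gene length of 1668 bp and protein length of 555 aa.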


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000523714
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCGGCT TAATCGTAGA TAACTTCGCG GGTGGAGGCG GCGCTTCCAC TGGCATGGGC 
TGGGCAATCG GTCGCAGCAT TGATATCGCT ATCAATCACG ACCCTGACGC CATTGCGATG
CACTCAGTTA ATCACCCGAA TACCTTGCAC TATTGCGAGT CGGTATTTGA TATCGACCCA
GTGCAAGCCA CCGCAGGTAA ACCCGTTGAT TTGGCATGGT TCTCGCCTGA CTGTAAACAC
TTCAGCAAAG CCAAAGGCAG CAAGCCAGTC AATAAAGAGA TTCGCGGCTT AGCGTGGGTA
ACTGTTCGCT GGGCGATGAA GGTACGCCCT CGCGTGATGA TGCTCGAAAA CGTCGAGGAA
TTTAAAACGT GGGGGCCATT GATAGGAATA GATACAAAAG ACCAACGCCC TGACCCAGAT
AGAAAAGGCG AAACCTTTAA CGCTTTTGTC AGTATGTTGA GCTCTGGCAT AGATGCAGAT
CACCCCGCAC TTGCCGAATG TGTTGAAACC TTAGGCTTGC TCGATACCGC TAAATTGATT
AACGGGCTGG GTTATAAGGT CGAGTGGCGC GAGCTACGGG CCTGTGACTA TGGCGCCCCA
ACTATCCGTA AGCGTCTATT CATGATCGCC CGTTGTGATG GTCAGCCAAT CGTTTGGCCT
GAGCCTACCC ACGGCGCACC GGATAGCGAA GTGGTTAAAT CGGGCAAGCT GCTACCGTGG
CGCACAGCCG CCGAGTGCAT CGACTGGTCA CTGCCCTGCA AATCAATATT TGGTCGTAAA
AAGCCACTGG CTGAAAACAC CATGAAGCGT ATTGCTAAAG GCATTCAGCG GTTTGTGATT
GATGCCAAAG AACCGTTTAT TGTTCCGCAA AACGTCACAT TAGCGCCATT CATCACCGAG
CACGCTAACG CCAGCAATCA ACGCAACATG GCCGTTAATG AACCCTTGCG CACTATCTGC
GCAGCAGTCA AAGGCGGTCA CTTTGCAGTG GTGCAACCTG TGATCGAGAA AGTTAACGAA
ACCATCCCAA CCTTCACAGA ATGGTTTGCC AAAACTAAGA ATTTTGGTGG TAGCTATGAG
GAATACGTAA AGCTTTACGG AGATGATGAA TTACTTGAGC CACGCACCGC CGCCAACATC
TGCAAGCATT ACGGCGGTAA CTATACGGGA CCAGGTGATG ACCTAAACAA CCCTTTGCCG
ACAGTGACAA CGGTTGATCA CAACGCGCTG ATCACCAGCC ACATGATTAA ATTACGTGGC
ACTAACCTCG GCTTTCCGAT GGACGAACCA GCACACACGA TCACCGCTGG CGGCTTACAC
CTCGGCGAAG TGCGCGCGTT CTTCATCAAA TACTACGGCA ACGAGCAAGA CGGCGTGGCA
TGTAACGAAC CATTGCACAC CATCACCACC AATGACCGCT TTGGCCTAGT GATGATCAAG
GGTGAACCCT ATCAAATAAT TGATATCGGT ATGCGCATGC TCGAACCCCA TGAGCTATTC
GCCTGCCAAG GTTTCACCCC TGACTACATC ATCAACAACT ACAACGGCAA ATCGACCAAA
AAGCAGCAAG TCGCCCGTGT TGGTAACAGC GTACCGCCGC AATTTGCCGA AGCACTCACC
CGCGCAAATC TCCCCGAGCT TTGCACTCAA ACCGCAGAAG CGGCATAG
 
Protein sequence
MRGLIVDNFA GGGGASTGMG WAIGRSIDIA INHDPDAIAM HSVNHPNTLH YCESVFDIDP 
VQATAGKPVD LAWFSPDCKH FSKAKGSKPV NKEIRGLAWV TVRWAMKVRP RVMMLENVEE
FKTWGPLIGI DTKDQRPDPD RKGETFNAFV SMLSSGIDAD HPALAECVET LGLLDTAKLI
NGLGYKVEWR ELRACDYGAP TIRKRLFMIA RCDGQPIVWP EPTHGAPDSE VVKSGKLLPW
RTAAECIDWS LPCKSIFGRK KPLAENTMKR IAKGIQRFVI DAKEPFIVPQ NVTLAPFITE
HANASNQRNM AVNEPLRTIC AAVKGGHFAV VQPVIEKVNE TIPTFTEWFA KTKNFGGSYE
EYVKLYGDDE LLEPRTAANI CKHYGGNYTG PGDDLNNPLP TVTTVDHNAL ITSHMIKLRG
TNLGFPMDEP AHTITAGGLH LGEVRAFFIK YYGNEQDGVA CNEPLHTITT NDRFGLVMIK
GEPYQIIDIG MRMLEPHELF ACQGFTPDYI INNYNGKSTK KQQVARVGNS VPPQFAEALT
RANLPELCTQ TAEAA
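
The gene and protein sequences are shown in space-separated blocks of ten residues. A minimal sketch of how the two relate: strip the whitespace, then translate codon by codon until the stop codon. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon string is the compact ordering used in NCBI genetic code listings (bases ordered T, C, A, G), and translation table 11 is assumed here to share the standard code's amino-acid assignments, differing only in its permitted start codons.

```python
from itertools import product

# Compact genetic code: codons enumerated TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above
first_blocks = "ATGCGCGGCT TAATCGTAGA TAACTTCGCG"
print(translate(first_blocks))  # MRGLIVDNFA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.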