Gene Sbal195_2003 details


Gene Information

Locus tag: Sbal195_2003
Symbol:
ID: 5753747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 2386259
End bp: 2387809
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 53%
IMG OID: 641288284
Product: C-5 cytosine-specific DNA methylase
Protein accession: YP_001554433
Protein GI: 160875117
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID:
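
The length fields above agree with the coordinates if the coordinates are read as 1-based and inclusive and the CDS is taken to include its stop codon. Both readings are assumptions about this database's conventions, not something the record states. A minimal Python check under those assumptions:

```python
# Values copied from the record above.
start_bp = 2386259
end_bp = 2387809

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1551 516"
```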


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00321659
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.797975
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGGCT TAATCGTCGA TAATTTTGCA GGCGGTGGCG GTGCAAGTAC GGGCATAGCG 
TGGGCGATTG GTCGCAGTGT GGATATTGCC ATTAACCATG ACCCTGACGC CATTGCGATG
CATTCAGCAA ATCACCCAGA GACCTTGCAC TACTGCGAGT CGGTATTTGA TATCGACCCC
GTGCAAGCTA CGGCAGGTAA GCCAGTTGAT TTAGCTTGGT TCTCACCCGA CTGCAAACAC
TTCTCAAAAG CCAAAGGCAG TAAGCCCGTG AGTAAAGAGA TCCGCGGCTT GGCTTGGGTG
ACGGTTCGCT GGGCGATGAT GGTTCGGCCG CGCGTGTTGA TGCTTGAGAA TGTTGAAGAA
TTTAAAACGT GGGGGCCTGT TATTGAATGC CCAGTCACGG AGGCGATGCG CCCCTGCCCT
GAGCGCAAAG GCGAAACCTT TAACGCCTTT GTCAGTATGT TGAGCACTGG CATAGATGCA
GAGCACCCCG CACTTGCTGA GTGCGTCGAA ACATTAGGCT TGCTCGATAC CGCTAAACTG
ATGAAAGGAC TGGGTTATAA GGTGGAGTGG CGCGAACTAC GTGCCTGTGA CTTCGGCGCC
CCGACTATTC GCAAACGGCT ATTTATGATC GCCCGTTGTG ATGGCCAGCC CATCATTTGG
CCTGAGCCGA CCCACGGCGC ACCCAATAGC GAAGCGGTTA AATCGGGCAA GCTGCAACCT
TGGCGAACGG CGGCTGAATG CATCGACTGG TCACTGCCTT GCCGATCCAT CTTTGGCCGT
AAAAAACCGT TGGCTGAAAA CACCATGAAG CGCATTGCTA AAGGGATTCA AAAGTTTGTG
TTTGATGCCA AGGAGCCGTT TATCGTTCCG CAAAATGTCA CGTTAGCCCC TTTTATTACT
GAACACGCGA ATGCCAGTAA TCAACGCAAT ATGCCCGTTG ATGAACCCTT GCGCACGATA
TGCGCCCAAG TAAAAGGCGG TCACTTTGCA GTGGTGCAAC CGGTACTAGC GGCGGCCAAT
ATCTGCAAGC ATTACGGCGG CAATTACTCG GGACCAGGTG ACGACCTAAA TAATCCCTTA
CCGACGGTGA CAACGGTTGA TCACAACGCG CTGATCACCA GCCACATGAT TAAATTGCGT
GGTACTAACC TCGGCTTTGC AATGGATGAA CCCGCACACA CGATCACCGC TGGCGGCTTA
CACCTCGGCG AAGTGCGCGC GTTCTTCATC AAATACTACG GCAACGAGCA AGACGGCGTG
GCATGTAACG AACCATTGCA CACTATCACC ACCAATGACC GCTTTGGCCT AGTGATGATC
AAGGGTGAGC CCTATCAAAT CATCGATATT GGTATGCGCA TGCTCGAACC CCATGAGCTG
TTCGCCTGCC AAGGTTTCAA TCCTGAATAC ATTATCAGTA ACTACAACGG CAAATCGACC
AAGAAACAGC AAGTCGCCCG TGTGGGTAAC AGCGTTCCGC CCCCATTTGC CGAAGCACTC
ACCCGCGCAA ATCTCCCAGA GCTTTGCATA CACACCGCCG AAGCGGCATA G
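
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, introduced only for illustration, and only the first and last lines of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCGAGGCT TAATCGTCGA TAATTTTGCA GGCGGTGGCG GTGCAAGTAC GGGCATAGCG"
last_line = "ACCCGCGCAA ATCTCCCAGA GCTTTGCATA CACACCGCCG AAGCGGCATA G"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))                # 60 bases on a full line
print(first.startswith("ATG"))   # True: the sequence opens with a start codon
print(last.endswith("TAG"))      # True: the sequence closes with a stop codon
print(round(gc_fraction(first), 2))
```

Applying the same two functions to every line of the gene sequence would give a value to compare against the GC content reported in the record.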
 
Protein sequence
MRGLIVDNFA GGGGASTGIA WAIGRSVDIA INHDPDAIAM HSANHPETLH YCESVFDIDP 
VQATAGKPVD LAWFSPDCKH FSKAKGSKPV SKEIRGLAWV TVRWAMMVRP RVLMLENVEE
FKTWGPVIEC PVTEAMRPCP ERKGETFNAF VSMLSTGIDA EHPALAECVE TLGLLDTAKL
MKGLGYKVEW RELRACDFGA PTIRKRLFMI ARCDGQPIIW PEPTHGAPNS EAVKSGKLQP
WRTAAECIDW SLPCRSIFGR KKPLAENTMK RIAKGIQKFV FDAKEPFIVP QNVTLAPFIT
EHANASNQRN MPVDEPLRTI CAQVKGGHFA VVQPVLAAAN ICKHYGGNYS GPGDDLNNPL
PTVTTVDHNA LITSHMIKLR GTNLGFAMDE PAHTITAGGL HLGEVRAFFI KYYGNEQDGV
ACNEPLHTIT TNDRFGLVMI KGEPYQIIDI GMRMLEPHEL FACQGFNPEY IISNYNGKST
KKQQVARVGN SVPPPFAEAL TRANLPELCI HTAEAA