Gene Sbal_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal_3039 
Symbol 
ID4845447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS155 
KingdomBacteria 
Replicon accessionNC_009052 
Strand
Start bp3563227 
End bp3564318 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content50% 
IMG OID640120287 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001051390 
Protein GI126175241 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000495773 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCTA CAGCCTCCTT CGCTACACGT ATCGTCTCTT GGTACGACAA TCACGGTCGT 
AAAACCCTCC CTTGGCAGCA AGATAAAACC CCATATAGCG TATGGGTTTC TGAAATCATG
CTGCAACAAA CTCAGGTTGC GACTGTTATT CCCTATTACC TTAAATTTAT GGCGCGTTTC
CCCGATGTGT TAGCACTTGC TAACGCGCCA GATGATGAGG TGTTGCATCA TTGGACTGGC
CTTGGGTATT ACGCTAGAGC GCGTAATCTA CATAAAGCAG CCAAGATGAT CCGCGACGAT
TATCAGGGAT TATTTCCAAC GGATTTTGAG CAAGTACTTG CGCTGCCTGG CATTGGCCGC
TCAACGGCAG GCGCAGTATT GTCACTGTCT CTTGGCCAGC ATCACCCGAT CCTCGACGGT
AACGTCAAAC GCGTGTTAGC AAGACACGGC GCCATAGCAG GTTGGCCGGG GCAAAAAACG
GTCGAAGCGC AGCTTTGGCA GCTAACTGAC ACGTATACGC CGCAGCAAGA TATTCAGAAA
TATAATCAAG CCATGATGGA TATCGGCGCC AGTATTTGTA CTCGTAGCAA ACCTAACTGC
GCCGCTTGCC CTGTGGCGAT TGATTGCAAA GCTCAGCTGA TTGGCAGACA AACCGATTTC
CCTGGCAAAA AGCCTAAAAA AACCATACCG ACCAAAGCGG CGTGGATGTT AGTGCTAATG
CAAGACAACC AAGTGTTTTT AGCTAAACGT CCGCCAGCGG GAATTTGGGG CGGACTTTGG
TGTTTCCCTG AGTTTGCCAC CCACGCCGCA CTTGAAACCC ACCTCGAAGA GCAAGGGTTT
GCAGGGCAAC AACTCGAATG GCTAACTGGC TTTAGGCACA CGTTTAGCCA CTTCCATTTA
GATATTCAGC CCATGATGCT TAATTTAGAT AACACCCACG GCAATAAAGA GAGCGTGGGC
GCTGTCATGG AACAAAATCA GTCTCTCTGG TATAACATAA GTCATCCTTC CAAAGTGGGA
CTCGCCGCCG CCACCGAGCG CGTGCTAGCC AATTTGGGAT CACTAGTTCA ATCCGCAGTC
AGTAAGGAAT AA
 
Protein sequence
MKSTASFATR IVSWYDNHGR KTLPWQQDKT PYSVWVSEIM LQQTQVATVI PYYLKFMARF 
PDVLALANAP DDEVLHHWTG LGYYARARNL HKAAKMIRDD YQGLFPTDFE QVLALPGIGR
STAGAVLSLS LGQHHPILDG NVKRVLARHG AIAGWPGQKT VEAQLWQLTD TYTPQQDIQK
YNQAMMDIGA SICTRSKPNC AACPVAIDCK AQLIGRQTDF PGKKPKKTIP TKAAWMLVLM
QDNQVFLAKR PPAGIWGGLW CFPEFATHAA LETHLEEQGF AGQQLEWLTG FRHTFSHFHL
DIQPMMLNLD NTHGNKESVG AVMEQNQSLW YNISHPSKVG LAAATERVLA NLGSLVQSAV
SKE