Gene Shewmr4_1367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1367 
Symbol 
ID4251386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1594653 
End bp1596281 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content51% 
IMG OID638117966 
Producttranscriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase 
Protein accessionYP_733502 
Protein GI113969709 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.987326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0122052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGA TAACCGTGAG CCTAAAGCCA AGTTTCAAAC CGCCATTAGT GCAAACATCA 
AGTGACTTGC AGGATTGCCA AATATCCGCC GGGATTTGCC ATGAAGCGCG CATGAGTCGC
GATCCGCGTT TTGATGGTAA GTTCTTTGTG GGCGTGTTAA GTACGGGGAT TTATTGCCGA
ACAGTATGCC CAGCAGTGGC GCCGAAGGAA GAAAATGTTC GTTATTTTGA CTCGGCCATT
AAAGCTGCGC AGGCGGGACT TAGACCTTGC CTGCGTTGCC GCCCCGATAG CGCCCCTGGT
TCTAATCCTT GGAAAGGTAC TAACACTACC TTAGACAGAG CGATAGGCTT GATTGAAGCT
GGCACCTTAT CTGGCGAACA AGCATTAACG GTTGAGGCCT TGGCAGATAA ACTGGGGATC
AGCAGCCGTT ATTTAAATAA ATTATTTACC GCAGGATTTG GCACATCACC AAAGCAATTT
GCCCTCTATC GTCAGTTATT GTTCGCAAAA CAGTTATTGC ATCAAACGCA GTTGCCCATC
ACTCAGGTCG CCCTTGCCGC AGGTTTTAAC AGTATTCGTC GTTTCAACGA GGCGTTTCAG
CAGGCATTGC AATTAACGCC TACGCAGTTA CGAAAATCGA GCCAGAAGTC GACGACGAAA
ATAGACGACG GGGAGAGTGT CGAACTCGTT TGTTCGCAGG AGATGCCAGC GCATAGCTTG
AGTCTGTATC AGTACTACCG GCCGCCACTC GATTGGTCGG CGCAATTAGC CTTTTATCGC
CTGCGCGCCG TCGCTGGCAT GGAATGGTTT AGTGAAACAG AAGACTCTCA CGCCCATGGG
GCGATTGACC AAGATACGCC ACTTGAGTAT GGCCGCACTC TGCAACTTGA GGATATCCGC
GCTGTGGTGC ATATCGTCCA TGAGGCGTCT TTGCATCGCT TTAAAATCAC GCTGACGTTA
ACGCCCGACT CGCCCTTATC GGGTCTACAA AAGCTCATCA CCCAAGTGCG GCGCATTTTA
GATCTCGATG CCGACATGCA GCAGATTGAA CACAATTTAG AGCACTTGAC CGATCTTAAA
CTCAGTAGCA AATCCGGGCT TAGGATCCCG GGCGCTGGCA CTGTGTTTGA GGCGGGTTGC
CGAGCGGTCT TAGGCCAGCA GGTGAGCGTA GTGCAGGCCA CTAAATTACT GAATATATTG
GTCGAAGCCT ATGGCGAGCG CTTTAGTTTA AACGGGCGGG AATATCGACT ATTCCCCACG
CCGCAGGCGA TTCGTGAGGC GAGTCTCGAT GAGCTTAAAA TGCCCGGTGC GCGCAAATTG
GCGCTCAATG CGCTTGCGGC GTTTATCTGT GAGCAGCCCG AAGCCTCAGT CGATGAGTGG
ATTGGGGTAA AAGGCATCGG CCCTTGGACC ATTGCCTATG CCAAACTGCG AGGACTTGGC
GATCCGAATA TCTTCTTGCA CACGGATTTA ATTGTTAAAA AACAATTGTT AGCCAGTGTT
GCCGCGCAAA AAGGCGTGGA TATTGAAGCG GTAAAGCAGC TGGATTATAC AACGCTTGCA
CAGCGGGTGA GCGCGGATAT CGCCCCTTGG GGGAGTTATT TAACTTTCCA GCTGTGGTCC
AATGCATAA
 
Protein sequence
MKQITVSLKP SFKPPLVQTS SDLQDCQISA GICHEARMSR DPRFDGKFFV GVLSTGIYCR 
TVCPAVAPKE ENVRYFDSAI KAAQAGLRPC LRCRPDSAPG SNPWKGTNTT LDRAIGLIEA
GTLSGEQALT VEALADKLGI SSRYLNKLFT AGFGTSPKQF ALYRQLLFAK QLLHQTQLPI
TQVALAAGFN SIRRFNEAFQ QALQLTPTQL RKSSQKSTTK IDDGESVELV CSQEMPAHSL
SLYQYYRPPL DWSAQLAFYR LRAVAGMEWF SETEDSHAHG AIDQDTPLEY GRTLQLEDIR
AVVHIVHEAS LHRFKITLTL TPDSPLSGLQ KLITQVRRIL DLDADMQQIE HNLEHLTDLK
LSSKSGLRIP GAGTVFEAGC RAVLGQQVSV VQATKLLNIL VEAYGERFSL NGREYRLFPT
PQAIREASLD ELKMPGARKL ALNALAAFIC EQPEASVDEW IGVKGIGPWT IAYAKLRGLG
DPNIFLHTDL IVKKQLLASV AAQKGVDIEA VKQLDYTTLA QRVSADIAPW GSYLTFQLWS
NA