Gene Shewana3_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_1420 
Symbol 
ID4479637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp1649381 
End bp1651009 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content51% 
IMG OID639725990 
Producttranscriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II 
Protein accessionYP_869060 
Protein GI117919868 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.484928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGA CAAGCGTGAG CCAAAATTCA GCCCCACAAC CGCCATCCGA GCCGACATCA 
AGCGACGCGC AGGATTGCCA AATATCCGCC GGGATTTGCC GTGAAGCGCG CATGAGCCGC
GATCCACGTT TTGACGGTAA GTTCTTTGTG GGCGTGTTAA GTACGGGGAT TTATTGCCGA
ACAGTGTGCC CAGCAGTAGC GCCGAAGGAA GAAAATGTGC GTTATTTTGA CTCGGCCATT
AAAGCTGCAC AGGCGGGACT CAGGCCTTGT CTACGTTGCC GCCCCGACAG TGCTCCTGGC
TCTAATCCGT GGAAAGGCAC TAACACAACC CTAGACAGAG CCATAGGTTT GATTGAAGCT
GGCGCCTTAT CTGGTGAGCA CGGATTAACG GTCGAGGCCT TGGCGGACAA ATTGGGGATC
AGCAGCCGTT ATTTAAATAA ATTATTTACC GCAGGATTTG GCACTTCACC TAAACAATTT
GCCCTCTATC GTCAGTTATT GTTCGCAAAG CAGTTGTTGC ATCAGACGCA GTTGCCCATC
ACTCAGGTCG CCCTTGCCGC AGGCTTTAAC AGTATTCGCC GTTTTAACGA GGCATTTCAG
CAGGCGCTGC AATTAACGCC CACCCAGTTG CGAAAATCCA GTCAGAAATC GACTATTAAA
ATAGACGACG GGGAGAGTAC TGAACTCGCT TGTTCACAGG AGATGCCAGC GCACAGCTTG
AGTCTGTATC AGTATTATCG ACCGCCGCTC GATTGGTCGG CGCAATTAGC CTTTTATCGC
CTGCGCGCCG TCGAAGGCAT GGAATGGTTT AGTGAGAGTG TAGCCCCTCA CTTCCATGAG
GCGATTGACC CAAATATACC ACTTGAGTAT GGCCGCACCC TGCAAATTGA GGATATTCGC
GCAGTGGTAC ATATCGTCCA TGAGGCGCCT TTGCATCGCT TTAAAATCAC GCTGACGTTA
ACGCCTGACT CGCCCTTATC GGGTCTACAA AAGCTCATCA CCCAAGTGCG GCGCATTTTA
GATCTCGATG CAGATATGCA GAAGATTGAA CACAACCTTC AGCATTTGAC CGATCTTAAA
CTCAGTAGCA AATCCGGGCT TAGGATCCCC GGCGCAGGCG CGGTGTTTGA GGCGGGTTGC
CGAGCCGTCT TAGGTCAGCA GGTGAGCGTA GTGCAGGCGA CTAAATTACT GAATATATTG
GTCGAAGCCT ATGGCGAGTG CTTTAGTTTA AACGGACGGG AATATCGCCT GTTTCCTACG
CCGCAGGCGA TTCGTGATGC GAGTCTCGAT GAGCTTAAAA TGCCCGGTGC CCGCAAACTG
GCGCTCAATG CGCTTGCAGC CTTTATCTGT GAGCAGCCCG AAGCCTCAGT CGATGAGTGG
ATTGGGGTAA AAGGCATAGG CCCTTGGACC ATTGCCTACG CCAAATTACG TGGACTTGGC
GATCCGAATA TCTTCTTGCA CACGGATTTA ATTGTTAAAA AACAATTGTT AGCCTGTGTT
GCCGAGCAAA GTGGCTTGGA TATTGAAACG GTAAAACAGC TGGATTATAC GGCGCTTGCA
CAGCGGGTGA GTGCGGATAT CGCCCCTTGG GGAAGTTATT TGACCTTCCA GCTGTGGTCC
AATGCATAG
 
Protein sequence
MKQTSVSQNS APQPPSEPTS SDAQDCQISA GICREARMSR DPRFDGKFFV GVLSTGIYCR 
TVCPAVAPKE ENVRYFDSAI KAAQAGLRPC LRCRPDSAPG SNPWKGTNTT LDRAIGLIEA
GALSGEHGLT VEALADKLGI SSRYLNKLFT AGFGTSPKQF ALYRQLLFAK QLLHQTQLPI
TQVALAAGFN SIRRFNEAFQ QALQLTPTQL RKSSQKSTIK IDDGESTELA CSQEMPAHSL
SLYQYYRPPL DWSAQLAFYR LRAVEGMEWF SESVAPHFHE AIDPNIPLEY GRTLQIEDIR
AVVHIVHEAP LHRFKITLTL TPDSPLSGLQ KLITQVRRIL DLDADMQKIE HNLQHLTDLK
LSSKSGLRIP GAGAVFEAGC RAVLGQQVSV VQATKLLNIL VEAYGECFSL NGREYRLFPT
PQAIRDASLD ELKMPGARKL ALNALAAFIC EQPEASVDEW IGVKGIGPWT IAYAKLRGLG
DPNIFLHTDL IVKKQLLACV AEQSGLDIET VKQLDYTALA QRVSADIAPW GSYLTFQLWS
NA