Gene Shewmr4_1180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1180 
Symbol 
ID4251464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1383336 
End bp1384454 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content49% 
IMG OID638117765 
ProductA/G-specific DNA-adenine glycosylase 
Protein accessionYP_733317 
Protein GI113969524 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000236828 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCTA CAGCCACCTT TGCTACACGT ATCGTTAATT GGTACGACAA CCACGGTCGT 
AAAACCCTCC CTTGGCAGCA AGATAAAACC CCATATCGCG TATGGGTTTC AGAGATTATG
CTGCAACAAA CTCAGGTAGC GACTGTTATC CCCTATTACC AGCGTTTTAT GGCACGTTTC
CCCGATGTGT TAACCCTCGC TAACGCGCCG GATGATGAAG TACTGCATCA TTGGACTGGG
CTTGGTTATT ACGCCAGGGC TCGCAATCTA CATAAAGCCG CTAAGATGGT TCGCGATTTG
TATCAAGGGC AATTTCCAAC AGACTTTGAG CAAGTGTTAG CGCTGCCTGG TATTGGCCGC
TCGACAGCAG GTGCAGTGTT ATCCCTATCA CTTGGGCAAC ATCATCCGAT CCTCGATGGC
AATGTTAAGC GCGTATTAGC AAGACATGGC GCGATTGCGG GCTGGCCCGG TCAAAAACCT
GTGGAAGAAC AACTCTGGCA ATTAACCGAG CAGTTAACGC CAGGGCAGGA TATTCAAAAA
TATAACCAAG CCATGATGGA TATTGGTGCC AGTATTTGTA CTCGCAGCAA ACCCAATTGT
GCCGCTTGCC CTGTGGCTAT TGATTGCAAA GCTCAATTAA TGGGAAGACA AACTGAGTTC
CCCGGTAAAA AGCCTAAGAA AACTATCCCA GAGAAAGCCG CTTGGATGTT GGTTTTACTC
AAAGATAACC AAGTCTTCTT GGCTAAGCGC CCTCCTGCCG GCATTTGGGG CGGACTCTGG
TGCTTCCCCG AATTTAGTAC TCAAGCCGCA CTCAATGCAG AGCTTGAAAC CCAAGGTTAT
CACGCCGCAC AACTCGAACC ATTAATCGGT TTTAGGCATA CCTTTAGCCA TTTCCATTTA
GATATTCAAC CCATGCTACT GAATTTGGAT AGCCAAGCGA ATGGCTACGA CAAGCAAACC
TCGGCTATGC AGAGCGTGGG CGCAGTCATG GAACAAAACC AGTCTCTCTG GTATAACATC
AATCAACCTT CCAAAGTGGG ACTCGCCGCC GCAACAGAGC GCGTGTTGGC CAACTTGGGA
TCACTCGTAG CTATTGCTAG CAACCTCGAC AGTCAGTAA
 
Protein sequence
MKSTATFATR IVNWYDNHGR KTLPWQQDKT PYRVWVSEIM LQQTQVATVI PYYQRFMARF 
PDVLTLANAP DDEVLHHWTG LGYYARARNL HKAAKMVRDL YQGQFPTDFE QVLALPGIGR
STAGAVLSLS LGQHHPILDG NVKRVLARHG AIAGWPGQKP VEEQLWQLTE QLTPGQDIQK
YNQAMMDIGA SICTRSKPNC AACPVAIDCK AQLMGRQTEF PGKKPKKTIP EKAAWMLVLL
KDNQVFLAKR PPAGIWGGLW CFPEFSTQAA LNAELETQGY HAAQLEPLIG FRHTFSHFHL
DIQPMLLNLD SQANGYDKQT SAMQSVGAVM EQNQSLWYNI NQPSKVGLAA ATERVLANLG
SLVAIASNLD SQ