Gene Information |
Locus tag | Shewmr4_1180 |
Symbol | |
ID | 4251464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 1383336 |
End bp | 1384454 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638117765 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_733317 |
Protein GI | 113969524 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
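
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (which the listed values are consistent with) and that the final codon is a stop codon that is not counted in the protein length, the arithmetic can be checked as follows. The function names are illustrative and are not part of the database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = feature_length(1383336, 1384454)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1119 372
```

The strand field does not affect this calculation, because the length of a feature is the same on either strand.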
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000236828 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCTA CAGCCACCTT TGCTACACGT ATCGTTAATT GGTACGACAA CCACGGTCGT AAAACCCTCC CTTGGCAGCA AGATAAAACC CCATATCGCG TATGGGTTTC AGAGATTATG CTGCAACAAA CTCAGGTAGC GACTGTTATC CCCTATTACC AGCGTTTTAT GGCACGTTTC CCCGATGTGT TAACCCTCGC TAACGCGCCG GATGATGAAG TACTGCATCA TTGGACTGGG CTTGGTTATT ACGCCAGGGC TCGCAATCTA CATAAAGCCG CTAAGATGGT TCGCGATTTG TATCAAGGGC AATTTCCAAC AGACTTTGAG CAAGTGTTAG CGCTGCCTGG TATTGGCCGC TCGACAGCAG GTGCAGTGTT ATCCCTATCA CTTGGGCAAC ATCATCCGAT CCTCGATGGC AATGTTAAGC GCGTATTAGC AAGACATGGC GCGATTGCGG GCTGGCCCGG TCAAAAACCT GTGGAAGAAC AACTCTGGCA ATTAACCGAG CAGTTAACGC CAGGGCAGGA TATTCAAAAA TATAACCAAG CCATGATGGA TATTGGTGCC AGTATTTGTA CTCGCAGCAA ACCCAATTGT GCCGCTTGCC CTGTGGCTAT TGATTGCAAA GCTCAATTAA TGGGAAGACA AACTGAGTTC CCCGGTAAAA AGCCTAAGAA AACTATCCCA GAGAAAGCCG CTTGGATGTT GGTTTTACTC AAAGATAACC AAGTCTTCTT GGCTAAGCGC CCTCCTGCCG GCATTTGGGG CGGACTCTGG TGCTTCCCCG AATTTAGTAC TCAAGCCGCA CTCAATGCAG AGCTTGAAAC CCAAGGTTAT CACGCCGCAC AACTCGAACC ATTAATCGGT TTTAGGCATA CCTTTAGCCA TTTCCATTTA GATATTCAAC CCATGCTACT GAATTTGGAT AGCCAAGCGA ATGGCTACGA CAAGCAAACC TCGGCTATGC AGAGCGTGGG CGCAGTCATG GAACAAAACC AGTCTCTCTG GTATAACATC AATCAACCTT CCAAAGTGGG ACTCGCCGCC GCAACAGAGC GCGTGTTGGC CAACTTGGGA TCACTCGTAG CTATTGCTAG CAACCTCGAC AGTCAGTAA
|
Protein sequence | MKSTATFATR IVNWYDNHGR KTLPWQQDKT PYRVWVSEIM LQQTQVATVI PYYQRFMARF PDVLTLANAP DDEVLHHWTG LGYYARARNL HKAAKMVRDL YQGQFPTDFE QVLALPGIGR STAGAVLSLS LGQHHPILDG NVKRVLARHG AIAGWPGQKP VEEQLWQLTE QLTPGQDIQK YNQAMMDIGA SICTRSKPNC AACPVAIDCK AQLMGRQTEF PGKKPKKTIP EKAAWMLVLL KDNQVFLAKR PPAGIWGGLW CFPEFSTQAA LNAELETQGY HAAQLEPLIG FRHTFSHFHL DIQPMLLNLD SQANGYDKQT SAMQSVGAVM EQNQSLWYNI NQPSKVGLAA ATERVLANLG SLVAIASNLD SQ
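
The gene sequence above is listed in coding orientation (it begins with ATG and ends with TAA), so it can be translated directly without reverse-complementing. The following is a minimal sketch of that translation, assuming the amino-acid assignments of translation table 11, which match the standard genetic code. The `translate` helper is illustrative, not a database tool, and it does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGAAATCTA CAGCCACCTT TGCTACACGT"))  # MKSTATFATR
```

The result matches the first ten residues of the listed protein sequence. Applying the same function to the full gene sequence should stop at the terminal TAA codon.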