Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2945 |
Symbol | |
ID | 7317051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 3087767 |
End bp | 3088813 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643617845 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_002515004 |
Protein GI | 220936105 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCAT CCGATTTCAG CCAACGCCTG CTCGCCTGGT TCGATCGCCA TGGCCGCCAC GACCTGCCCT GGCAACAGGA CATCAACCCC TACCGGGTCT GGGTCTCGGA GATCATGCTG CAGCAGACCC AGGTGGGCAC CGTGATCCCC TATTACCAGC GCTTCATGGC GCGCTTCCCG GACGTGGCGA GCCTCGCCGA CGCGCCCCTG GACCAGGTGC TGCATCACTG GTCCGGGCTC GGCTACTACG CCCGGGCCCG CAACCTGCAC AAGGCCGCCC AGGTGGTCCG CGATCAGCAC GGCGGGCGTT TCCCCGAAGA CATCGAGGCC CTACAGTCCC TGCCGGGCAT CGGCCGCTCC ACTGCCGGGG CCATCCTTGC GCTCGCCTGC GGGCAGCGCC AGCCCATCCT GGACGGCAAC GTCAAGCGGG TACTGGCCCG GCACCGGGCC GTGGAGGGCT GGAGCGGCGA GACGGTGGTG CTGCGCGATC TGTGGTGCCT GGCCGAGGCC CACACCCCCG CTGAACGGGT GGCCGAGTAC ACCCAGGCCA TCATGGACCT GGGCGCCACG GTCTGTACCC GCAGCCGCCC CGCCTGCGGC CGCTGTCCTG TCGCGGAAGA CTGCCGTGCG CGTCTCGAGG GCCGCACCGG CGAGCTGCCC GCGCCGCGCC CGAAGCGTGT CCAGCCCCTG CGCGAGACCT GCATGCTCAT GGTCACCACG CCGGAAGGGG TGCTGCTGGA ACAGCGCCCG GCGCGGGGGC TGTGGGGTGG ACTCTGGGGC TTCCCCGAGG TGGATGACGA GGCATCGGCC CTGGCCTGGT GCCGCGCGTC CCTGGGCCTG GAGCCGCAAC GGCTGGAGGC CTGGAATCCC TTCATCCACA CCTTCACCCA CTTCCGCCTG CGCATCACCC CGCTGCGGGT CTCGTTGCAA GACCCTGCCG GCTGTGTGAT GGAAGCGCCC GGGCGGGTCT GGTATAACAC CCGGACCTCA TCAGGCCTCG GGCTCGCAGC CCCGGTGGCC CAACTGCTTG AAAAACTGGA TCTCTAA
|
Protein sequence | MSASDFSQRL LAWFDRHGRH DLPWQQDINP YRVWVSEIML QQTQVGTVIP YYQRFMARFP DVASLADAPL DQVLHHWSGL GYYARARNLH KAAQVVRDQH GGRFPEDIEA LQSLPGIGRS TAGAILALAC GQRQPILDGN VKRVLARHRA VEGWSGETVV LRDLWCLAEA HTPAERVAEY TQAIMDLGAT VCTRSRPACG RCPVAEDCRA RLEGRTGELP APRPKRVQPL RETCMLMVTT PEGVLLEQRP ARGLWGGLWG FPEVDDEASA LAWCRASLGL EPQRLEAWNP FIHTFTHFRL RITPLRVSLQ DPAGCVMEAP GRVWYNTRTS SGLGLAAPVA QLLEKLDL
|
| |