Gene Tgr7_2945 details


Gene Information

Locus tag: Tgr7_2945
Symbol:
ID: 7317051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. HL-EbGR7
Kingdom: Bacteria
Replicon accession: NC_011901
Strand:
Start bp: 3087767
End bp: 3088813
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 70%
IMG OID: 643617845
Product: A/G-specific DNA-adenine glycosylase
Protein accession: YP_002515004
Protein GI: 220936105
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
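
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3087767
END_BP = 3088813
GENE_LENGTH_BP = 1047
PROTEIN_LENGTH_AA = 348


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record, so the listed coordinates and lengths agree with each other.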


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCAT CCGATTTCAG CCAACGCCTG CTCGCCTGGT TCGATCGCCA TGGCCGCCAC 
GACCTGCCCT GGCAACAGGA CATCAACCCC TACCGGGTCT GGGTCTCGGA GATCATGCTG
CAGCAGACCC AGGTGGGCAC CGTGATCCCC TATTACCAGC GCTTCATGGC GCGCTTCCCG
GACGTGGCGA GCCTCGCCGA CGCGCCCCTG GACCAGGTGC TGCATCACTG GTCCGGGCTC
GGCTACTACG CCCGGGCCCG CAACCTGCAC AAGGCCGCCC AGGTGGTCCG CGATCAGCAC
GGCGGGCGTT TCCCCGAAGA CATCGAGGCC CTACAGTCCC TGCCGGGCAT CGGCCGCTCC
ACTGCCGGGG CCATCCTTGC GCTCGCCTGC GGGCAGCGCC AGCCCATCCT GGACGGCAAC
GTCAAGCGGG TACTGGCCCG GCACCGGGCC GTGGAGGGCT GGAGCGGCGA GACGGTGGTG
CTGCGCGATC TGTGGTGCCT GGCCGAGGCC CACACCCCCG CTGAACGGGT GGCCGAGTAC
ACCCAGGCCA TCATGGACCT GGGCGCCACG GTCTGTACCC GCAGCCGCCC CGCCTGCGGC
CGCTGTCCTG TCGCGGAAGA CTGCCGTGCG CGTCTCGAGG GCCGCACCGG CGAGCTGCCC
GCGCCGCGCC CGAAGCGTGT CCAGCCCCTG CGCGAGACCT GCATGCTCAT GGTCACCACG
CCGGAAGGGG TGCTGCTGGA ACAGCGCCCG GCGCGGGGGC TGTGGGGTGG ACTCTGGGGC
TTCCCCGAGG TGGATGACGA GGCATCGGCC CTGGCCTGGT GCCGCGCGTC CCTGGGCCTG
GAGCCGCAAC GGCTGGAGGC CTGGAATCCC TTCATCCACA CCTTCACCCA CTTCCGCCTG
CGCATCACCC CGCTGCGGGT CTCGTTGCAA GACCCTGCCG GCTGTGTGAT GGAAGCGCCC
GGGCGGGTCT GGTATAACAC CCGGACCTCA TCAGGCCTCG GGCTCGCAGC CCCGGTGGCC
CAACTGCTTG AAAAACTGGA TCTCTAA
 
Protein sequence
MSASDFSQRL LAWFDRHGRH DLPWQQDINP YRVWVSEIML QQTQVGTVIP YYQRFMARFP 
DVASLADAPL DQVLHHWSGL GYYARARNLH KAAQVVRDQH GGRFPEDIEA LQSLPGIGRS
TAGAILALAC GQRQPILDGN VKRVLARHRA VEGWSGETVV LRDLWCLAEA HTPAERVAEY
TQAIMDLGAT VCTRSRPACG RCPVAEDCRA RLEGRTGELP APRPKRVQPL RETCMLMVTT
PEGVLLEQRP ARGLWGGLWG FPEVDDEASA LAWCRASLGL EPQRLEAWNP FIHTFTHFRL
RITPLRVSLQ DPAGCVMEAP GRVWYNTRTS SGLGLAAPVA QLLEKLDL
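
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch in plain Python; the function and constant names are hypothetical. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two tables differ in which start codons they permit), and it only translates the first and last stretches of the gene sequence shown above rather than the full 1047 bp.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of the three positions.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("ATGTCCGCAT CCGATTTCAG CCAACGCCTG"))
    # Last 27 bases of the gene sequence, ending in the stop codon.
    print(translate("CAACTGCTTG AAAAACTGGA TCTCTAA"))
```

The two fragments translate to MSASDFSQRL and QLLEKLDL followed by a stop (`*`), matching the start and end of the protein sequence listed above.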