Gene AnaeK_1032 details


Gene Information

Locus tag: AnaeK_1032
Symbol:
ID: 6784851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 1163238
End bp: 1164779
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 81%
IMG OID: 642762484
Product: transcriptional regulator, AraC family
Protein accession: YP_002133396
Protein GI: 197121445
COG category: [F] Nucleotide transport and metabolism
              [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
        [COG2169] Adenosine deaminase
TIGRFAM ID:
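
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1163238, 1164779)
print(length, protein_length(length))  # 1542 513
```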


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCCGC GCATGCCCCG CCCCGCCCCG CCGCCCGCGA TCCCCGGCCT CGACGCCGGC 
GCCTGCTGGC GCGCCCACGT CGCCCGCGAC GCGCGGTTCG ACGGCCGCTT CTTCACCGCG
GTGCTCTCCA CCGGGATCTT CTGCCGCCCG ATCTGCCGGG CGCGCACGCC GCGCCGGGAG
CACTGCGCGT TCTACCCGAG CGCCGCCGCC GCCCAGGCGG CCGGGTTCCG GCCCTGCCTG
CGCTGCCGGC CGGAGCTCGC GCCCGGCGTG GCCGGCTGGC GGGGCACCGC GAACACGGTC
GCGCGCGCGC TCGCGCTGAT CTCGGCGGGC GCCTGGGGCG AGCGCGACGA CGTGGAGGCG
CTCGCCGAGC GCGTCGGCGT CGGCGGCCGG CAGCTCCGCC GCCTGTTCGC CCGCCACGTG
GGCGCGCCGC CGGTCCGGAT CGCGCAGGCG CAGCGCGTGC TGCTGGCGCG CCGGCTGCTC
GCCGACACCA CCCTGCCGCT CGCCGACGTG GCCTCGGCGG CGGGCTTCGG GAGCGTGCGC
CGCTTCAACG AGGCGGTGCG GCGCACGTTC CGGCGCCCGC CGGGCGCGCT GCGGCGCGGC
GCGTCCGCCC CGCCGCCGGA CGGCGCGATC GCGATCGCGC TGCCGCACAC CGCGCCGTAC
GACTGGCCGG CGCTGCTCGG GTTCCTGGGC GCGCGGGCGA TCCCCGGCGT CGAGCAGGTG
TCGGACGGCG CGTACCGCCG CACCGTGGCG CTCGACGGCG CCGCGGGCAC GGTCGAGGTC
CGGCCCCATC CGCGGGGCCG CGGCCTGGTC GCGACGCTGC GGCTGCCGCG GGTGGCGGCG
ATCGCGCCCG CGGTGGAGCG CCTGCGCCGG CTGCTCGATC TCGACGCGGA CGCCCGGGCG
ATCGGCGCGC ACCTCTCGGG CGATCCGCTG CTCGCGCCGC TCGTCGCGGC GCGGCCCGGG
CTGCGCGTGC CGGGCGCGTG GGAGCCGTTC GAGCTGGTGG TGCGCGCGGT GCTCGGGCAG
CAGGTGAGCG TCGCCGCGGC CCGGACGCTG GCGGGCCGGC TCGCGGCGCG GCTCGGCGCC
CCGGTGGACT CCGGCGACCC CGCGCTGTCG CGGCTGTTCC CCGGCCCGGA GGCGCTCGCC
GGCGCCGACC TGGAGGGGCT CGGGCTGACC CGCGCCCGCG CCGCCACGCT CGCCGCGATC
GGCGGCGCGG TGCGGGACGA CCCGTCCCTG CTCGCGCCGG GCGGCGAGCT GGAGGACACC
GTGGCGCGCC TCGACGCGCT GCCCGGCATC GGCCGCTGGA CCGCGCAGTA CGTGGCGATG
CGGGCGCTGC ACCAGCCGGA CGCGTTCCCG GAGGGCGACC TCGGCCTGCT CGCCGCGCTC
GGCGACCTGC GCGGCCGCGG GCGGGCGCCG CCGGGGGAGC TGCTGCGACG GGCCGAGCGC
TGGCGCCCAT GGCGGGCGTA CGCGGCGCTG CACCTGTGGA CGAGCCTGCG GCCCCGCGCA
CGCGCGGCGC CGGGCGCACG GAAGAAGGGG AGGCGGTCAT GA
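
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any calculation. A minimal sketch of how the GC content field could be recomputed from such a listing follows. The helper names are hypothetical, and only the first row of the sequence is used for the demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence in this record (six blocks of ten bases).
first_row = "ATGATGCCGC GCATGCCCCG CCCCGCCCCG CCGCCCGCGA TCCCCGGCCT CGACGCCGGC"
print(len(clean_sequence(first_row)))        # 60
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full listing (all rows joined with newlines) to `gc_fraction` gives the value for the whole gene.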
 
Protein sequence
MMPRMPRPAP PPAIPGLDAG ACWRAHVARD ARFDGRFFTA VLSTGIFCRP ICRARTPRRE 
HCAFYPSAAA AQAAGFRPCL RCRPELAPGV AGWRGTANTV ARALALISAG AWGERDDVEA
LAERVGVGGR QLRRLFARHV GAPPVRIAQA QRVLLARRLL ADTTLPLADV ASAAGFGSVR
RFNEAVRRTF RRPPGALRRG ASAPPPDGAI AIALPHTAPY DWPALLGFLG ARAIPGVEQV
SDGAYRRTVA LDGAAGTVEV RPHPRGRGLV ATLRLPRVAA IAPAVERLRR LLDLDADARA
IGAHLSGDPL LAPLVAARPG LRVPGAWEPF ELVVRAVLGQ QVSVAAARTL AGRLAARLGA
PVDSGDPALS RLFPGPEALA GADLEGLGLT RARAATLAAI GGAVRDDPSL LAPGGELEDT
VARLDALPGI GRWTAQYVAM RALHQPDAFP EGDLGLLAAL GDLRGRGRAP PGELLRRAER
WRPWRAYAAL HLWTSLRPRA RAAPGARKKG RRS
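
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual TCAG ordering; NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. This gene starts with ATG, so no special handling of alternative starts (such as GTG or TTG) is included here, which is a simplification. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a blocked DNA listing; stop codons are returned as '*'."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGATGCCGC GCATGCCCCG CCCCGCCCCG"))  # MMPRMPRPAP
print(translate("AGGCGGTCAT GA"))                     # RRS*
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above, followed by the stop codon.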