Gene A2cp1_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA2cp1_1029 
Symbol 
ID7297075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter dehalogenans 2CP-1 
KingdomBacteria 
Replicon accessionNC_011891 
Strand
Start bp1151142 
End bp1152683 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content81% 
IMG OID643593822 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002491446 
Protein GI220916142 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCGC GCATGCCCCG CCCCGCCCCG CCGCCCGCGA TCCCCGGCCT CGACGCCGGC 
GCCTGCTGGC GCGCCCACGT CGCCCGCGAC GCGCGGTTCG ACGGCCGCTT CTTCACCGCG
GTGCTCTCCA CCGGGATCTT CTGCCGCCCG ATCTGCCGGG CGCGCACGCC GCGCCGGGAG
CACTGCGCGT TCTACCCGAG CGCCGCCGCC GCGCAGGCGG CCGGTTTCCG GCCGTGCCTG
CGCTGCCGGC CGGAGCTCGC GCCCGGCGTG GCCGGCTGGC GGGGCACCGC GAACACGGTG
GCGCGCGCGC TCGCGCTCAT CTCGGCGGGC GCCTGGGGCG AGCGCGACGA CGTGGAGGCG
CTCGCCGAGC GCGTCGGCGT CGGCGGCCGG CAGCTCCGCC GCCTGTTCGC CCGGCACGTG
GGCGCGCCGC CGGTCCGGAT CGCGCAGGCG CAGCGCGTGC TGCTGGCCCG GCGGCTGCTC
GCCGACACCA CCCTGCCGCT CGCCGACGTG GCCTCCGCGG CGGGCTTCGG GAGCGTGCGT
CGCTTCAACG AGGCGGTGCG GCGCACGTTC CGGCGTCCGC CGGGCGCGCT GCGGCGCGGC
GCGTCCGCCC CGCCGCCGGA CGGCGCGATC GCGATCGCGC TGCCGCACAC CGCGCCGTAC
GACTGGCCGG CGCTGCTCGG GTTCCTGGGC GCGCGCGCGA TCCCGGGCGT CGAGCAGGTG
TCGGACGGCG CGTACCGCCG CACCGTGGCG CTCGACGGCG CCGCGGGCAC GGTCGAGGTC
CGGCCCGATC CGCGGGGCCG CGGCCTGGTC GCGACGTTGC GGCTGCCGGG GGTGGCGGCG
ATCGCGCCCG CGGTGGAGCG CCTGCGGCGG CTGCTCGACC TCGACGCGGA CGCCCGGGCG
ATCGGCGCGC ACCTCTCGGG CGATCCGCTG CTCGCGCCGC TCGTCGCGGC GCGGCCCGGG
CTGCGCGTGC CGGGCGCGTG GGAGCCGTTC GAGCTGGTGG TGCGCGCGGT GCTCGGGCAG
CAGGTGAGCG TCGCCGCGGC CCGCACGCTG GCGGGCCGCC TCGCGGCGCG GCTCGGCGCC
CCGGTGGACT CCGGCGACCC CGCGCTGTCG CGGCTGTTCC CCGGCCCGGA GGCGCTCGCC
GGCGCCGACC TGGAGGGGCT CGGGCTGACC CGCGCCCGCG CCGCCACGCT CGCCGCGATC
GGCGCCGCGG TGCGAGACGA CCCGTCCCTG CTCGCGCCGG GCGGCGAGCT GGAGGACGCG
GTGGCGCGCC TCGACGCGCT GCCCGGCATC GGGCGCTGGA CCGCGCAGTA CGTGGCGATG
CGGGCGCTGC ACCAGCCGGA CGCGTTCCCG GAGGGCGACC TCGGCCTGCT CGCCGCGCTC
GGCGGCCTGC GCGGCCGCGG GCGGGCGGCG CCCGGGGAGC TGCTGCGACG CGCCGAGCGC
TGGCGCCCGT GGCGGGCGTA CGCGGCGCTG CACCTGTGGA CGAGCCTGCG GCCCCGCGCA
CGCGCGGCGC CCGGCGCACG GAAGAAGGGG AGGCGGTCAT GA
 
Protein sequence
MMPRMPRPAP PPAIPGLDAG ACWRAHVARD ARFDGRFFTA VLSTGIFCRP ICRARTPRRE 
HCAFYPSAAA AQAAGFRPCL RCRPELAPGV AGWRGTANTV ARALALISAG AWGERDDVEA
LAERVGVGGR QLRRLFARHV GAPPVRIAQA QRVLLARRLL ADTTLPLADV ASAAGFGSVR
RFNEAVRRTF RRPPGALRRG ASAPPPDGAI AIALPHTAPY DWPALLGFLG ARAIPGVEQV
SDGAYRRTVA LDGAAGTVEV RPDPRGRGLV ATLRLPGVAA IAPAVERLRR LLDLDADARA
IGAHLSGDPL LAPLVAARPG LRVPGAWEPF ELVVRAVLGQ QVSVAAARTL AGRLAARLGA
PVDSGDPALS RLFPGPEALA GADLEGLGLT RARAATLAAI GAAVRDDPSL LAPGGELEDA
VARLDALPGI GRWTAQYVAM RALHQPDAFP EGDLGLLAAL GGLRGRGRAA PGELLRRAER
WRPWRAYAAL HLWTSLRPRA RAAPGARKKG RRS