Gene Cagg_0908 details

Gene Information

Locus tag: Cagg_0908
Symbol:
ID: 7267981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1137145
End bp: 1138359
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 55%
IMG OID: 643565756
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002462262
Protein GI: 219847829
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
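
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1137145
end_bp = 1138359

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the result matches the listed Gene Length (1215 bp) and Protein Length (404 aa).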


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.26902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00137885
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGCTGA TCCTCTATCT CGGCAAAGGC GGAGTGGGTA AGACCACGAC TTCGGCGGCG 
ACAGCAGTAC GAGCTGCCGA ATTGGGGTAT CGCACGCTCG TTGTCAGTAC TGATGTCGCA
CATAGTTTAG CCGATGCGCT CGATACTCCT CTCGGATCGC TCCCGACACA GATTGGCGAG
CGACTGTGGG GACAAGAGAT TAACGTGCTT GATGAAGTGC GTCAGCACTG GGGGGAATTG
CGGGTTTATT TGAGCAATCT CTTGCGTCGA CGTGGTGTTG ACGAGGTAGC TGCTGAAGAA
TTAGCGATCA TTCCCGGTAT GGAGGAGGTA GTCAGTCTTC TCCACATCCG ACGGCAAGCC
CGTGAAGGTA ATTTCGATGT GGTGATCGTT GATGCAGCTC CGACCGGTGA GACGGTACGT
CTGCTGACCA TGCCGGAGAC GTTTCAGTGG TATGCCGCAC GGGTGATGGA TTGGGAGCCA
ACGACGCTGA AAGTCGCGCG TCCACTCGTC AAGCAGTTGG TACCGGCAAC CGATGTCTTT
GCGAAGCTCG AACGGTTAAC GAAGGGGGTT GAGGCGTTAC GGGCCACCTT AACCGATCCA
CAGGTCAGTT CATACCGATT GGTCGTTAAC CCTGAACGGA TGGTGATCAA AGAGGCGCAA
CGGGCTTCGA CCTATCTTGC GCTCTTTGGG TATCCGGTTG ATGGCGTTGT CCTCAACCGG
GTATTGCCGG TCGATCAGGT CGAAGGCGAA TTTATGAAGG AGCTGGCCCG CATCCAACAG
GGCTACCGAC AGATGGTGTA TGACCTCTTC CGCCCGTTGC CGATTTGGGA AAGCCCGTAC
TATGCCCGTG ATCTGGCCGG AATTGACGAT CTTGCGATGG TCGGTCGGCA ACTGTTCGGC
GATGACGATC CGGTGAAGGT GCATTTTATC GGGAAAACGC AAGAGATTGT CAAACAGGGT
GATGAATATG TCTTACGTTT GCCGTTGCCG CATGTCGAGA TCGGCAAAGT CTCGATGACG
AAACGCGGTG ATGAGCTGTT TATCGAGATC GGTAATTTCC GCCGTGACAT GTTGTTGCCG
ACAACGCTGG CCGAACGACC GGCGCGGCGA GCCTATTTTC GCAATGGTGT GCTCGAAGTC
TTCTTTGGCC CGCCGGAGAC GTTGCCGCTG ACGAACAATG ATCAAGATAC GGAAGCGGAG
ACGGCAGCCT CATGA
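
The GC content reported in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above; `gc_fraction` is a hypothetical helper name, not part of any library.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Join the 10-base blocks and normalise case.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Demonstration on the first 10-base block of the gene sequence only.
first_block = "ATGCGGCTGA"
print(f"{gc_fraction(first_block):.0%}")
```

Passing the full gene sequence to the same function gives the value to compare with the listed GC content, which is presumably rounded to a whole percent.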
 
Protein sequence
MRLILYLGKG GVGKTTTSAA TAVRAAELGY RTLVVSTDVA HSLADALDTP LGSLPTQIGE 
RLWGQEINVL DEVRQHWGEL RVYLSNLLRR RGVDEVAAEE LAIIPGMEEV VSLLHIRRQA
REGNFDVVIV DAAPTGETVR LLTMPETFQW YAARVMDWEP TTLKVARPLV KQLVPATDVF
AKLERLTKGV EALRATLTDP QVSSYRLVVN PERMVIKEAQ RASTYLALFG YPVDGVVLNR
VLPVDQVEGE FMKELARIQQ GYRQMVYDLF RPLPIWESPY YARDLAGIDD LAMVGRQLFG
DDDPVKVHFI GKTQEIVKQG DEYVLRLPLP HVEIGKVSMT KRGDELFIEI GNFRRDMLLP
TTLAERPARR AYFRNGVLEV FFGPPETLPL TNNDQDTEAE TAAS
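
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is one way to reproduce it with the Python standard library. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard table; table 11 also allows alternative start codons, which this sketch does not handle because this gene starts with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Join the 10-base blocks and normalise case.
    dna = "".join(sequence.split()).upper()
    protein = []
    # Walk the sequence in complete codons only.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCGGCTGA TCCTCTATCT CGGCAAAGGC"))  # MRLILYLGKG
# Final line of the gene sequence, which ends in the stop codon TGA.
print(translate("ACGGCAGCCT CATGA"))  # TAAS
```

The two outputs correspond to the first ten and the last four residues of the protein sequence listed above.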