Gene Cagg_1223 details


Gene Information

Locus tag: Cagg_1223
Symbol:
ID: 7266209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1500194
End bp: 1501393
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 57%
IMG OID: 643566066
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002462568
Protein GI: 219848135
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
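
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are both inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are illustrative only. Under those assumptions both checks hold for the values listed.

```python
# Values copied from the record above.
start_bp = 1500194
end_bp = 1501393
gene_length_bp = 1200
protein_length_aa = 399


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the lengths reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```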


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.293801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCACTC TGATCTTTAC CGGAAAAGGC GGCGTTGGTA AGACGAGCGT CGCCGCAGCA 
ACGGCCCTAC GGGCTGCCGA TCGTGGCTTA AAAACACTGG TCATGAGCAC CGATCCTGCC
CACTCACTGG CCGATTCGCT CGATCTCGAG GGACCGCTGG GTCCTGAACC CGTTCGGATT
ACGAAGAACC TTGATGCGCT CGAAGTCAGC ATCTATCACG ACATCGAAAG CAACTGGGGT
ATTGTGCGCG AGCACTTCGC CCAACTTATG GCCGAGCAGG GCGTACAGGG CGTTTTGGCC
GATGAGATGA GCGTCCTGCC CGGTATGGAA GAGGCCTTCC CGCTTATTCG GATCAAGAAG
CATAAGGAGC GCGGTGATTA CGATCTTTTG GTGATCGATT GCGCGCCCAC CGGCGAGACG
CTACGGCTCC TTTCGGCCCC TGAAACGTTC AAGTGGGCGA TCAATATGTT GCGTGGGGCC
GAGCGTTACG TCATCCGGCC ACTGATCCGC CCAATGAGCA AGATCACGCC CGGCCTCAAC
AAAATGGTCG CGCCGCCTGA AGTGTACGAT GCCGTTGATG AGATGTTCCG CCAGATGGAG
GGGGTAACCG CGACGCTGGC TAATCCGCGC GAAACTTCGA TCCGCCTGGT GATGAACCCT
GAAAAGATGG TGATCAAGGA GAGCCAGCGG GCGTTGACCT ACCTGTCAAT GTACGGGATG
ACCGTTGACA TGGTCGTGGT CAATAAGATT TTACCTCTTG ACCAAGATAG CGGTTATCTG
AACCATTGGC GTGATGTGCA GCAGCGGTAT CTGCAAGACG TGGAGCACTC ATTTGTGCCG
TTGCCGATTC GGCGTGTGCC CTACTATCCC GAAGAGGTTG TCGGCCTTGA GAAGCTGCGC
CGGATGGGGG ATGATATCTA CGGCGATATG GATCCAACGG CCGTGCTCTA CGACCGCGCA
CCGCTAGAGA TTACTAAGGC TGGCGATAAA TTCTACCGGG TGAAGATCCG CTTGCCGTTT
GCCGATGTTT CACAACTCGA TCTCTACCAG AACGGTGATG AGTTGGTTGT CCAGATCGGC
GATTTCCGCC GTGTTATTAC CCTGCCGACG AGCCTTGCCG GCCTTGAAGC CGGGCAGGCA
GAGATGGAGG GTGAGTGGTT GATCGTGCCC TTCATGGCGC CGCAACTGGC GTCACGCTGA
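
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. A minimal sketch of that clean-up, together with a GC percentage of the kind reported in the GC content field, might look like the following. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first printed line is used here; the full 1200 bp sequence would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence, upper-cased."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, exactly as printed in the record.
first_line = "ATGCGCACTC TGATCTTTAC CGGAAAAGGC GGCGTTGGTA AGACGAGCGT CGCCGCAGCA"
seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on that line
print(round(gc_percent(seq), 1))   # GC percentage of that line only
```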
 
Protein sequence
MRTLIFTGKG GVGKTSVAAA TALRAADRGL KTLVMSTDPA HSLADSLDLE GPLGPEPVRI 
TKNLDALEVS IYHDIESNWG IVREHFAQLM AEQGVQGVLA DEMSVLPGME EAFPLIRIKK
HKERGDYDLL VIDCAPTGET LRLLSAPETF KWAINMLRGA ERYVIRPLIR PMSKITPGLN
KMVAPPEVYD AVDEMFRQME GVTATLANPR ETSIRLVMNP EKMVIKESQR ALTYLSMYGM
TVDMVVVNKI LPLDQDSGYL NHWRDVQQRY LQDVEHSFVP LPIRRVPYYP EEVVGLEKLR
RMGDDIYGDM DPTAVLYDRA PLEITKAGDK FYRVKIRLPF ADVSQLDLYQ NGDELVVQIG
DFRRVITLPT SLAGLEAGQA EMEGEWLIVP FMAPQLASR
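
The protein sequence can be compared against a translation of the gene sequence. The sketch below builds a codon table in the conventional TCAG order; translation table 11 is understood to use the same codon-to-amino-acid assignments as the standard code and to differ only in its permitted start codons, which this sketch does not model. The function name `translate` is illustrative. Translating the first three blocks of the gene sequence reproduces the first block of the protein sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
gene_start = "ATGCGCACTC TGATCTTTAC CGGAAAAGGC"
print(translate(gene_start))  # MRTLIFTGKG
```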