Gene Clim_2147 details

Gene Information

Locus tag: Clim_2147
Symbol:
ID: 6355941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2365798
End bp: 2367015
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 48%
IMG OID: 642669738
Product: arsenite-activated ATPase ArsA
Protein accession: YP_001944150
Protein GI: 189347621
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
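
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2365798
end_bp = 2367015
gene_length_bp = 1218
protein_length_aa = 405

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode an amino acid.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span is 1218 bp, which corresponds to 406 codons, or 405 amino acids plus one stop codon.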


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.686862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATTT TAACATTTAC AGGTAAAGGC GGAGTGGGTA AAACCAGCGT GTCAGCTGCA 
ACCGCTGTCC GTTTATCCGA GTTGGGCCAT CGCACCCTTG TTCTTTCAAC CGATCCGGCT
CACAGTCTGT CGGATTCATT CAATCTCGCT CTCGGTGCCG AACCAACCAA AATCAAGGAG
AACCTGCATG CCATCGAGGT TAATCCTTAT GTTGATCTGA AGCAGAACTG GCAGTCAGTT
CAGAAATACT ATACGAGAAT TTTTATGGCT CAGGGCGTTT CAGGCGTCAT GGCCGATGAG
ATGACCATTC TTCCCGGCAT GGAAGAACTG TTTTCTCTCC TGCGAATCAA ACGGTATAAA
ACCGCCGGAC TGTACGATGC GCTTGTACTC GATACCGCTC CGACCGGTGA GACCCTTCGC
CTTCTCTCTC TGCCCGATAC GCTTTCGTGG GGCATGAAAG CCGTTAAAAA TGTCAATAAA
TATATAGTCA GGCCGCTCAG CAAACCGCTG TCGAAAATGT CCGACAGGAT TGCTTACTAC
ATTCCACCCG AAGACGCTAT CGAATCGGTC GATCAGGTGT TCGACGAACT TGAGGATATT
CGGGAAATTC TTACCGATAA TGTTAAATCG ACCGTTCGGC TTGTCATGAA CGCCGAGAAA
ATGTCGATCA AGGAGACCAT GAGGGCTCTC ACCTATCTGA ACCTTTACGG CTTCAAGGTC
GATATGGTTT TGGTGAACAG GCTGCTCGAT ACCAACGAAA ACAGCGGATA CCTTGAAAAA
TGGAAGGGTA TCCAGCAGAA ATATCTTGGT GAAATAGAAG AAGGGTTTTC TCCGCTTCCG
GTCAAGAAAC TGAAAATGTA CGAGCAGGAA ATCGTCGGGT TGAAGGCTCT GGAAATGTTT
GCCCGCGATA TGTACGGAGA TACCGATCCC GCAGATCTCA TGTACAACGA GCCGCCGATC
AAATTTGTTC GGAACGGTGA TATTTATGAA GTGCAGCTGA AACTCATGTT CGCCAACCCG
GTCGATATCG ACGTCTGGGT TACCGGCGAT GAGCTTTATG TACAGATCGG CAATCAGCGT
AAAATCATTA CGTTACCGAT CAGCCTTACA GGACTCGAGC CAGGTGATGC GGTCTTCAAG
GATAAATGGC TCCACATCCC GTTCGATCTC AACCATCAGG GCAAGCATCA GAATCAGAAA
GAGTTTAACA AAGTGTGA
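
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last rows of the sequence are used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGCGTATTT TAACATTTAC AGGTAAAGGC GGAGTGGGTA AAACCAGCGT GTCAGCTGCA"
last_row = "GAGTTTAACA AAGTGTGA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

# The sequence begins with an ATG start codon and ends with a TGA stop codon.
print(len(first), first[:3], last[-3:])
print(round(gc_fraction(first), 2))
```

The GC fraction of a single row will generally differ from the 48% reported for the whole gene. To compare against that figure, pass all rows of the gene sequence to `clean_sequence` as one string before calling `gc_fraction`.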
 
Protein sequence
MRILTFTGKG GVGKTSVSAA TAVRLSELGH RTLVLSTDPA HSLSDSFNLA LGAEPTKIKE 
NLHAIEVNPY VDLKQNWQSV QKYYTRIFMA QGVSGVMADE MTILPGMEEL FSLLRIKRYK
TAGLYDALVL DTAPTGETLR LLSLPDTLSW GMKAVKNVNK YIVRPLSKPL SKMSDRIAYY
IPPEDAIESV DQVFDELEDI REILTDNVKS TVRLVMNAEK MSIKETMRAL TYLNLYGFKV
DMVLVNRLLD TNENSGYLEK WKGIQQKYLG EIEEGFSPLP VKKLKMYEQE IVGLKALEMF
ARDMYGDTDP ADLMYNEPPI KFVRNGDIYE VQLKLMFANP VDIDVWVTGD ELYVQIGNQR
KIITLPISLT GLEPGDAVFK DKWLHIPFDL NHQGKHQNQK EFNKV