Gene Clim_2143 details


Gene Information

Locus tag: Clim_2143
Symbol:
ID: 6355937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2361630
End bp: 2362784
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 49%
IMG OID: 642669734
Product: arsenite-activated ATPase ArsA
Protein accession: YP_001944146
Protein GI: 189347617
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
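
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length (the function names are hypothetical, chosen for illustration):

```python
# Coordinates copied from the Gene Information table.
START_BP = 2361630
END_BP = 2362784


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of residues, assuming the final codon is a stop codon."""
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1155 384
```

Both results agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields of this record.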


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.237468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATTA TTCTTTATCT AGGGAAGGGA GGGGTTGGAA AAACCACCGT CTCAGCTTCA 
ACGGCAACTG CAATTGCCCG TCGTGGAGAG CGTGTGCTGA TTATGAGCAC CGATGTGGCC
CACAGCCTTG CTGATGCGTT CAGCGTGGAA TTAAGCCAGA ATCCGATCGA GGTAGAAAAA
AACCTTTTTG CCATGGAGGT AAATGTTCTT GCTGAAATCA GGGAGAACTG GACGGAACTC
TATTCCTATT TTTCGTCGAT TCTTATGCAT GACGGGGCCA ATGAGGTCGT GGCTGAAGAG
CTTGCCATCG TGCCGGGCAT GGAAGAGATG ATCAGTCTTC GTTACATCTG GAAAGCAGCA
AAGTCAGGCA ACTACGATGT CATTATTGTC GATGCCGCAC CGACAGGCGA AACCATGCGT
TTGCTGGGTA TGCCGGAATC CTACGGCTGG TACTCCGACA AGATAGGCGG CTGGCATTCA
AAGGCAATAG GCTTTGCCGC TCCTCTGCTT TCAAAGTTCA TGCCGAAAAA GAATATCTTC
AAGCTGATGC CTGAGGTGAA CGAGCATATG AAAGAGTTGC ACGGCATGCT TCAGGATCAG
ACCGTCACCA CTTTCCGCGT TGTGCTCAAT CCCGAGAACA TGGTCATCAA GGAGGCTCTT
CGCGTGCAGA CCTATCTGAA TCTGTTCGGG TATAAGCTCG ATGCAGCCGT GGTCAACAAA
ATTCTTCCCG AAAGCTCCGC CGATCAGTAT CTGCAGAGCC TTATCGACAT TCAGCAGAAA
TATCTCCGGG TCATCGACAA CTGTTTCTAT CCGGTACCGA TTTTCCGGGC TCATCAGCAG
ACAGCCGAGG TGATCAACAC CGATCGTCTT CATGTGCTGA GTCAGGAGAT TTTCGGCGAT
AAAAATCCCT CTGCCGTTCT CTACAGCAAC GACAAGACTC AGACTCTCGA AAAAATCAAC
GGAAAATATG TGCTCAGTCT CTATCTTCCG AATGTCGAGG TCAAGAAGCT CAATGTCAAC
ATCAAGGGAG ATGAACTGCT CGTCGATATC AATAATTTCC GCAAAAGCAT TATTCTTCCG
AATGTGCTTG TCGGCAGAAA AACCGAAGGA GCCGATTTTG CCGCCGGTAA CCTGAACATC
ACTTTTGCCA ACTGA
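
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any calculation. A minimal sketch of how the GC content field could be recomputed from such text, assuming GC content means the percentage of G and C bases over the full sequence (the helper names are hypothetical, and only the first row of the sequence is used as sample input):

```python
def clean_sequence(raw):
    """Drop whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C among all bases of an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, used only as sample input.
FIRST_ROW = "ATGAGAATTA TTCTTTATCT AGGGAAGGGA GGGGTTGGAA AAACCACCGT CTCAGCTTCA"

row = clean_sequence(FIRST_ROW)
print(len(row), round(gc_percent(row), 1))
```

Passing the complete gene sequence through the same two functions should give a value comparable to the rounded GC content reported in the record.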
 
Protein sequence
MRIILYLGKG GVGKTTVSAS TATAIARRGE RVLIMSTDVA HSLADAFSVE LSQNPIEVEK 
NLFAMEVNVL AEIRENWTEL YSYFSSILMH DGANEVVAEE LAIVPGMEEM ISLRYIWKAA
KSGNYDVIIV DAAPTGETMR LLGMPESYGW YSDKIGGWHS KAIGFAAPLL SKFMPKKNIF
KLMPEVNEHM KELHGMLQDQ TVTTFRVVLN PENMVIKEAL RVQTYLNLFG YKLDAAVVNK
ILPESSADQY LQSLIDIQQK YLRVIDNCFY PVPIFRAHQQ TAEVINTDRL HVLSQEIFGD
KNPSAVLYSN DKTQTLEKIN GKYVLSLYLP NVEVKKLNVN IKGDELLVDI NNFRKSIILP
NVLVGRKTEG ADFAAGNLNI TFAN
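
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation, assuming the sequence contains only A, C, G and T, starts in frame, and should be read up to the first stop codon (the names are hypothetical). Table 11 assigns the same amino acids as the standard code and differs in its permitted start codons, which this sketch does not treat specially; that is sufficient here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw):
    """Translate a nucleotide string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAGAATTA TTCTTTATCT AGGGAAGGGA"))  # MRIILYLGKG
```

The output matches the first block of the protein sequence, and translating the final bases of the gene (ACTTTTGCCA ACTGA) gives the closing residues TFAN.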