Gene Clim_2364 details

Gene Information

Locus tag: Clim_2364
Symbol:
ID: 6355710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2592454
End bp: 2593641
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 47%
IMG OID: 642669956
Product: arsenite-activated ATPase ArsA
Protein accession: YP_001944366
Protein GI: 189347837
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
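
The start, end, gene length, and protein length fields agree with one another if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check, with illustrative function names that are not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2592454, 2593641)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```

Both values match the Gene Length and Protein Length fields of this record.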


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCTTA TTCTCATGAC AGGGAAGGGT GGTGTCGGAA AAACATCCAT GGCGGCGGCT 
ACCGGCTTGC GCTGTGCAGA GCTTGGTTAT AAAACGCTTG TCCTGAGTAC CGATCCCGCT
CATTCGCTTG CCGACAGTTT CGATATCCCC CTGGGCCATG AAGCGGTAAA GATATGCGAT
AATCTTTATG GCGCTGAACT TGATGTGCTT CAGGAGCTTG AACAGAACTG GGGGACGGTC
AAACGCTATA TTACCCAGGT ATTGCAGGCA AGAGGTCTTG ATGCTGTTCA GGCAGAAGAG
CTTGCCATTC TTCCCGGGAT GGATGAGATT TTCGGGCTCG TCAGGGTATT CCGACATCAC
AGGGAGGGGA ATTACGATGT GTTGATCATC GACTCGGCTC CTACAGGAAC AGCATTGCGC
CTTTTGAGTA TTCCTGAAGT CAGCGGCTGG TATATGCGCA GACTCTACAA GCCGATGGAG
AAGTTTGCGC TGTATCTCAG GCCGCTCGTC GAACCACTTT TCCGGCCTAT TGCCGGATTT
TCGCTTCCTG ACAGAGCGTT AATGAATGTC CCATACGAAT TCTACGAACA AATTGATGCG
CTTGGAAAGA TTCTCACGGA CAATGCCATT ACCTCTGTGC GGCTGGTGAC CAATCCGGAA
AAAATGGTTA TCAAGGAGTC GCTGCGCGCT CATGCCTATC TCAGTCTGTA CAATATTTCG
GTGGATATGG TTATCTCCAA CAGAATTATC CCGCCGGAAG TTACCGATCC TTATTTCGTT
TACTGGAAAG AGCATCAGCA GCGTTACAGA CAGGAAATCA TCGATAATTT CAGTCCTCTG
CCGGTCAAGG AGGTTCCTCT CTATACACGT GAAATCTGCG GCTTGAAAAC ACTCGAAAAA
CTTAAGGATT TTCTCTATCG TGATGAGGAC CCTTCAAAGG TTTATTATTT TCGTAATACG
TTTACTATCA GAAAGGTTGA AAACGGTTTT TCTCTCGAAC TTTATCTTCC GGGTATTCCC
AAAGATCAAA TTCAGCTCAG CAAAAGCGGC GATGAACTGA ATATCCATAT TGGCAATCAC
CGGAGAAATA TGGTGCTCCC ACAATCTCTT GCAACGCTGA ATACGGCCGG CGCAGAAATG
AACAGCGATC ATCTGGTGAT CAGGTTTTCA GAAATGGATG CAAAATAG
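
The gene sequence is printed in blocks of ten bases, so any calculation on it first has to strip the spaces and line breaks. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as input, so the printed percentage describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGAGGCTTA TTCTCATGAC AGGGAAGGGT GGTGTCGGAA AAACATCCAT GGCGGCGGCT"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"GC fraction of first row: {gc_fraction(seq):.1%}")
```

Passing the full 1188 bp sequence through the same two functions would give a value to compare against the reported 47%.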
 
Protein sequence
MRLILMTGKG GVGKTSMAAA TGLRCAELGY KTLVLSTDPA HSLADSFDIP LGHEAVKICD 
NLYGAELDVL QELEQNWGTV KRYITQVLQA RGLDAVQAEE LAILPGMDEI FGLVRVFRHH
REGNYDVLII DSAPTGTALR LLSIPEVSGW YMRRLYKPME KFALYLRPLV EPLFRPIAGF
SLPDRALMNV PYEFYEQIDA LGKILTDNAI TSVRLVTNPE KMVIKESLRA HAYLSLYNIS
VDMVISNRII PPEVTDPYFV YWKEHQQRYR QEIIDNFSPL PVKEVPLYTR EICGLKTLEK
LKDFLYRDED PSKVYYFRNT FTIRKVENGF SLELYLPGIP KDQIQLSKSG DELNIHIGNH
RRNMVLPQSL ATLNTAGAEM NSDHLVIRFS EMDAK
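
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It builds the codon table from the 64-letter amino-acid string published for NCBI translation table 11, whose codon assignments are the same as the standard code. It does not handle alternative start codons, which table 11 permits; this gene starts with ATG, so that limitation does not matter here. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGAGGCTTA TTCTCATGAC AGGGAAGGGT"))  # MRLILMTGKG
print(translate("GAAATGGATG CAAAATAG"))               # EMDAK
```

The two outputs match the first ten and last five residues of the protein sequence listed above.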