Gene Clim_0551 details

Gene Information

Locus tag: Clim_0551
Symbol:
ID: 6354902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 621591
End bp: 622892
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 50%
IMG OID: 642668187
Product: arsenite-activated ATPase ArsA
Protein accession: YP_001942622
Protein GI: 189346093
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
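
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the stated 1302 bp), and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 621591
END_BP = 622892


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1302 433
```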


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGTCGA GAGACTTAAC GGAAAACCAG TCTCAGCCGA GAGTCATAAT CTATTCAGGA 
AAGGGCGGAA CGGGTAAAAC CACGATATCC TCTTCGACGG CAGTAGCTCT TGCCCGGCAG
AACAAGAAGG TTCTCATCAT GTCTTCTGAT CCTGCCCATT CGCTTTCCGA TGTTTTCAAT
ACGCAGATCA GCCGTAACGA ACCGCAGAAA ATCGAAAACA ATCTCTATGG TCTCGAGGTC
GATACGATCT ATGAGCTGAA AAAAAACATG TCCGGCTTCC AGAAGTTCGT TTCCTCTTCC
TACCAGAACA AGGGAATCGA CAGCGGCATG GCTACCGAAC TTACCACGCA GCCCGGCCTC
GACGAGATTT TCGCACTCAG CCGCCTGGTC GATGAGGCGC AGTCGGGCAA ATGGGACGCC
CTGGTGCTCG ATACTTCGCC GACCGGCAAC ACCCTGAGAC TGCTTGCCTA TCCGGAAATC
ATCATTGGCG GTAATATGGG CAAACAGTTC TTCAAGCTTT ACAAGAGCAT GTCGTCACTG
GCCCGTCCCC TGAGCGGCAA CTCCATACCC GATGAGGACT TTTTCAACGA GATCAACGTT
CTGCTCAAGC AGATGGAAGA TATCAACAAG TTCATTCTCA GCCCGGAGGT TACCTTCCGT
CTGGTGCTGA ACCCCGAGAA GCTTTCCATT CTTGAAACAA AGCGTGCCTA CACCTTCGTG
CACCTTTACG GCATCAATAT CGACGGTATC GTCATCAACA AAATTCTGCC GACCTCGCGT
ACCGTGGGAG AGTATTTTGA GTTCTGGAGC GAGCTGCACA GCAAATATCT GATGGAGATC
GACAACTCCT TCTATCCTAC TCCCGTGTTT CGCTGCAATT TGCAGCGGAC CGAGCCGATC
GGGCCTGACG CGCTCCATGA GATCAGCAAG CTGGTGTTCG GCGAGGAAGT TCCGGATAAA
ATTTTCTACT CCGGAAAGAA TTTCTGGATC GAGACCCGCA AGAATGCCGT TACGGAAGAT
CATAGGGAAA TTCTCTGCAT CAAGATTCCG TTTCTCAAGG ATGCCGAAGA TGTAAAGGTC
GAGCGGATGG GTACCGACAT TGCCGTTACC GTTGACCGGG CCCAGAGAGT CATTACCCTT
CCGCGAGCGC TGTACAGCCT TGAACTCGAA AAATATATCC GCGAGGATAA CTTGCTTCGG
GTTGTTTTCA GAGAGCTTCC TGTTGAAAAA GAGGAGGTGG AACTGAGTGT CAATAAAAAC
ATGCTCGATA AACTTCGTTC AATGAGAAGA CTGAAGATAT AG
 
Protein sequence
MLSRDLTENQ SQPRVIIYSG KGGTGKTTIS SSTAVALARQ NKKVLIMSSD PAHSLSDVFN 
TQISRNEPQK IENNLYGLEV DTIYELKKNM SGFQKFVSSS YQNKGIDSGM ATELTTQPGL
DEIFALSRLV DEAQSGKWDA LVLDTSPTGN TLRLLAYPEI IIGGNMGKQF FKLYKSMSSL
ARPLSGNSIP DEDFFNEINV LLKQMEDINK FILSPEVTFR LVLNPEKLSI LETKRAYTFV
HLYGINIDGI VINKILPTSR TVGEYFEFWS ELHSKYLMEI DNSFYPTPVF RCNLQRTEPI
GPDALHEISK LVFGEEVPDK IFYSGKNFWI ETRKNAVTED HREILCIKIP FLKDAEDVKV
ERMGTDIAVT VDRAQRVITL PRALYSLELE KYIREDNLLR VVFRELPVEK EEVELSVNKN
MLDKLRSMRR LKI
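
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the code below builds a codon table from the usual NCBI TCAG ordering and translates the first 30 bases of the gene. It assumes that the amino acid assignments of translation table 11 match the standard code, and it ignores table 11's alternative start codons, which do not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not taken from any library.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is skipped.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
first_block = "ATGTTGTCGA GAGACTTAAC GGAAAACCAG"
print(translate(first_block))  # MLSRDLTENQ
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence instead of `first_block` would translate the whole reading frame in the same way.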