Gene Rcas_1570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1570 
Symbol 
ID5539046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2019216 
End bp2020406 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content63% 
IMG OID640893708 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_001431681 
Protein GI156741552 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.807387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGA TACTCTACCT GGGCAAGGGT GGCGTTGGCA AAACGACCAC CTCGGCGGCG 
ACTGCGGTGC GCGCCGCCGA ACTCGGCTAC CGCACGCTGG TAGTCAGCAC CGATGTGGCG
CACAGCCTGG CTGATGCGCT CGATCATCCG TTGGGACCGC AACCGACGCA GCTTACCGAC
CGGCTCTGGG GGCAGGAAAT TAACGTGCTC GAAGAGGTGC GGCAGCATTG GGGCGAGTTG
CGCAACTATC TGGCAGGGTT GCTCAAACGC CGCGGCGTCA GCGATGTCGC TTCCGAAGAA
TTGGCGATCA TCCCCGGTAT GGAAGAGGTC GTCAGCCTTC TGCACATCCG GCGACAGGCG
CGCGAGGGCA ATTTCGACGC GGTGATCGTC GATGCGGCGC CGACCGGCGA GACCATCCGC
CTGTTGACCA TGCCAGAGAC CTTTCAGTGG TACGCGGCGC GGGTCATGGA TTGGGACCCC
GGCACCAAGA GCATGGCTAA ACCGCTGGTG CGCGCCCTGA TCCCGGCAAC CAACGCCTTC
GAGACGCTCG ACCGCCTGAC AAAGGGGGTC GAGGCGCTGC GCCAGATGCT GACCGATCCC
GACATCAGTT CGTACCGCCT GGTGGTCAAC CCGGAGCGCA TGGTCATCAA AGAAGCGCAG
CGCGCAGCGA CGTATCTGGC GCTGTTTGGC TATCCGGTCG ATGGTGTGGT GCTCAATCGG
GTGCTGCCAC GCAACGCAGT CGCCGGCGAA TTCATGGAAC GCCTGTATGA GATGCAGTCG
TCGTACCGCA AAATGGTGCA CGACCTGTTC GCGCCGCTGC CGATCTGGGA AGCGCCGCAT
TACCCGCATG ATATCCGGGG TATCAACGAT CTGTCGCAGG TTGGGCGCGA TATGTTCAAG
GACGAAGACC CGACGAAGGT CTTCTTCCGT GGCACCACGC AGGAAATCGT GCGCGACGGC
GATGAATATG TGATGCGTCT GCCGTTGCCG CACGTCGAAA TCGGCAAGGT GTCGATCACC
AAACGCGGCG ACGAACTGTT CGTTGCCATC GGCAATTTCA AGCGCGATAT GATCCTGCCG
CTGACACTCG CGGAACGACC GGCGAAGCGC GCGGTGTTCC GCGAAGGGGT GCTTGAGGTG
CGTTTTGGCG CCCCGGAGAC GGTCGAGCCG ACTGCGGCTT CCGCAGGGTG A
 
Protein sequence
MRLILYLGKG GVGKTTTSAA TAVRAAELGY RTLVVSTDVA HSLADALDHP LGPQPTQLTD 
RLWGQEINVL EEVRQHWGEL RNYLAGLLKR RGVSDVASEE LAIIPGMEEV VSLLHIRRQA
REGNFDAVIV DAAPTGETIR LLTMPETFQW YAARVMDWDP GTKSMAKPLV RALIPATNAF
ETLDRLTKGV EALRQMLTDP DISSYRLVVN PERMVIKEAQ RAATYLALFG YPVDGVVLNR
VLPRNAVAGE FMERLYEMQS SYRKMVHDLF APLPIWEAPH YPHDIRGIND LSQVGRDMFK
DEDPTKVFFR GTTQEIVRDG DEYVMRLPLP HVEIGKVSIT KRGDELFVAI GNFKRDMILP
LTLAERPAKR AVFREGVLEV RFGAPETVEP TAASAG