Gene Rfer_3664 details


Gene Information

Locus tag: Rfer_3664
Symbol:
ID: 3963992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodoferax ferrireducens T118
Kingdom: Bacteria
Replicon accession: NC_007908
Strand:
Start bp: 4084783
End bp: 4086552
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 63%
IMG OID: 637918480
Product: arsenite-activated ATPase (arsA)
Protein accession: YP_524895
Protein GI: 89902424
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
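
The coordinates and lengths listed above are consistent with each other, and the arithmetic can be checked in a few lines. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4084783
end_bp = 4086552

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says so) and ends with
# one stop codon that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1770
print(protein_length)  # 589
```

Both results match the Gene Length (1770 bp) and Protein Length (589 aa) fields.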


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTCC TTGATCAAGC CCCGCGTTTT TTGTTCTTCA CCGGCAAGGG CGGTGTCGGC 
AAAACATCCA TCGCCTGCGC CAGCGCACTG GAACTCACAC GCTTGAACAA ACGCGTCTTG
CTGGTCAGCA CCGACCCAGC CTCCAACGTG GGGCAGGTGT TTGGCATCCG CATCGGCAAC
CAGATCACCA ACGTGGCTGA GGTGCCAAAT TTGAGCGCCC TGGAGATCGA CCCGCAAGCG
GCTGCCCAAG CCTACCGCGA CCGCATCGTC GGCCCGGTGC GCGGGGTGCT GCCTGATGCC
GTGGTCAAGG GCATTGAAGA GCAGCTCTCG GGCGCCTGCA CCACCGAGAT TGCCGCGTTT
GACGAATTCA CTGCGCTGCT CACGGATGGC GCGCTCACGC AGAATTTTGA CCACATCATT
TTCGACACCG CCCCGACCGG CCACACCATC CGTATGTTGC AGTTGCCCGG TGCCTGGAGC
GGTTTTCTGG AAGACGGCAA GGGCGACGCC TCTTGCCTGG GCCCACTGGC AGGACTTGAA
AAGCAGCGCA CCCAGTACAA GGCGGCGGTG GATGCCCTGG CCGACCCATT GAAAACCCGT
CTGATCCTTG TGGCACGGGC GCAAGCAGCT ACTTTAAATG AAGCAGCCCG CACGCATGGC
GAGCTGGCGG GTATTGGCCT GTCGAAGCAG TATCTGGTGA TCAACGGTGT GTTCCCGGAG
TCCGAAACCG TCCACGATGC GCTGGCACAG GCTATTTTTG ATCGCGAGCA GGCCGTGTTG
GCGAACTTGC CTGACGCGTT GCGCGACTTG CCCACCGATC AAATCGGACT CAAGGCGTTT
AATCTGGTCG GGCTGGCGCC TTTGCGTCAG CTGCTGGCGG CGGCCGAACC CGCAGTGCCG
ACGGCAGCGG ACGTACCGCC TGGTGCACCC CCTATCAGCG CTCCCAGCCT GGCCTCCCTG
GTCGAGGCCA TAGCGCGGGA GGGCCACGGG CTGGTCATGC TGATGGGCAA AGGCGGCGTC
GGTAAAACCA CGCTCGCTGC GGCTGTCGCA GTGGAGCTGG CCACGCGCAG CCTGCCCGTG
CACCTGACGA CCTCCGACCC GGCCGCGCAC CTGATGGAAA CGCTGGACGG CACGCTGGAA
CATCTGACGG TCAGCCGCAT CGACCCGCAT GAAGTCACCG AGCACTACCG CGCGCAAGTG
CTGGCCAGCA AAGGGGCCAA GCTCGATGCC GCTGGCCGCG CCGTGCTGGA AGAAGACCTG
CGCTCGCCCT GCACGGAAGA AATCGCGGTG TTCCAGGCCT TCTCGCGCGT GATCCGTGAG
GCTGGCAAGA AGTTTGTGGT GATGGACACC GCGCCTACCG GTCACACGCT GCTGCTGCTC
GACGCCACCG GCGCCTACCA CCGCGACGTC GCGCGCCAGA TGGGCAGCAG CGGCAGGCAC
TTCACCACGC CCATGATGCA GTTGCAGGAC CCCAGGCAAA CCAAGGTGCT GATCGTCACG
CTGGCCGAGA CCACGCCGGT GCTGGAGGCG GCCAATCTGC AAACCGATTT GCGTCGCGCC
GGCATTGAGC CCTGGGCCTG GGTCATCAAC AACAGTGTCG CGGCAGCTGC CGTGACATCG
CCTTTGCTAC AAGCACGAGC TCACAACGAG TTGCGTGAGA TTGCGGCCGT TGCGACCCAC
CACGCCAGTC GCTATGCGCT GGTACCGCTC TTGAAAGACG AACCGATTGG CATGCAGCGG
CTGCAACTTT TGTCAAAACA AGGAGCGTGA
 
Protein sequence
MKFLDQAPRF LFFTGKGGVG KTSIACASAL ELTRLNKRVL LVSTDPASNV GQVFGIRIGN 
QITNVAEVPN LSALEIDPQA AAQAYRDRIV GPVRGVLPDA VVKGIEEQLS GACTTEIAAF
DEFTALLTDG ALTQNFDHII FDTAPTGHTI RMLQLPGAWS GFLEDGKGDA SCLGPLAGLE
KQRTQYKAAV DALADPLKTR LILVARAQAA TLNEAARTHG ELAGIGLSKQ YLVINGVFPE
SETVHDALAQ AIFDREQAVL ANLPDALRDL PTDQIGLKAF NLVGLAPLRQ LLAAAEPAVP
TAADVPPGAP PISAPSLASL VEAIAREGHG LVMLMGKGGV GKTTLAAAVA VELATRSLPV
HLTTSDPAAH LMETLDGTLE HLTVSRIDPH EVTEHYRAQV LASKGAKLDA AGRAVLEEDL
RSPCTEEIAV FQAFSRVIRE AGKKFVVMDT APTGHTLLLL DATGAYHRDV ARQMGSSGRH
FTTPMMQLQD PRQTKVLIVT LAETTPVLEA ANLQTDLRRA GIEPWAWVIN NSVAAAAVTS
PLLQARAHNE LREIAAVATH HASRYALVPL LKDEPIGMQR LQLLSKQGA
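
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before any processing. The following is a minimal sketch of how the gene sequence could be translated and its GC content computed using only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The sketch assumes that the codon-to-amino-acid assignments of translation table 11 match the standard code (the two differ in which start codons are permitted) and that the gene begins with ATG, as this one does.

```python
from itertools import product

# Compact codon table in TCAG order. Assumption: table 11 assigns the
# same amino acids as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_block = "ATGAAATTCC TTGATCAAGC CCCGCGTTTT"
print(translate(first_block))  # MKFLDQAPRF
```

The printed result matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.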