Gene Arth_0209 details

Gene Information

Locus tag: Arth_0209
Symbol:
ID: 4447334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 219011
End bp: 220876
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 64%
IMG OID: 639688005
Product: arsenite-activated ATPase ArsA
Protein accession: YP_829710
Protein GI: 116668777
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
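
The coordinate and length fields above are consistent with one another, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state its coordinate convention) and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(219011, 220876)
print(length)                  # 1866
print(protein_length(length))  # 621
```

Both results match the Gene Length and Protein Length values listed in the record.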


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAAAACC TCCGGCTTCC TGGTGACAGG AGGCCGGAGG TTTTTTGTTG CCAGGAGGTG 
GCCGCCCTTA CGTCCATCGA CCATCGTCGA TATGCTGGGC GGGTGAAGTT TCTCCAGAAC
GCACCCCGCT TCCTGTTTTT CACCGGGAAG GGCGGTGTGG GCAAGACCTC CGTGGCGTGT
GCAACAGCGC TCACCCTTGC CAGGGCCGGC CGGAAGGTCT TGCTGGTCAG CACCGACCCC
GCGTCCAATG TGGGACAGGT CTTCGGCGTG ACCATCGGGA ACACCGTCAC TGCCATCCAG
GATGTACCAG GTCTCTCTGC GCTGGAAATA GACCCTGAAC AGGCCGTCGA AGCCTACCGG
GAACGGATCA TAGCCCCCGT CCGCGGACTG CTGCCCGAGA CTGAATTGGC TGGCATCGCG
GAGAGTCTCT CAGGTTCCTG CACCACGGAG ATCGCCTCGT TCGACGAATT CACGAACCTG
CTCGCCGACG ACAGCTCGTA CAGGGAGTAT GACCACATCG TTTTCGATAC CGCCCCGACG
GGCCACACGA TCCGGCTGTT ACAGCTGCCT GGCTCGTGGA CGGATTTCCT GGCGGCAGGC
AAGGGGGACC CGTCCTGCCT CGGACCCCTG TCAGGACTTG AGAAGCACAA GCAGGTATAT
GCAAAGGCGG TCCAGGCCCT CACGGACCCG GCGAAGACCC GGCTTGTGCT GGTCAGCCGT
GCGCAAACAT CGTCGCTGGG CGAGATCGAG CGGACGTATC TGGAGCTCAA CCAGATCGGG
ATCGGCAGCG GTTACGTCGT CGTCAACGGC GTCCTGCCCG ACGCGGCAGG GGAGGAGGCC
CTGGCGCAGG CCCTGCGCGC CCGGGAGGCT GCTGCCATGG AGGCCATTCC CGATGCGGTC
GCCGGCCTGC CCCGCGATGT CCTGGACCTG AAGCCGGGCA ACATGGTCGG CATTCCGGCC
CTTGAATCCC TGTTCGCGCC TACTTCCGGC GCCCCCATCA CTGACGACGC CGCGCTGGTT
CCCGAGATTG AGGATGCCCC GCTGGCTGCC CTGGTGAACG AAGTCGAGCT CGACGGGCAC
GGCCTCGTGA TGTGCATGGG CAAGGGCGGA GTCGGGAAGA CCACTGTCGC TGCTGCTATT
GCGGTCGCCC TGGCCAAACG CGGGCATGCG GTCCACCTCA CCACGACCGA TCCCGCGGCC
CACCTGACGG AGACGCTCCA TGGTTCCATC CCGGGCTTGA AGGTTTCCCG CATCGATCCG
GAAGCGGCCA TCCAGGAATA TCGCATCCAT GTGATGGAGA CCAAGGGCCG GAACCTGGAC
GATGACGGAC GCGCCGCGCT GGCCGAGGAC CTGATGTCCC CGTGCACGGA CGAGGTGGCT
GTGTTCAGGC AGTTCTCCAG GGTGGTCCAA GAGTCCCGCC GACACTTCGT GGTGATCGAC
ACGGCACCCA CCGGACACAC CCTGCTGCTC TTGGACGCCA CCGGTTCGTA CCACCGGGAG
ATTGCCCGCC AGGTCGGCGA CACCATGGGA TTCGTGACGC CCCTCATGCG GCTGCAGGAC
CCGGCCCAGA CCAAGGTTGT CCTCGTTACC CTGGCCGAAA CAACGCCGGT ACTGGAAGCC
GAGGAGTTGA AGAGTGACCT GGAACGGGCC GGAATTCATC CATGGGCCTG GGTGATCAAC
AACTCGATCG CAGCCGCGCA CCCGCAGACG CCGTTCCTGC GGGCCCGCGC CGCGAGCGAA
ATTGAACAAA TCACAAAGGT GCACACCCTG ACGGACCGGG TAGCGCTCAT TCCGTTGCTG
CCCGAGGAAC CCATCGGCGA GGAGAAGCTG TCTGCCTTGA CCGTCCTCAG CTCCCTGCCG
GCCTGA
 
Protein sequence
MENLRLPGDR RPEVFCCQEV AALTSIDHRR YAGRVKFLQN APRFLFFTGK GGVGKTSVAC 
ATALTLARAG RKVLLVSTDP ASNVGQVFGV TIGNTVTAIQ DVPGLSALEI DPEQAVEAYR
ERIIAPVRGL LPETELAGIA ESLSGSCTTE IASFDEFTNL LADDSSYREY DHIVFDTAPT
GHTIRLLQLP GSWTDFLAAG KGDPSCLGPL SGLEKHKQVY AKAVQALTDP AKTRLVLVSR
AQTSSLGEIE RTYLELNQIG IGSGYVVVNG VLPDAAGEEA LAQALRAREA AAMEAIPDAV
AGLPRDVLDL KPGNMVGIPA LESLFAPTSG APITDDAALV PEIEDAPLAA LVNEVELDGH
GLVMCMGKGG VGKTTVAAAI AVALAKRGHA VHLTTTDPAA HLTETLHGSI PGLKVSRIDP
EAAIQEYRIH VMETKGRNLD DDGRAALAED LMSPCTDEVA VFRQFSRVVQ ESRRHFVVID
TAPTGHTLLL LDATGSYHRE IARQVGDTMG FVTPLMRLQD PAQTKVVLVT LAETTPVLEA
EELKSDLERA GIHPWAWVIN NSIAAAHPQT PFLRARAASE IEQITKVHTL TDRVALIPLL
PEEPIGEEKL SALTVLSSLP A
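
The sequences above are displayed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, of how the display formatting might be stripped and how the GC fraction and basic coding-sequence structure could be checked. The helper names are hypothetical, and the start-codon set is a simplification: it lists only the three most common bacterial start codons, whereas translation table 11 permits a few others. The gene sequence in this record begins with GTG, an alternative start codon under table 11 that is still translated as methionine at the initiation position, which is why the protein sequence begins with M.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough structural check: whole codons, a common start codon, and a stop codon.

    START_CODONS is a simplified subset of the table 11 initiators (assumption).
    """
    start_codons = {"ATG", "GTG", "TTG"}
    stop_codons = {"TAA", "TAG", "TGA"}
    return (
        len(seq) % 3 == 0
        and seq[:3] in start_codons
        and seq[-3:] in stop_codons
    )


# First three blocks of the gene sequence, as displayed in the record.
first_blocks = clean_sequence("GTGGAAAACC TCCGGCTTCC TGGTGACAGG")
print(first_blocks[:3])  # GTG

# Toy sequence (not from the record) to show the structural check.
print(looks_like_complete_cds("GTGGCCTGA"))  # True
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content listed in the record.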