Gene Information |
Locus tag | Haur_3879 |
Symbol | |
ID | 5735728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4870578 |
End bp | 4871753 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281030 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_001546641 |
Protein GI | 159900394 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
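
The coordinate and length fields in this record agree with one another, and a short sketch can confirm that. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(4870578, 4871753)
print(length, protein_length_aa(length))  # 1176 391
```

Both results match the Gene Length (1176 bp) and Protein Length (391 aa) fields.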
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTGA TTTTATATCT TGGCAAGGGT GGCGTTGGCA AAACCACAAC CGCTGCGGCA ACCGCCGTGC GAGCATCGCG CATGGGCTAT CGAACCTTGG TGGTCAGCAC CGATGTGGCT CACTCCTTGG CCGATGCGCT CGATTGCCAA GTTGGCCCTA GCCCCACCAA GCTGAGCGAC AACCTTTGGG CCCAAGAAAT CAATGTGTTG GAGGAAGTAC GCCAACACTG GGGCGAGTTG CAAGGCTTTG TCTCCAATTT GCTCAAGCGC AAGGGCGTAA ACGAAGTTGC CGCCGAAGAA CTAGCAGTAA TTCCAGGCAT GGAAGAAGTT GTCAGTTTGT TACATATTCG CAAACAAGCG AAAGAAGGCA ACTACGATGC AGTCATTGTT GATGCAGCGC CAACTGGCGA AACCGTGCGC TTGCTGACCA TGCCCGAAAC CTTTACTTGG TATGCTTCGC GGGTGATGCA ATGGGAAACC AGCACCATGA AAGTGGCCAA GCCCTTGATT CGGGCATTGG TGCCAGCCTC GGATATGTTC GATACCTTGC CACGCTTTGT TGAGCAGGTT GAAGCGCTGC GGGCAACCTT AGCCGACCCC AAAATCAGTT CCTATCGTTT GGTGGTCAAC CCCGAGCGCA TGGTAATCAA AGAGGCTCAA CGCGCCGCAA CCTACTTGGC CTTGTATGGC TATCCGGTCG ATGGCGTGGT GCTTAATCGG GTGATGCCTA GCGATGTGCG TGGCCATAGT TTTATCGAAC AAATGCAAGA AATTCAGGCT AGCTATCGCG CTCAAGTTCA TGATATTTTC ACGCCACTGC CAATTTGGGA AGCCCCAATG TATGCCCGTG AGATCAAAGG GCTTGATGAT TTGGCCGATG TGGGGGCAGC CTTATTTGGC GAGCGCAATC CACTTGATGT CTTTTATGTG GGTAAAACCA TGGACATCAC CAAGCAAGGC GATCAGTATG AGCTACGTTT GCCTTTACCA CATGTCGAAG TTAATAAAGT CAATATGACC AAACGCGGCG ATCAGCTGTT TATTGAAATT GGCAACTTCC GCCGCGAGAT GATTTTACCG ACGATGTTGG CTGATCGGCC AGCGCTACGC GCGGTGTTTC GCAATGGCGA GTTGGTCGTA CAATTTGGTG CTCCCGCCCC ACTCGAAGCT GTGTAA
Protein sequence | MRLILYLGKG GVGKTTTAAA TAVRASRMGY RTLVVSTDVA HSLADALDCQ VGPSPTKLSD NLWAQEINVL EEVRQHWGEL QGFVSNLLKR KGVNEVAAEE LAVIPGMEEV VSLLHIRKQA KEGNYDAVIV DAAPTGETVR LLTMPETFTW YASRVMQWET STMKVAKPLI RALVPASDMF DTLPRFVEQV EALRATLADP KISSYRLVVN PERMVIKEAQ RAATYLALYG YPVDGVVLNR VMPSDVRGHS FIEQMQEIQA SYRAQVHDIF TPLPIWEAPM YAREIKGLDD LADVGAALFG ERNPLDVFYV GKTMDITKQG DQYELRLPLP HVEVNKVNMT KRGDQLFIEI GNFRREMILP TMLADRPALR AVFRNGELVV QFGAPAPLEA V
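
Both sequences are printed in blocks of ten characters. The following is a minimal sketch for stripping that layout, computing a GC fraction, and translating the coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the tables differ only in which codons may act as starts. The helper names are hypothetical, and the sketch does not handle ambiguity codes or the alternative-start rule, which is not needed here because this gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for ten-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first two blocks of the gene sequence in this record.
dna = clean_sequence("ATGCGTTTGA TTTTATATCT")
print(len(dna), gc_fraction(dna), translate(dna))  # 20 0.25 MRLILY
```

The translated prefix matches the start of the protein sequence above (MRLILY...). Passing the full Gene sequence field through the same functions would let the GC content and Protein sequence fields be checked in the same way.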