Gene Haur_3879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3879 
Symbol 
ID5735728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4870578 
End bp4871753 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content52% 
IMG OID641281030 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_001546641 
Protein GI159900394 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTGA TTTTATATCT TGGCAAGGGT GGCGTTGGCA AAACCACAAC CGCTGCGGCA 
ACCGCCGTGC GAGCATCGCG CATGGGCTAT CGAACCTTGG TGGTCAGCAC CGATGTGGCT
CACTCCTTGG CCGATGCGCT CGATTGCCAA GTTGGCCCTA GCCCCACCAA GCTGAGCGAC
AACCTTTGGG CCCAAGAAAT CAATGTGTTG GAGGAAGTAC GCCAACACTG GGGCGAGTTG
CAAGGCTTTG TCTCCAATTT GCTCAAGCGC AAGGGCGTAA ACGAAGTTGC CGCCGAAGAA
CTAGCAGTAA TTCCAGGCAT GGAAGAAGTT GTCAGTTTGT TACATATTCG CAAACAAGCG
AAAGAAGGCA ACTACGATGC AGTCATTGTT GATGCAGCGC CAACTGGCGA AACCGTGCGC
TTGCTGACCA TGCCCGAAAC CTTTACTTGG TATGCTTCGC GGGTGATGCA ATGGGAAACC
AGCACCATGA AAGTGGCCAA GCCCTTGATT CGGGCATTGG TGCCAGCCTC GGATATGTTC
GATACCTTGC CACGCTTTGT TGAGCAGGTT GAAGCGCTGC GGGCAACCTT AGCCGACCCC
AAAATCAGTT CCTATCGTTT GGTGGTCAAC CCCGAGCGCA TGGTAATCAA AGAGGCTCAA
CGCGCCGCAA CCTACTTGGC CTTGTATGGC TATCCGGTCG ATGGCGTGGT GCTTAATCGG
GTGATGCCTA GCGATGTGCG TGGCCATAGT TTTATCGAAC AAATGCAAGA AATTCAGGCT
AGCTATCGCG CTCAAGTTCA TGATATTTTC ACGCCACTGC CAATTTGGGA AGCCCCAATG
TATGCCCGTG AGATCAAAGG GCTTGATGAT TTGGCCGATG TGGGGGCAGC CTTATTTGGC
GAGCGCAATC CACTTGATGT CTTTTATGTG GGTAAAACCA TGGACATCAC CAAGCAAGGC
GATCAGTATG AGCTACGTTT GCCTTTACCA CATGTCGAAG TTAATAAAGT CAATATGACC
AAACGCGGCG ATCAGCTGTT TATTGAAATT GGCAACTTCC GCCGCGAGAT GATTTTACCG
ACGATGTTGG CTGATCGGCC AGCGCTACGC GCGGTGTTTC GCAATGGCGA GTTGGTCGTA
CAATTTGGTG CTCCCGCCCC ACTCGAAGCT GTGTAA
 
Protein sequence
MRLILYLGKG GVGKTTTAAA TAVRASRMGY RTLVVSTDVA HSLADALDCQ VGPSPTKLSD 
NLWAQEINVL EEVRQHWGEL QGFVSNLLKR KGVNEVAAEE LAVIPGMEEV VSLLHIRKQA
KEGNYDAVIV DAAPTGETVR LLTMPETFTW YASRVMQWET STMKVAKPLI RALVPASDMF
DTLPRFVEQV EALRATLADP KISSYRLVVN PERMVIKEAQ RAATYLALYG YPVDGVVLNR
VMPSDVRGHS FIEQMQEIQA SYRAQVHDIF TPLPIWEAPM YAREIKGLDD LADVGAALFG
ERNPLDVFYV GKTMDITKQG DQYELRLPLP HVEVNKVNMT KRGDQLFIEI GNFRREMILP
TMLADRPALR AVFRNGELVV QFGAPAPLEA V