Gene Haur_3281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3281 
Symbol 
ID5735151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4147070 
End bp4149169 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content52% 
IMG OID641280429 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_001546046 
Protein GI159899799 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01511] copper-(or silver)-translocating P-type ATPase
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC AAACAATCAG CTTACCGATT CGCGGCATGG ATTGTTCAAG TTGTGCGCTG 
CATATTCAAA CCGCTTTGGG CGAGCTAAGC GGAGTGATGC AAGCGCAAGT GTTGTTTTCA
GCCGAAAAAG CCCTGATTAC CTTTGATGAT CAGCTGCTTG AGCCAAACGC TTTAGTCCAA
ACGATTGAAC AAGCTGGCTA TCAGGTGCCG CAACTTCAGG CGGCTCCGCC TAAATCAACC
CCTGCAAGTG CTCGTTTGTT GCTGCTCTTT GGTGCGATTG TCGTTGTAGT GTTGGGCTTC
AGCGTGCTTG GGGAGCAGCT TGGCTGGCTT GATTTACTAA TCGACTGGAT TCCTTGGCCA
ATTGGCGTGG CCTTGGTATT TGTGGCGGGC TATCGGATTT TCTGGCGCGT GGCTCAGGCT
GCTTGGCGCA AAACAGTGCT TTCGCATAGC TTGATGAGCA TTGGGGTGAT TGCTGCTTTA
GCGATTGGGC AGTGGCCAAC CGCCCTACTG GTAGTGCTGT TTATGAGCAT CGGCGAATAT
GTTGAACAAT TTACTTCGAG CCAAGCTCGC CGTGCAGTCA GAGAATTAAC CGCCTTGGCT
CCAACCCTTG CGCAGGTTGA ACGTGATGGA ACTGAAATTC AGCTACCAAT CGAGGCAGTT
CAAGTTGGTG ATGTGGTAGT GGTGCGGCCT GGCGATCAAA TTCCAGTGGA TGGTGTAGTG
CTGCGGGGAG CCGCCAGCAT CAATCAAGCA ACCATTACTG GCGAATCGAT GCCAATTGAT
GCAATTGAGG GAACCCAGGT TTTTGCCGCA ACCTTGGCAA CTGCGGGAAG TTTGCGCATT
CAAGCAACGG CGGTTGGCCG AGATAGCACC TTTGGCAATG TCATCAAATT GGTTGAAGAG
GCTGAATCGC AAAAAGCCGA AATTCAACGG ATCGGCGATG CTTTTTCAGA CTATTATCTG
CCAGTTGTGG CGCTGATTGC CTTGTTAACT TTTGTGATTC AGCGTGATCC CCTGGCAACG
GCGGCGGTGC TGTTGGTGGC TTGTTCATGT TCATTTGCGC TTGCAACACC GATTGCCATG
CTAGCGACGA TTGGCGCAGC AGCTAAACAA GGCATCTTGA TCAAAGGTGG TAAATATCTC
GAAGCGCTAG CCAAAACCAG TGTGCTACTC ATTGATAAAA CTGGTACGTT GACCCATGGC
CAGCCTGCGG TGAGTCAGGT TGTGGTGCAT GCTGCGACGA ATGAGGCCGA GGTTTTGTGC
TGGGCCGCGG CAGCCGAGCA AGATTCCGAG CATCCTTTAG CCAAGGCGAT TGTGCGAGCC
GCCCGTGATC GATCAATTGA CCTTCCGCAC GTTCGTGAAT TTAAGGCGCT GGCAGGGTCG
GGCGTACAAG CCGTGGTTGA GGGTCAAACC GTGGTTGTTG GGCATCAACG GTTGCTGGGC
GAACATCCCT TACAAGCTCA GGCCAACGCC CTAGAACAGC AGGGTCAAAC CGTGATTTGG
GTATTGCGCG AACAGCAAGT ATTGGGATTA ATTGCCTGTG CTGATCGCTT GCGAGCTGAT
GTTGCCCCAG CGATTGCTCA ACTACGGCGT TTGGGTATTG ATACAATTGA AATCTTGACT
GGTGATAATC GAGCAGTCGC AGCCAATATT GCCGAGCAAT TAGGCATTAG CTATCAAGCT
GAATTATTGC CCGCCGATAA ATTAGCAATT GTGCGCCGTT ACCAAGCGCA AGGCCACCAT
GTGGTGATGA TTGGCGATGG CGTGAATGAT GCTCCGGCTT TGGCTCAAGC CCATGTTGGA
ATTGCGATGG GTGTGGCTGG CACGGCGGTT GCCCTTGATG CTGCGCATAT CGCCTTATTG
CGCGACGATT GGAGTTTGAT TCCACAGGCC TTGGCTTTGG CATTACGCAC AATGCGGATC
GTTAAGGGCA ATCTTGGTTT TACTGTGGCC TATAACGTGA TTGGTCTTAG CTTGGCTGCT
TTGGGAATCT TACCGCCTGT TTTGGCCGCT GCCGCCCAAT CGCTGCCCGA TTTAGGCATT
ATGGTCAATT CAGCCCGATT ATTGCGTTAC AAACCTAAAC CTTCATCACT TCCTCAATAA
 
Protein sequence
MSQQTISLPI RGMDCSSCAL HIQTALGELS GVMQAQVLFS AEKALITFDD QLLEPNALVQ 
TIEQAGYQVP QLQAAPPKST PASARLLLLF GAIVVVVLGF SVLGEQLGWL DLLIDWIPWP
IGVALVFVAG YRIFWRVAQA AWRKTVLSHS LMSIGVIAAL AIGQWPTALL VVLFMSIGEY
VEQFTSSQAR RAVRELTALA PTLAQVERDG TEIQLPIEAV QVGDVVVVRP GDQIPVDGVV
LRGAASINQA TITGESMPID AIEGTQVFAA TLATAGSLRI QATAVGRDST FGNVIKLVEE
AESQKAEIQR IGDAFSDYYL PVVALIALLT FVIQRDPLAT AAVLLVACSC SFALATPIAM
LATIGAAAKQ GILIKGGKYL EALAKTSVLL IDKTGTLTHG QPAVSQVVVH AATNEAEVLC
WAAAAEQDSE HPLAKAIVRA ARDRSIDLPH VREFKALAGS GVQAVVEGQT VVVGHQRLLG
EHPLQAQANA LEQQGQTVIW VLREQQVLGL IACADRLRAD VAPAIAQLRR LGIDTIEILT
GDNRAVAANI AEQLGISYQA ELLPADKLAI VRRYQAQGHH VVMIGDGVND APALAQAHVG
IAMGVAGTAV ALDAAHIALL RDDWSLIPQA LALALRTMRI VKGNLGFTVA YNVIGLSLAA
LGILPPVLAA AAQSLPDLGI MVNSARLLRY KPKPSSLPQ