Gene Haur_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3018 
Symbol 
ID5734875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3811517 
End bp3813436 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content55% 
IMG OID641280162 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_001545784 
Protein GI159899537 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000244984 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACCA CAAGTGTTAA TAGCCGCCAC CTGCATGCTG AGGCCCAAGC TGAGCTTGAT 
GAACTCGCAG CATTGCCATG GATGATGCGC CTGACCGCGA TTTGTTTGAT CGCTACAGGT
TTGGGCTGGG TTGCGGGGAG TTTTGCGGGC GTGCCAACAT GGTTGCCTTG GCTGTTGTAT
GTGGTAGCGT TTGCCAGCGG CGGCTGGTTT CCGCTGGGCA ATGCCTGGGA AAGCCTGCGC
CAACGCGAAT TCGATGTTAA TTTCTTGATG ATTGTGGCCG CAATCGGGGC GGCGGCAGTT
GGTCAGCCGC GCGAAGGCGC AATTTTGATG TTTTTGTTTG CGCTTTCCAA TACGCTTGAA
ACCTATGCCA TGGGTCGCAC TCATCGGGCC GTCAATGCCC TGCTGGAGAT GGCTCCCGAC
CAAGCAACCC TGATCGCTGC TGATGGTACG CAACAAGTCG TGGCCGTGGC TGATTTAGCG
ATTGGCGATC GGGTATTGGT ACGGCCTGGC GAACGGATTC CAGTTGATGG GATTGTGCGG
ATCGGCGCTT CATCAATTAA CGAAGCCGCA ATTACTGGCG AATCAGTGCC AGTTGATAAA
GGCGCGGGGA GCAAGGTATT TGCTGGTACT CTCAATACTA CTGGCGCTCT CACCATTGAA
GTAACAGTAG CAGTTGGCGA TACGACCTTG GCGCGAATTA TCGAGACCGT AGCTGAAGCT
CGCAGTCAAA AAGCCAAAGC CCAAGATTTT ACCGATCGGG TGATTGGTCA ATATTATGCC
TATGCTGTCG TCGTGATGAC CTTGTTGGCG ATTGCGATTC CCTTGCTATT TCTCGATTGG
AGCGTTAAAA CCACGCTCTA CCGCGCAATG GCACTGATGG TTGTGGCCTC GCCCTGTGCC
TTGGTGATTT CAATTCCAGC GGCGATGCTT TCGGCCATGG CCAATGCTGC CCGCCACGGG
ATGTTGTTCA AAGGTGGGCG CTATCTTGAA GCTGCCGCTA AAATTAAGGT CGTGGCCTTG
GATAAAACTG GTACGCTGAC CACAGGCCAA TTGAGCGTGA TGCAAACTGT GGATCTTGGT
CAACGCCCAA CGGAACAATG GCTAATCGCC GCAGCAGCGG TCGAAGCCTT CTCAGAACAT
CCCTTGGCCA AAGCGATTGT CGCCCATGCT CAAGATCAGA AACTGAATGT CCCAACTGCG
GTTGATTTCC AATCGATCAC TGGGATGGGT GCCCAAGCGG ATGTGCATGG CGAATTGGTG
CAAGTTGGTC GGCCCCGTTT GTGGGGTGAG GCTGTGCTAG CCCAAGCTGC CAAACTCGAA
GCTCAAGGCG CAACCGTGAT CGGGGTTGGC ACAACCGAAC AAGCCTGGGG CTTAATTGCC
TTGGCTGATA CGATTCGGCC TGATGCCAAG CAAGCGATTG CCGCGCTGCA CGCTGCGGGT
GTTGAACGAG TGGTGCTGTT GACTGGTGAT AATCAAGCCG TAGCACAGCA TGTGGCCAGC
CAAATTGGCA TTGATGATGT GCGTGCCGAA CTGTTGCCTG GCGATAAAGC CCAGATCATC
GAAGAATTAC AACAACGCTA TGGCCCAGTT GCGATGGTTG GCGATGGCGT GAATGATGCC
CCAGCCTTGG CCACGGCCCA ACTTGGTGTG GCAATGGGCG TTGCTGGTAC TGATGTAGCC
GTGCAAAGCG CCGATGTGTT GTTGCTGAGC GACGATTTGC TGAAATTGGC TGAAGCGTTA
CGGCTTGGCC GCCGCACCCA ACGGATTGTT TGGCAAAATA TTGCCTTTGC TGGCGGCGTG
ATTGTGGTGC TGATTGCTTC AGCCTTGTTT GGCAATATTG CCTTGCCGTT GGGCGTGGTT
GGCCATGAAG GGAGCACCTT GTTGGTGGTT GCCAACGGCT TGCGCTTGCT ACGACGCTAA
 
Protein sequence
MTTTSVNSRH LHAEAQAELD ELAALPWMMR LTAICLIATG LGWVAGSFAG VPTWLPWLLY 
VVAFASGGWF PLGNAWESLR QREFDVNFLM IVAAIGAAAV GQPREGAILM FLFALSNTLE
TYAMGRTHRA VNALLEMAPD QATLIAADGT QQVVAVADLA IGDRVLVRPG ERIPVDGIVR
IGASSINEAA ITGESVPVDK GAGSKVFAGT LNTTGALTIE VTVAVGDTTL ARIIETVAEA
RSQKAKAQDF TDRVIGQYYA YAVVVMTLLA IAIPLLFLDW SVKTTLYRAM ALMVVASPCA
LVISIPAAML SAMANAARHG MLFKGGRYLE AAAKIKVVAL DKTGTLTTGQ LSVMQTVDLG
QRPTEQWLIA AAAVEAFSEH PLAKAIVAHA QDQKLNVPTA VDFQSITGMG AQADVHGELV
QVGRPRLWGE AVLAQAAKLE AQGATVIGVG TTEQAWGLIA LADTIRPDAK QAIAALHAAG
VERVVLLTGD NQAVAQHVAS QIGIDDVRAE LLPGDKAQII EELQQRYGPV AMVGDGVNDA
PALATAQLGV AMGVAGTDVA VQSADVLLLS DDLLKLAEAL RLGRRTQRIV WQNIAFAGGV
IVVLIASALF GNIALPLGVV GHEGSTLLVV ANGLRLLRR