Gene Haur_5188 details


Gene Information

Locus tag: Haur_5188
Symbol:
ID: 5737146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 271120
End bp: 273972
Gene Length: 2853 bp
Protein Length: 950 aa
Translation table: 11
GC content: 60%
IMG OID: 641282352
Product: ATPase, P-type (transporting), HAD superfamily, subfamily IC
Protein accession: YP_001547943
Protein GI: 159901697
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0474] Cation transport ATPase
TIGRFAM ID: [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
            [TIGR01517] plasma-membrane calcium-translocating P-type ATPase
            [TIGR01523] potassium and/or sodium efflux P-type ATPase, fungal-type
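
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(271120, 273972)
print(length)                     # 2853
print(protein_length_aa(length))  # 950
```

Both results agree with the Gene Length (2853 bp) and Protein Length (950 aa) fields.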


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAGTA GCCATGTTGT CGCCATCTTT CAACAACCGA CGGATGTGGT GATCGCTGCA 
CTAAACAGCG ACATGAGCCA TGGATTGACC ACGGTGGAGG CGCACCGTCG TCACCGCCAA
TACGGTGCCA ATGAACTTGA TGCTGAACCG CCGATCCCTG CCTGGCGGAC GTTCCTCGCG
CAGTTTCAAA ACGTCCTCGT CATCGTACTC CTAATCGCGG CGGCGATATC GCTAGTAGTA
TGGCTCTACG AGCGCGAGCA GGCGCTCCCC TATGAGGCGA TCGTGATCGT CGCAATTGTG
CTGCTGAATG GAATCTTGGG CTTTATCCAA GAGGCGCGTG CAGCACGCGC CGTCGCGGCG
TTGCGTGCAC TGGCAGCCGC TGAGGCCTCC GTCATTCGCA ATGGGGAAAC CGTGCGTATT
GCGGCGACAG AACTGGTACC GGGCGATATT CTCCTGATCG AAGAAGGCGC AACGATTCCC
GCCGACGGGC GCGTTATCCA ATCGATAGCC CTGCATACCC TCGAAGCCTC GCTGACGGGG
GAGAGCTTGC CCGTCTCCAA GGATACCGAC CCGCTTACCA CCGCCGCGAG TCTTGGTGAT
CGCTCAAATA TGCTGTTTAG CGGCACAACG GCGAGCTATG GCCGCGGGCG AATGGTCGTG
ACCGCCACCG GCATGCAGAC GGAGATGGGC AAAATCGCTG GGCTTCTCCG ACAGACCACG
AGCGAGCGCA CGCCCCTGCA ACGGGAGCTT GATCGCACGG GGAAATGGCT TGGCATAGTC
GTCATAGGCA TGGCCGTCAT CATGATTGGG ACTATTCTGT TGCTAGAAGA GGTTCGGGAT
GTCAAAACCA TCGTTGCGGT GCTCATCCTG GGTGTAGCGC TCGCCGTGGC TGCCGTGCCA
GAGGGCTTAC CGACGATCGT GACGGCGGTC TTGGCGCTCG GGGTGCAACG CATGGCGCGT
CGCAAAGCCA TCATCCGCAA GTTGCCAGCA GTGGAAACGC TCGGCTCGGC CAGCATTATT
GCCTCTGACA AGACAGGGAC GCTCACGAGG AACGAGATGA CGGTGCGCAC CATCGTCACG
GCGAGTGGTC GTGTCGAGAT CGTGGGCATC GGCTATGGGC CAAGTGGGGA ACTTCGGCAG
ACCGACGACG CTTCGTCCAC CGAGGCCATG CGCAGCGAGG TCACCGCCAC GCTGTCGGCA
GCGAATCGTG CCAACAATGC GGTAGTGCTG GAGCGTGACG GACGCTGGAC GATTCTCGGC
GATCCCACCG AAGGAGCGCT GATTGTTGCC GCGCAGAAGG CTGGCCTCAC AGAGGAAACA
CTGACCGCCC GCTTCCCGCG CGTAGGCGAG GTGCCCTTCT CTTCAGAACG CAAGCTCATG
AGCACCGTCC ATACGGACAG CACGCACCCA GAGCGCCTGC TGGTCTTTAC CAAAGGTGCG
CCCGATGTCC TGTTGAATCA GTGTACGGCG GAATGGGTTG AGCACGCTCC ACGTCGATTA
AGCGAGGAGC GCCGCGCAAC ACTGCGCACA CTCAACGAGC AGTTGGCGGG TGAGGGATTG
CGGATTATCG GGATTGCCAG CCGTGTGCTA CCCCGTGATG CGCTCGACCA AGCCCACGCG
CTCAACGATG AGCTGGAACA TGATCTGGTG CTGCTTGGCT TCGTGGGCAT GATCGACCCG
CCGCGCGACG AAGCCAAGGC CGCGATCACA CGGGCGAAAA TGGCCGGCAT TCGGTCGATT
ATGATTACCG GCGATCATCC CAAAACCGCG ATGGCAATTG CGATGGAGTT AGGGATTGCT
GGTACCACCG CAGCGGTAAC AGGAGCAGAG GTGGAGTCGT TGTCAGAGGA GGCACTTCGC
ACGCTCGTGC AGGAATGCTC AGTCTATGCA CGGGTTAACC CGGAACACAA GCTCCGCCTC
GTCAAAGCGC TCCAACAGAA CGGTGCGGTG GTCGCGATGA CTGGTGATGG CGTTAACGAT
GCGCCAGCAC TCAAAGCAGC CGATATTGGG GTGGCGATGG GAATAACGGG AACCGATGTG
TCCAAGGAAG CCGCCGACAT GATCCTCGCC GACGACAATT TCGCCACTAT TGTCGCCGCC
GTCGAAGAAG GGCGTGCCAT CTTTGTAAAC ATCCAGAAGT TCTTATTTTT CCTGCTTTCG
TCGAACATCG GCGAAGTCCT GACGATGTTC GGCGGGGTCG TACTCGCCAG TGTCTTGGGA
TTAAGCGCTG GCAATGACGC GATTATTGTT CCCTTACTGG CAACCCAGAT TTTGTGGATA
AATCTGGTTA CCGACGGCAC GCCCGCGCTA GCGCTTGGCC TCGAACCAGC GAATGCCGCA
GTGATGCATC AACCGCCGCG TCCGCATGGG AGCAGTGTCA TTCCGCGCGG GATGTGGATA
CGCATCCTCG TGGTTGGGGT CATTATGGCC GTCGGAACGC TGCTGGTCCT TGATGCCGCC
TTGCCGGGTG GCCTGATCCA TGGGTCGCAG ACGATGGAGT ACGGGCGGAC AATGGCATTC
ACAACCCTGA TGCTATTCCA AATGTACAAC GTCTTCAACG CCCGCTCCTA TACCCAGAGC
GCATTCTCCC ACCCGTTCCA GAATCCTTGG CTGTGGGGCG CAGTAACCAT GTCTCTTGTG
CTCCATATGA TGGTCATCAC TGTGCCAGTG TTGCAGCGTG CCTTCAGTAC CGTGTCTTTG
ACCGCGCGTG ACTGGCTGAC CTGCCTCCTT GTCGCGAGCA TCGTGCTATG GGTCCGTGAG
CTGGATAAGG TCGGGCAACG ACACCGGCTG CGTGGCACGC AGGGGGAACA CCGGGGCACC
ACAACAAACA CCGAAACGGA TCACAGCGGA TAA
 
Protein sequence
MTSSHVVAIF QQPTDVVIAA LNSDMSHGLT TVEAHRRHRQ YGANELDAEP PIPAWRTFLA 
QFQNVLVIVL LIAAAISLVV WLYEREQALP YEAIVIVAIV LLNGILGFIQ EARAARAVAA
LRALAAAEAS VIRNGETVRI AATELVPGDI LLIEEGATIP ADGRVIQSIA LHTLEASLTG
ESLPVSKDTD PLTTAASLGD RSNMLFSGTT ASYGRGRMVV TATGMQTEMG KIAGLLRQTT
SERTPLQREL DRTGKWLGIV VIGMAVIMIG TILLLEEVRD VKTIVAVLIL GVALAVAAVP
EGLPTIVTAV LALGVQRMAR RKAIIRKLPA VETLGSASII ASDKTGTLTR NEMTVRTIVT
ASGRVEIVGI GYGPSGELRQ TDDASSTEAM RSEVTATLSA ANRANNAVVL ERDGRWTILG
DPTEGALIVA AQKAGLTEET LTARFPRVGE VPFSSERKLM STVHTDSTHP ERLLVFTKGA
PDVLLNQCTA EWVEHAPRRL SEERRATLRT LNEQLAGEGL RIIGIASRVL PRDALDQAHA
LNDELEHDLV LLGFVGMIDP PRDEAKAAIT RAKMAGIRSI MITGDHPKTA MAIAMELGIA
GTTAAVTGAE VESLSEEALR TLVQECSVYA RVNPEHKLRL VKALQQNGAV VAMTGDGVND
APALKAADIG VAMGITGTDV SKEAADMILA DDNFATIVAA VEEGRAIFVN IQKFLFFLLS
SNIGEVLTMF GGVVLASVLG LSAGNDAIIV PLLATQILWI NLVTDGTPAL ALGLEPANAA
VMHQPPRPHG SSVIPRGMWI RILVVGVIMA VGTLLVLDAA LPGGLIHGSQ TMEYGRTMAF
TTLMLFQMYN VFNARSYTQS AFSHPFQNPW LWGAVTMSLV LHMMVITVPV LQRAFSTVSL
TARDWLTCLL VASIVLWVRE LDKVGQRHRL RGTQGEHRGT TTNTETDHSG
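
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons), and that the sequence is read in frame from its first base. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence.
print(translate("ATGACGAGTA GCCATGTTGT C"))              # MTSSHVV
# Final line of the gene sequence, which starts in frame.
print(translate("ACAACAAACA CCGAAACGGA TCACAGCGGA TAA"))  # TTNTETDHSG
```

The two fragments reproduce the first seven and last ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.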