Gene Haur_1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1438 
Symbol 
ID5733302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1664774 
End bp1666024 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content46% 
IMG OID641278576 
ProductATPase central domain-containing protein 
Protein accessionYP_001544210 
Protein GI159897963 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACAAAG AAACATATTT ACATGATGCC TTGCAATTGC CCAGCCAATC GTTTACGTAT 
ATGGTGAGCC AGCAATTAGC CGCGCATTAC CCTGATCAAG GCATTCTCCA AACTGGAGAT
TGCGATTTTG ATGTGCGCAG CTATGCTGGG GCGGGCTTGT GTGAGGTCCA GGTTCATCAA
CAGCCCTATC CGCAAATTAA CTATTCATGG CTACGGGGCG AGGAAGGTGG TGAGATTGCT
CGTTCGCTAG AAAATGCTTG GCAAACTATC ACTTGGCAAG ACCAGAGCTT TGATTTGCTG
CTGCTTTCGT GGACAGCCTA TGGTACAACC TATAGTCTGC AATGGTTGAT TGGCGCTTCA
CAAGCAGCCG TCGAAGCGTT TTTTAGTGCA GTCTGCGATT GGAACACCGA AATTCGTGAT
GAAATTATGG TGTTTGATGA AGGCAGTTGG GAGAAAAGCC AAGAATTATA CTATGCGATT
AAAAATGCCT CTCTCGATAA TTTGATTTTG CCTGGTACGC TCAAACAAGA TATTTTCCGC
GATTTGCAGC GCTTTTTTGA AAGCAAAGCC ACCTACGAAC ACTATAATAT TGCTTGGAAA
CGCGGGATTA TTCTGGTTGG CCCGCCTGGC AATGGCAAAA CCCACATGAT TAAAGGCTTG
CTGAACGCCT TGGATTACCC ATGTTTGTAT GTCAAAAGCT TCGATGCTCA ATATTCGACC
AATAACGCCA ATATTCGGGC GGTGTTTGAT CGGGCACGCC GCAGTGCTCC CTGTATCGTG
GTGCTGGAAG ATTTAGATTC GTTGATCAAC GATACCAACC GAGCCTTCTT TTTGAATGAA
GTTGATGGTT TTAACGCAAA TCAAGGGGTT GTGTTACTGG CAACCACCAA CCACCCTGAG
GATATTGACC CAGCGATTAT GAACCGACCC AGCCGTTTTG ACCGCAAATA TTACTTTAGC
TTGCCCGAAC TGGCCGAGCG TTTGGCCTAT ATTGAACAAT GGAATACGGC GCTGCATAGC
TCAACCCAAT TAACTGAAGC TGGCATTCAG CAGGCTGCCG AAGTAACCGA AGGCTTTTCG
TTTGCCTACT TGAAAGAATT GTTTGTTTCG GCTTTGATGC GCTGGATCGA TATCAAAGAC
CAAACTGATA TGGATACTGT GATTTTGGAG CAAGCTACAT TCTTGCGCGA GCAAATGGTG
ACTGAAGATC CTGAAGCAGC CGAAGAGGAA ACTGAAGACG ACGAAGAATA G
 
Protein sequence
MNKETYLHDA LQLPSQSFTY MVSQQLAAHY PDQGILQTGD CDFDVRSYAG AGLCEVQVHQ 
QPYPQINYSW LRGEEGGEIA RSLENAWQTI TWQDQSFDLL LLSWTAYGTT YSLQWLIGAS
QAAVEAFFSA VCDWNTEIRD EIMVFDEGSW EKSQELYYAI KNASLDNLIL PGTLKQDIFR
DLQRFFESKA TYEHYNIAWK RGIILVGPPG NGKTHMIKGL LNALDYPCLY VKSFDAQYST
NNANIRAVFD RARRSAPCIV VLEDLDSLIN DTNRAFFLNE VDGFNANQGV VLLATTNHPE
DIDPAIMNRP SRFDRKYYFS LPELAERLAY IEQWNTALHS STQLTEAGIQ QAAEVTEGFS
FAYLKELFVS ALMRWIDIKD QTDMDTVILE QATFLREQMV TEDPEAAEEE TEDDEE