Gene Haur_4235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4235 
Symbol 
ID5736089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5398511 
End bp5400121 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content50% 
IMG OID641281390 
ProductAAA ATPase 
Protein accessionYP_001546995 
Protein GI159900748 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0606] Predicted ATPase with chaperone activity 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTTT TACTCGATGC GCGTTCGTAT GACGAAATGA AAGCCCGCAA TGATCATACA 
CCCGACTGCT ACGTGCGACC GGGCGAATAT TTGCATCATA TGTATGTCGA TGGGATTAAG
TATGCCGTCG CCCGTGCCTT GTATGATTGC TTGAAGGAAG CGCGTGAGAA GAATGATTAT
GAGGCGACGC GCCACTTAAA TATGCACAAA CAACAGCTTG ATGCGCTGGT GGCTGAGGCC
CAAGCCAGTG GCAAATTGAG CGGTGCGCGG GTACTTGGGG GGGATGTGCC GAACGATGAC
GGGTCAGCCC CATCGGGGAA TAGCTATCAC CCCCAAGTAC CAACCACGAT TGAAGAAACT
GGCCTGAATC GAACCCAAAT TCAGGACCAA TTGCTACGGG TAATTTATAA CCGTTCGCGG
GTCACTGGCA TGGAGCTAGC CCAAGAAGTA CGCTTGTTCT ATAGCGTGGT CGATCCGGTG
ATCACCCAAA TGCGCAACTC GGAATATATC GATATTGCTG GGCAACGTGG CTTTGGCGAT
ACCAATTACG AATATATTTT GACTCCGCGT GGCTCGCAAG CTGCCGAAGA TGCCATGAAG
AAAAGCAACT ATAGCGGGCC AGCTCCAGTT CCTTTTGCTC AATTTCTCGA ATCGGTCAAG
GCTCAAACGA TCAAAAATAT GGTGATCACA CGGCGCAATA TTCGCAAAGC CTTTAGCGAT
TTGATTATCA CCGACCAAGT GCTGAACGAA GTTGGGCCAG CGGTCAACTC GGGCGCGTCG
ATCTTCTTGT TTGGCTACCC TGGCAATGGC AAAACCAGTA TCGCCGAACG CATCACCCGC
CTGATGGGTG ATGATATTTT CGTGCCGTAT GCGGTAGATG CCGATGGCCA AATTATCAAA
GTTTACGATA GCATTGTGCA TACGTTGGTC GATAAAGAGG CCAATATCGG AACTTCGGCC
TATGATCTGC GCTGGGCTAA AATCAAACGG CCTGTGGTGG TGGTCGGGGG CGAACTAACG
CTTGAAGCGC TCGATTTGAC GTTCAACGAA GCAGGCCGTT TTTATGAAGC ACCCTTCCAG
ATGAAAGCCA ACGGCGGGAT TTTCTTGATC GACGACTTTG GTCGCCAACA ATGTCGCCCG
ATGGACTTGC TGAACCGCTG GATTGTGCCG CTCGAAAAGC GCTACGATTA TTTGACCACG
ATTACTGGCC AAAAAATCGA AGTTCCATTC GATCAATTGC TGATTTTTTC GACCAACCTC
GACCCTAGCC AAGTGGCTGA CGAAGCCTTC TTGCGCCGGA TCAAATTCAA AATTGAGGTG
CGCGACCCCG ATGAATCGCA ATGGCGACAA ATTTGGTCGT TGGTTTGTAA AGGCCGCAAA
ATTAACCTCG ACCCCAAGGG CTTAGATTAT TTGGTGGAAA AATGGTACAA GCCCGATGAT
CGGCCTTTCC GTATGTGCCA ACCGCGCGAT ATTCTCGATC AGATGATCAG CATCGCCAAA
TATAATATGG AACAAGTCAC CTTTAATCCC GATTTGATTG ATGCAGCGTG TGGCACGTAC
TTTGTCAGCA AAGAAGCCAA AAACTTCGGT GCCAAAGTTC GCCTCGATTA A
 
Protein sequence
MRFLLDARSY DEMKARNDHT PDCYVRPGEY LHHMYVDGIK YAVARALYDC LKEAREKNDY 
EATRHLNMHK QQLDALVAEA QASGKLSGAR VLGGDVPNDD GSAPSGNSYH PQVPTTIEET
GLNRTQIQDQ LLRVIYNRSR VTGMELAQEV RLFYSVVDPV ITQMRNSEYI DIAGQRGFGD
TNYEYILTPR GSQAAEDAMK KSNYSGPAPV PFAQFLESVK AQTIKNMVIT RRNIRKAFSD
LIITDQVLNE VGPAVNSGAS IFLFGYPGNG KTSIAERITR LMGDDIFVPY AVDADGQIIK
VYDSIVHTLV DKEANIGTSA YDLRWAKIKR PVVVVGGELT LEALDLTFNE AGRFYEAPFQ
MKANGGIFLI DDFGRQQCRP MDLLNRWIVP LEKRYDYLTT ITGQKIEVPF DQLLIFSTNL
DPSQVADEAF LRRIKFKIEV RDPDESQWRQ IWSLVCKGRK INLDPKGLDY LVEKWYKPDD
RPFRMCQPRD ILDQMISIAK YNMEQVTFNP DLIDAACGTY FVSKEAKNFG AKVRLD