Gene Haur_1957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1957 
Symbol 
ID5733846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2380632 
End bp2383538 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content56% 
IMG OID641279101 
Producthelicase domain-containing protein 
Protein accessionYP_001544728 
Protein GI159898481 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGT TTCGGGTTGG CTCGATTGTT CGCTGTCGGA ATCGTGAGTG GGTCGTGCAA 
CCATCAACCC ATCCTGATAT TTTGCAGCTG CGACCACTTG GTGGGATTGA TGGCGAAAGC
TGTGGGATTT ATCTGCCAAT CGAGGGCGAT ACGGTTACAA CCGCCCAGTT TCCGCCACCT
GATCCGGCCA TGGCTGGCGA TTTTGTGGCT GGGCAGTTGT TGCGGAACGC CGCCCGCCTG
AGTTTGCGCA ATGGGGCTGG GCCATTTCGG TCGTTGGGCC GCTTGGGTGC ACGCCCACGC
CCCTACCAGC TTGTGCCGTT GTTGATGGCG CTGCGCCTCG ACCCAGTTCG TTTGCTGATT
GCCGATGATG TTGGGGTTGG TAAAACGGTT GAGGCGGCGT TGATTGTGCG CGAATTGCTC
GATCGTGGCG AAATTAAGCG CTTTACGGTG CTGTGCCCGC CGCATCTCTG CGACCAGTGG
AGCCGTGAGT TGGAGCAGAA ATTTGGTATC GAGGCAACCG TCGTGCGTTC GAGCACTGCC
GCCGCGCTTG ATCGCACACT GCCCAACGCC GATATTAGTG TCTTCGAGTA TTATCCTTTT
ACGGTTGCTA GCATTGATTA CATTAAACAT GATCGGCATC GGGCGAACTT TTTGCGAGCT
TGCCCCGAAT TGGTGATTGT TGATGAGGTG CATACGGCGG CGCGGCCAAG CGGCCCTGGC
TCCGGTCAAC AACAACGCTA TGATCTGATT CGGGCGGTAG CCGACGATCA GGAACGGCAT
TTGTTGTTGT TGTCGGCCAC GCCTCACAGT GGAATTGAGG ATGCCTTTGT GTCGTTGTTG
GGTTTGCTGC GCCCCGAATT TGAAACCTAT TCGATGACCA ACCTTAGCTC CAAGCAGCGC
GATTTGCTGG CTCAACATTT TGTGCAGCGC CGCCGTGGCG ATGTTGTGCG TTGGTTGGAT
GAAGAAACGC CGTTTCCAAA GCGTGAACAT ATGTTGCGTG ATGCCACTGG CAATCGTGAT
ATTACCTACC GCCTCGCTCA ATCATCGGCC TATCGCAGTT TGTTTAATGA GGTGTTTGAG
TTTGCCCGCG AGTTGGTGCA CGAAAGCTTG CGCGAGGCTA ATGCACCACG GGTTGCGGCC
CGCACGCGGG TGCGCTATTG GGCGGCGCTC GCCTTGTTGC GCTGTGTGAT GAGCAGCCCC
GCCGCTGCTG AAAAAGCCTT GATGACTCGC GCTCGCAAAG ATCCAACCAC CCAAGCTGAT
GATCTGAGTT TAACGTTTGA TGATGAGCTA TTTAGTAGTG CGATTCACGA CCGCAGCACA
CCCGCCGCTG ATGATGATGT TGAGCCAACC CATGTGATCG AGGAAGGCAG CACTGCCAGC
GAGGCCGCCC GCTTGCAAAA ATTTGCTGAA AAGGCGCGAC AATTGCGCGG CGCAAACGAC
CCCAAGTTGC AAAGTATTGT GCCGATCGTG GCGCGTTTGC TGAAGGATGA TTTTCAGCCA
ATTATTTATT GTCGCTATAT TGCAACCGCT AAGTATGTGG CCGAGGAGTT GACTCGTGCC
CTGAAGCCTG GCAACGATAC GCGCATTTTG GCGGTAACTG GCGACCATAG CGAGGAAGAA
CGCGAAAGTT TAATTGCTGA ATTGAGCGCC AGCCCCCGCC GCCTGTTGGT GGCAACCGAT
TGTTTGAGCG AGGGCATCAA TTTGCAAGAG GCTTTCAATG CGGTGATTCA TTACGATTTG
CCGTGGAATC CCAATCGTTT GGAGCAACGG GAGGGGCGGG TTGATCGCTA TGGTCAGCCA
GCGCCAGTTG TGCGTATGGC TTTGGTGCGC GGCGAGGATA ACCCCATCGA CGAGGCGGTG
ATGCGGGTGT TGTTGCGCAA AGCAGTCACC ATCCATCGCA CGCTTGGCAT TAGCGTGCCC
TTGCCAGTCA GTAATGAAAC CGTCGTCGAT GCACTGATTG CAACCTTGTT TAAGCCGCCA
GCAACTCAGC TGACCATGTT TGAAGCAATC GATCCGGCGA CCTTTGCGGC GGCTGAACAA
ACCTTGGCCG AGGTCGAGTT GCAGTGGGCG CAGGCCGAAC ATCAAGCCGA GCAAAGCCGC
ACTCGTTTTG CTCAACATCG GATTCAGCCC CAAGAGGTGG CGCGTGAGCT TGAGGAAAGT
GATGCGGTGC TGGGCGATGC TGATACGGTG CGCTCATTTG TGCGTGCTAG TTGTGAACGA
ATTGGTGCGC CATTGGTGGC GCTGGGTGCG CACGCTCCCG ACCATTGGCG CGTGCCGATT
ATTCAATTGC CGCTGCCTGT GCGCGAACGG GTCGAGCCGT TGTTGCGTAA TAAGGCGACT
GAGTTAATCA TCACGTTCAC GGGGATTGCT GCTGAGGGCG TGCTGTTGGT TGGGCGTAAT
CATCCGCTGA CGATGGCCTT GGCTGATTAT GCCTTGGAAA CTGCCTTGAC TCCCGAGGAT
GGTTTGCCCG TTCCGGCGGC GCGATCAGGG GTGTTGCGTA CAAAAGCGGT CGAACGGCGC
ACCTTTTTGG CCTTGTTGCG GGTGCGGATG CTGATTGCGA CCAAACGCAA TGAGCCGCTG
TTGGCCGAGG AGTTGGTGGT GGCGGGGTTT GTGCGCGAGC CAGGCGGTTT TCGCAGCCTT
GAGCCAGCAG CGGCCCTTGA TCTGTTAACC AATGCCTTGC CTGATGCTAA TATTGCCCAC
GCTGATCGCG AACACCAATT GCATTCGGCG CTTGATCTGT TGCCGCAGCT CGCCGCCGAT
CTCACCAGTT TGGCCCATAC TCGTGCCGAA CGCTTGGCTG AATCGCACTC ACGGGTGCGC
ACGGCTACGC GCATGGCAGG CAAAGTTAGC GTCGAGCCAC ATTTGCCGCC CGATGTGTTG
GGTTTGTATG TGTTGTTGCC AGTGTAA
 
Protein sequence
MAQFRVGSIV RCRNREWVVQ PSTHPDILQL RPLGGIDGES CGIYLPIEGD TVTTAQFPPP 
DPAMAGDFVA GQLLRNAARL SLRNGAGPFR SLGRLGARPR PYQLVPLLMA LRLDPVRLLI
ADDVGVGKTV EAALIVRELL DRGEIKRFTV LCPPHLCDQW SRELEQKFGI EATVVRSSTA
AALDRTLPNA DISVFEYYPF TVASIDYIKH DRHRANFLRA CPELVIVDEV HTAARPSGPG
SGQQQRYDLI RAVADDQERH LLLLSATPHS GIEDAFVSLL GLLRPEFETY SMTNLSSKQR
DLLAQHFVQR RRGDVVRWLD EETPFPKREH MLRDATGNRD ITYRLAQSSA YRSLFNEVFE
FARELVHESL REANAPRVAA RTRVRYWAAL ALLRCVMSSP AAAEKALMTR ARKDPTTQAD
DLSLTFDDEL FSSAIHDRST PAADDDVEPT HVIEEGSTAS EAARLQKFAE KARQLRGAND
PKLQSIVPIV ARLLKDDFQP IIYCRYIATA KYVAEELTRA LKPGNDTRIL AVTGDHSEEE
RESLIAELSA SPRRLLVATD CLSEGINLQE AFNAVIHYDL PWNPNRLEQR EGRVDRYGQP
APVVRMALVR GEDNPIDEAV MRVLLRKAVT IHRTLGISVP LPVSNETVVD ALIATLFKPP
ATQLTMFEAI DPATFAAAEQ TLAEVELQWA QAEHQAEQSR TRFAQHRIQP QEVARELEES
DAVLGDADTV RSFVRASCER IGAPLVALGA HAPDHWRVPI IQLPLPVRER VEPLLRNKAT
ELIITFTGIA AEGVLLVGRN HPLTMALADY ALETALTPED GLPVPAARSG VLRTKAVERR
TFLALLRVRM LIATKRNEPL LAEELVVAGF VREPGGFRSL EPAAALDLLT NALPDANIAH
ADREHQLHSA LDLLPQLAAD LTSLAHTRAE RLAESHSRVR TATRMAGKVS VEPHLPPDVL
GLYVLLPV