Gene Information |
Locus tag | Haur_1957 |
Symbol | |
ID | 5733846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2380632 |
End bp | 2383538 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641279101 |
Product | helicase domain-containing protein |
Protein accession | YP_001544728 |
Protein GI | 159898481 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
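The coordinate and length fields above can be cross-checked against one another. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table for Haur_1957.
START_BP = 2380632
END_BP = 2383538
GENE_LENGTH_BP = 2907
PROTEIN_LENGTH_AA = 968


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

# Compare the computed values with the values reported in the record.
print(computed_gene == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 2383538 - 2380632 + 1 = 2907 bp, and 2907 / 3 - 1 = 968 aa.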
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAGT TTCGGGTTGG CTCGATTGTT CGCTGTCGGA ATCGTGAGTG GGTCGTGCAA CCATCAACCC ATCCTGATAT TTTGCAGCTG CGACCACTTG GTGGGATTGA TGGCGAAAGC TGTGGGATTT ATCTGCCAAT CGAGGGCGAT ACGGTTACAA CCGCCCAGTT TCCGCCACCT GATCCGGCCA TGGCTGGCGA TTTTGTGGCT GGGCAGTTGT TGCGGAACGC CGCCCGCCTG AGTTTGCGCA ATGGGGCTGG GCCATTTCGG TCGTTGGGCC GCTTGGGTGC ACGCCCACGC CCCTACCAGC TTGTGCCGTT GTTGATGGCG CTGCGCCTCG ACCCAGTTCG TTTGCTGATT GCCGATGATG TTGGGGTTGG TAAAACGGTT GAGGCGGCGT TGATTGTGCG CGAATTGCTC GATCGTGGCG AAATTAAGCG CTTTACGGTG CTGTGCCCGC CGCATCTCTG CGACCAGTGG AGCCGTGAGT TGGAGCAGAA ATTTGGTATC GAGGCAACCG TCGTGCGTTC GAGCACTGCC GCCGCGCTTG ATCGCACACT GCCCAACGCC GATATTAGTG TCTTCGAGTA TTATCCTTTT ACGGTTGCTA GCATTGATTA CATTAAACAT GATCGGCATC GGGCGAACTT TTTGCGAGCT TGCCCCGAAT TGGTGATTGT TGATGAGGTG CATACGGCGG CGCGGCCAAG CGGCCCTGGC TCCGGTCAAC AACAACGCTA TGATCTGATT CGGGCGGTAG CCGACGATCA GGAACGGCAT TTGTTGTTGT TGTCGGCCAC GCCTCACAGT GGAATTGAGG ATGCCTTTGT GTCGTTGTTG GGTTTGCTGC GCCCCGAATT TGAAACCTAT TCGATGACCA ACCTTAGCTC CAAGCAGCGC GATTTGCTGG CTCAACATTT TGTGCAGCGC CGCCGTGGCG ATGTTGTGCG TTGGTTGGAT GAAGAAACGC CGTTTCCAAA GCGTGAACAT ATGTTGCGTG ATGCCACTGG CAATCGTGAT ATTACCTACC GCCTCGCTCA ATCATCGGCC TATCGCAGTT TGTTTAATGA GGTGTTTGAG TTTGCCCGCG AGTTGGTGCA CGAAAGCTTG CGCGAGGCTA ATGCACCACG GGTTGCGGCC CGCACGCGGG TGCGCTATTG GGCGGCGCTC GCCTTGTTGC GCTGTGTGAT GAGCAGCCCC GCCGCTGCTG AAAAAGCCTT GATGACTCGC GCTCGCAAAG ATCCAACCAC CCAAGCTGAT GATCTGAGTT TAACGTTTGA TGATGAGCTA TTTAGTAGTG CGATTCACGA CCGCAGCACA CCCGCCGCTG ATGATGATGT TGAGCCAACC CATGTGATCG AGGAAGGCAG CACTGCCAGC GAGGCCGCCC GCTTGCAAAA ATTTGCTGAA AAGGCGCGAC AATTGCGCGG CGCAAACGAC CCCAAGTTGC AAAGTATTGT GCCGATCGTG GCGCGTTTGC TGAAGGATGA TTTTCAGCCA ATTATTTATT GTCGCTATAT TGCAACCGCT AAGTATGTGG CCGAGGAGTT GACTCGTGCC CTGAAGCCTG GCAACGATAC GCGCATTTTG GCGGTAACTG GCGACCATAG CGAGGAAGAA CGCGAAAGTT TAATTGCTGA ATTGAGCGCC AGCCCCCGCC GCCTGTTGGT GGCAACCGAT TGTTTGAGCG AGGGCATCAA TTTGCAAGAG GCTTTCAATG CGGTGATTCA TTACGATTTG CCGTGGAATC CCAATCGTTT GGAGCAACGG GAGGGGCGGG TTGATCGCTA TGGTCAGCCA 
GCGCCAGTTG TGCGTATGGC TTTGGTGCGC GGCGAGGATA ACCCCATCGA CGAGGCGGTG ATGCGGGTGT TGTTGCGCAA AGCAGTCACC ATCCATCGCA CGCTTGGCAT TAGCGTGCCC TTGCCAGTCA GTAATGAAAC CGTCGTCGAT GCACTGATTG CAACCTTGTT TAAGCCGCCA GCAACTCAGC TGACCATGTT TGAAGCAATC GATCCGGCGA CCTTTGCGGC GGCTGAACAA ACCTTGGCCG AGGTCGAGTT GCAGTGGGCG CAGGCCGAAC ATCAAGCCGA GCAAAGCCGC ACTCGTTTTG CTCAACATCG GATTCAGCCC CAAGAGGTGG CGCGTGAGCT TGAGGAAAGT GATGCGGTGC TGGGCGATGC TGATACGGTG CGCTCATTTG TGCGTGCTAG TTGTGAACGA ATTGGTGCGC CATTGGTGGC GCTGGGTGCG CACGCTCCCG ACCATTGGCG CGTGCCGATT ATTCAATTGC CGCTGCCTGT GCGCGAACGG GTCGAGCCGT TGTTGCGTAA TAAGGCGACT GAGTTAATCA TCACGTTCAC GGGGATTGCT GCTGAGGGCG TGCTGTTGGT TGGGCGTAAT CATCCGCTGA CGATGGCCTT GGCTGATTAT GCCTTGGAAA CTGCCTTGAC TCCCGAGGAT GGTTTGCCCG TTCCGGCGGC GCGATCAGGG GTGTTGCGTA CAAAAGCGGT CGAACGGCGC ACCTTTTTGG CCTTGTTGCG GGTGCGGATG CTGATTGCGA CCAAACGCAA TGAGCCGCTG TTGGCCGAGG AGTTGGTGGT GGCGGGGTTT GTGCGCGAGC CAGGCGGTTT TCGCAGCCTT GAGCCAGCAG CGGCCCTTGA TCTGTTAACC AATGCCTTGC CTGATGCTAA TATTGCCCAC GCTGATCGCG AACACCAATT GCATTCGGCG CTTGATCTGT TGCCGCAGCT CGCCGCCGAT CTCACCAGTT TGGCCCATAC TCGTGCCGAA CGCTTGGCTG AATCGCACTC ACGGGTGCGC ACGGCTACGC GCATGGCAGG CAAAGTTAGC GTCGAGCCAC ATTTGCCGCC CGATGTGTTG GGTTTGTATG TGTTGTTGCC AGTGTAA
|
Protein sequence | MAQFRVGSIV RCRNREWVVQ PSTHPDILQL RPLGGIDGES CGIYLPIEGD TVTTAQFPPP DPAMAGDFVA GQLLRNAARL SLRNGAGPFR SLGRLGARPR PYQLVPLLMA LRLDPVRLLI ADDVGVGKTV EAALIVRELL DRGEIKRFTV LCPPHLCDQW SRELEQKFGI EATVVRSSTA AALDRTLPNA DISVFEYYPF TVASIDYIKH DRHRANFLRA CPELVIVDEV HTAARPSGPG SGQQQRYDLI RAVADDQERH LLLLSATPHS GIEDAFVSLL GLLRPEFETY SMTNLSSKQR DLLAQHFVQR RRGDVVRWLD EETPFPKREH MLRDATGNRD ITYRLAQSSA YRSLFNEVFE FARELVHESL REANAPRVAA RTRVRYWAAL ALLRCVMSSP AAAEKALMTR ARKDPTTQAD DLSLTFDDEL FSSAIHDRST PAADDDVEPT HVIEEGSTAS EAARLQKFAE KARQLRGAND PKLQSIVPIV ARLLKDDFQP IIYCRYIATA KYVAEELTRA LKPGNDTRIL AVTGDHSEEE RESLIAELSA SPRRLLVATD CLSEGINLQE AFNAVIHYDL PWNPNRLEQR EGRVDRYGQP APVVRMALVR GEDNPIDEAV MRVLLRKAVT IHRTLGISVP LPVSNETVVD ALIATLFKPP ATQLTMFEAI DPATFAAAEQ TLAEVELQWA QAEHQAEQSR TRFAQHRIQP QEVARELEES DAVLGDADTV RSFVRASCER IGAPLVALGA HAPDHWRVPI IQLPLPVRER VEPLLRNKAT ELIITFTGIA AEGVLLVGRN HPLTMALADY ALETALTPED GLPVPAARSG VLRTKAVERR TFLALLRVRM LIATKRNEPL LAEELVVAGF VREPGGFRSL EPAAALDLLT NALPDANIAH ADREHQLHSA LDLLPQLAAD LTSLAHTRAE RLAESHSRVR TATRMAGKVS VEPHLPPDVL GLYVLLPV