Gene Information |
Locus tag | Haur_2231 |
Symbol | |
ID | 5734118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2837196 |
End bp | 2839532 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279372 |
Product | helicase superfamily protein |
Protein accession | YP_001544999 |
Protein GI | 159898752 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | [TIGR03158] CRISPR-associated helicase, Cyano-type |
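
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the CDS ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2837196, 2839532)
    print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the Gene Length (2337 bp) and Protein Length (778 aa) fields.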
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACAA TTCAATTAAC CGGCCATAAG GAAAAACTCG CGCCGCCGAA TACCATCCCA AACTATCGCG CAGCAGCACC ATTGCTCTAT CATCAAGTGC GCACGTACCA AGCACTTGAG CAAGCGCCCT TGGTAATGAA TACCTATGCA ACTGGCACGG GCAAAACCAC CGCTGCCTCA TTACGCTTGC TCCACCCTGA TCAACAAAAG CGCAATACAT TAATTATTGC GCCAACCAAC GCGCTGATTG AGCAGCATTG TGCCGATGCC CAAGCATTTA TCGATGACAA CCAGCTTGCC ATGCGCACCC TGCCCATAAA TGCTGCGACA ATCCGCGATA TCACCAGCGA AACTCGGCGG GGCGAAATTT TGCAGCGTTT AATTGCCAAC CCGCTAACCT TTGCTGAGGC ACTAGGGCTG CCTAACGATG CGGAAAAGCT ACCATTCGTG GCGGTCACCA ACCCCGATAT TTTCTACTTG GCGCTCTACT TTCGCTATGG CCCCCTCGAT CAACGTAATG TCTGGGAGAA ATTCATCCGT CAATTTCACT ATATTGTGAT TGATGAATTT CACTATTACG ATAACAAGCA ATTTAGCAAC TTCTTGTTCT TCTTTAATCT CTGGAAAGAA TGGGGCTACT TTGAGGCTGG CTATAAAATT TGCCTGCTCT CGGCAACTCC CCGTGACGCA GTTTATCGCT ACTTGAACCA AGTGTTTGGG GCTGATGGCT GGCAACGAGT CGGTCCCGAC AACGAACCTG AATCCAGCTC CGCTTTGCCG CCGACCCCAA CTTTAGCCCC ATTAACCATG CATATTCACA ATGACAATCT AGAGGAATGG CTGCAAACAC AAAGCCAGAT CGTCGCCGAT TGGCAGGCGA ACGCGCTTGA TAGCGTGATT ATTAGCAGTA GTCTTGGCAA AATCAATCGG ATTCACGCAA CATTACGTCA ACTCAATCCA ATTCGCATCA CTGGGCCAGA GCCACCCGAA CATCGACGTT TGGTGCGGCC AGTGATTTTG GCTACACCGA CGGTTGATAT TGGCTACAAC TTTGGTCGGC CCGGCAAGCA ACGTCAAAGT ATCGATCGGC TGATTTGCGA TGCAAAATTT GGCGACGAAC TAACCCAACG CATCGGGCGG GCTGGGCGGG TTTTGGGGCG CGAGCAAACC GATATCCCTG GCGAGGCCCA TATTTTTATT AGCGAAGATG CCTTTGGTGA ATTAAAACAG TGGGATACCC AAACGCTCAA TCGTGCTGAA TGGGCCAGCA TCATTCACCA ACTTGAACAT CTGCCAGCCA AACATCAACT TGAAGGCTAT ATTCGCCAAT ATGCCCTGCT CGAAACGTTT TATCCATTGC TCAAATTGCA ACAACTAACC CCCAAAGACG ACCCTGATCA AGAGGCGTTA TTTAATGCGA TGCGCGATAT TTTCGCTCCC AATAGCCAAC GCACAATTGG CAGTTTACGC CACTTCTATC GGGTTTATGA GGAGCGCGAA CAATGGCTCA GGCTCTCGGA AAGTGCCAAA TGGGCCGATC GAGGGGCAGT TGCCAAACAT TTTGCCGCCC AACTTTCATG GCTTGCCTCG ACCAAAGATC GGCAACAGGA AATCAAGCCA GAACAAGTAC GCGGTATTCT CGATCAACGC TTGATTGGCC ATAGCACACC GCAAAATGCT CTGATCGATT TTATCGAATC GCAAGTTGTG ATGACCAAAG CGCTCTTCAA CTTTCGCGAG GCTTGGCAAG GCCCAAAGGC CGCAATTTAC GACCCCAATC AGTTACTTTC AAGCCAAACA 
TTAACCTATT TTGATTTATT TCACGCTTTT TGTCATTTTG ATGTGACAAT CTATCCATCC AAAACCAAAT TTGAGCAAGA CGCTGGGTCA AGCGAATCAG CCGACGTGTA TTTGCGCATC AACCAACTCC GCCCAACCCC GCTGGGCCTC GCCTTCGAGC ATCCAAACAC CGATCAACTT GATCAAACGA CCTTCGATGA TCGCTATTGC AACACGATTA TTGGCATTAA AGGCCTATTG CTCAGCGCCT ACGAATACGG CTCACGAGCA ACCGTGCCAA TCCCAACTGA AATTCGCAAT ACCGTTAAAA GCAACGCCAT CCCTTGTCTG ATTGTTGATC AAGCAAGCAC CAATGCATTA ATTCGGGTAC TTCAGGGCAG CCCAATCTAT CGTCAAGTAT TAAAAGTCGA TTTTGGCGGG TTATTCGAGG AATATACGAT GGTAACCGGA ACCGCCGCCT TCCATGTGAT TCCTGAACTT AAACGCCACT TTTTGATGCG CCAAAAACGG GTCAACGATC AGCCAATATT TTTATAA
|
Protein sequence | MITIQLTGHK EKLAPPNTIP NYRAAAPLLY HQVRTYQALE QAPLVMNTYA TGTGKTTAAS LRLLHPDQQK RNTLIIAPTN ALIEQHCADA QAFIDDNQLA MRTLPINAAT IRDITSETRR GEILQRLIAN PLTFAEALGL PNDAEKLPFV AVTNPDIFYL ALYFRYGPLD QRNVWEKFIR QFHYIVIDEF HYYDNKQFSN FLFFFNLWKE WGYFEAGYKI CLLSATPRDA VYRYLNQVFG ADGWQRVGPD NEPESSSALP PTPTLAPLTM HIHNDNLEEW LQTQSQIVAD WQANALDSVI ISSSLGKINR IHATLRQLNP IRITGPEPPE HRRLVRPVIL ATPTVDIGYN FGRPGKQRQS IDRLICDAKF GDELTQRIGR AGRVLGREQT DIPGEAHIFI SEDAFGELKQ WDTQTLNRAE WASIIHQLEH LPAKHQLEGY IRQYALLETF YPLLKLQQLT PKDDPDQEAL FNAMRDIFAP NSQRTIGSLR HFYRVYEERE QWLRLSESAK WADRGAVAKH FAAQLSWLAS TKDRQQEIKP EQVRGILDQR LIGHSTPQNA LIDFIESQVV MTKALFNFRE AWQGPKAAIY DPNQLLSSQT LTYFDLFHAF CHFDVTIYPS KTKFEQDAGS SESADVYLRI NQLRPTPLGL AFEHPNTDQL DQTTFDDRYC NTIIGIKGLL LSAYEYGSRA TVPIPTEIRN TVKSNAIPCL IVDQASTNAL IRVLQGSPIY RQVLKVDFGG LFEEYTMVTG TAAFHVIPEL KRHFLMRQKR VNDQPIFL
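
The sequences above are displayed in space-separated blocks of ten characters and may wrap across lines. A minimal sketch of how such a field could be normalised and its GC content recomputed is shown below; the helper names are hypothetical, and rounding to a whole percent is an assumption about how the GC content field was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (whitespace ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demonstration.
    demo = "ATGATCACAA TTCAATTAAC"
    print(clean_sequence(demo), round(gc_percent(demo)))
```

Passing the full Gene sequence field to `gc_percent` and rounding the result would give a value comparable with the GC content field in the Gene Information table.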