Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4537 |
Symbol | |
ID | 5736388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5806414 |
End bp | 5809611 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641281699 |
Product | hypothetical protein |
Protein accession | YP_001547296 |
Protein GI | 159901049 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.382016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATAT CGTCTGCCTC AACCCCAATT GGCGACGTGT TGCAGCAAAT TGCCCAACGT TTCGCCCAAA GCAGCCGCTC GCCTTTGTTG AGGAGTGGTG AATTAGCCTT ACCCGATGCT AGCACCCAAC CGCTGCCCTT GGCTCCGGAG TTAGCGGTAG CTTGGCGGGC CTTGCTCGGC GAAACTGGTT GGCCTTGGCA AGCTGAAGCG TTGGCGACCG TGCGGCGTGG CTTGGGCTTG GCCTTGGTTG CGCCAGCCCC ACTTGGCCCA GCATGTTTGT TGCTGTTGGC TGCCGAGCAT GTGAGCACCA ATCAAGGTAG TTTGTTGCTG TTGGCTCCCG ATGCTGCTAG CTTGCACGAT TTGGCGCAAA CAGCGAATAA TATTGATGCG CTGCTTGGTG GCGGCTTTCC CCATTTGGTC GTTGAGCAAA GTACGCGCCC ACCTCACAGT CCACCGCGTT TGATTTTGAC TACTCCTACA ATTTTGCATC AACGCATGCT GCGCTCGCAC CATCGTGGAT GGTCGAATAT CTGGCCGCAT CTGAATGGGG TGGTGTTGCC AGCTTTCGAT CAAGCCAGTA GTACGATTTT TGGCCATTGC CGTTGGTTGA TGCGGCGGAT CGAACGCTTG CGCCCGCGTG CCCGACCATT GACGCTCTAT GCCAGTTTAG CGCCCGTTGC CGAGATTGAT GAATTGCTGG CGCGAGTGTT TGACCACCTG CCGCCGCTGG TTTATGCCAA TACCGCCCGC ATTCCCTTGA CTTGGGCTTT GTGGAATGGT GGCACGCAGC CAGTTGATGC AGCTTTGAAA TTGGCCTTGG CCTTGCGCCA AGCAGGTTTA AGCGTGCAGC TTGATGCGCC CGATAGGCTA GAGCGAGCAA TGTTGGCCCA ACGTGGCGCG GCTCAAGGCC TGAGTTTAGT GCCACGGGCA GCGGCTCCGG CCCATGTGTT GGTGATGTTG GGTGGCGTGA ATGCTACCAG CCTGCCAAGT TTACTCGCCA GCGGCCATCG CGCCGTGGTG TTGGTGACCG ATCAATCGAT TGCTGCCCAA ACCGCCTTGG CTCAACCAGC CTTGCTCACC CCAACTGTGC CGCCAGCCTT GCCCGTAGCC ACCCAAAACA ACTACATCAG CAGTGGGCAT TTGCGCTGCG CCGCCGAAGA ACGGCCCTTG CAACAAAGCG AAATTAGCGC GTGGGAAGTG AGCGATCTGG TTGAGCGTTT GACCCAGCGC AATCAGCTAG CCCAGTTGCC CGATAGCCCA ACTTGGCAAC CAGTCGCCAA TTTGCAGCAG CGCAGCGATA TTTATGCCAC GCTGCATCCA ACCACAATTA GTGATATACC AGTACAGATC GTCGATCACG AAGGCACCTT CTTGGCCGAA CTTGATTCGG TGACGGTTGA GCGGCGTTTA TTTAGCGGCG CTAGCGTGCT CGGCGGGCGA GTAATTGGCT GGAATGATGA TGGTTCATTG GGTTTGCGTT TGCAGGATGT TGCGCCAACC TTGGCTGAAC ATCGCTGTAG TGTTGCGGTG CGCGAGCAAT TTGGCCAACG CCCGCTTGAT GGAGCACGCG CCGAGATCGA GTTGATGATT GGCCGTGTGG TGGCAACCGA AGAAATTGTG GCACGGCGCA GCCTGGCTGA TGATGGCAGC ATTCGGCGTG TGCCGTTTGA GCCGCCGATT CAGCTGCAGT GGAATGCCCC AGCGCTGTGG TTGGCTGCGC CAGAAGCTGG GGCTGGCTTG GGCGAAATCC TGCTTGGCGT GTTGCCATTA TTGCTGCACT GCCAGCCTGA TGCGATGGTG GCGTGTGTCA GCGAACAACA TTTGTATTTG GTCGAAGCTC AACCAGGCGG GCGCGGCATA GTTGAGCAGT TGTATAGCCA ATTTGAGGCT TGGCTACATT TGGCTGGCTT GGCGGCCCGC ACTCTGAGCA AAGACCCGTT GTATGCTAGT TATGCTCAGG CCGAATTGCG TTGGCTTGAA AAGATTTTGG TACCACTCGC TGCGCCGTTA CGCGCCGATA TGCCACCAGA GCCAGCCCAA GTTGCACCGC CACGGGTCGA GCGAGCGAGC CGTCAATCGA TGGTAATTAG CACCAACGAT TTGAATGCCC GCCGCCGTGG ACGTGGCAAT GTGTTTGCGC TGCCGCGTTC GCTACCCAAG CAAGGCGAAG CGCTCAAGCG TAGCCAAACC AACCCTGTGC CAAACCAACC AGCGTTGCCT GCTCAGCCCA TGCGTTTGGC CGCCCAACAA CCGCCAGCCA ACAAGCCGCT GACCAACCAA CGTCCAGCGG TGCGCAACGA AGCCCCGCCA GCGCCGCCTA GCCCTGAAAA AGCCAAGGCC AACCTGACCC GACCGTCACG GCGCAAGGCT AACGTTGGGC GCAATGAGGC GCAACGCTCA ACCCAGCCCT TAGTGCAGCC CAAAACGCCA GCGCCAAGCC AGCCTTTACC GCCTGAACGC GGTTCGGTGG TGATGCCGGT GGCCAACGAG CCGCCGCCCT ACGAACGACC ACCGTTTCAA CAGCGCAACC CTCCAGAAAA ACCAGCGGCT CAGCGTCCAA GCTCGCGGCC TGTCCAGCGC GAAAATCAGC CGCAGCAGCG TCCGATTCAG CGTGAGCAAC AACCGACCCG ACCGTATCAG CGCAACGATC AGCCGACCAA ACCGATGCCG CGCGAAAGCC AGTCGCAGCA GCGTCCGATT CAGCGCGAGC AACAACCAGC GCGACCGTAT CAGCGCAACG ATCAGCCGAC CAAACCGATG CCGCGCGAAA TTCAGCCGCC ACAGCGCCCG CTTCAACCTG AGCAACGGCC AGTACAGCCA AACTTGGCCG AGCCAGTGCG GCCATATCAA CGTAATGAGC AACCAGCCAA GCCCGTGGAA GCCACTGCTG ATCCACAGAC AATGCTTGAA AAAGCGCGGC GTTTACGTGA GCAACGCGAA GCCGAAGCCC GCGTAGCGCA GCCAATCACG CGGCCAAGCA CCAATCAAGC CGCCGAGCCA AGCGAATCGC GCTTCAAACA AGGCGATCGG GTGCATTGCG TGCCCTACGG CGAGGGCGTG GTGCAAAAAA CGCGCATCCG CGATGGCCGT GAGCTATTGC TGGTACAATT TCCAGAGCTA GGTGATCTAC GGGTTGATCC AGCAGTCAAT GCTGTACGCA TCCTACGCCC TGAGATTCAA GCCGAAGACG ACGAATAA
|
Protein sequence | MSISSASTPI GDVLQQIAQR FAQSSRSPLL RSGELALPDA STQPLPLAPE LAVAWRALLG ETGWPWQAEA LATVRRGLGL ALVAPAPLGP ACLLLLAAEH VSTNQGSLLL LAPDAASLHD LAQTANNIDA LLGGGFPHLV VEQSTRPPHS PPRLILTTPT ILHQRMLRSH HRGWSNIWPH LNGVVLPAFD QASSTIFGHC RWLMRRIERL RPRARPLTLY ASLAPVAEID ELLARVFDHL PPLVYANTAR IPLTWALWNG GTQPVDAALK LALALRQAGL SVQLDAPDRL ERAMLAQRGA AQGLSLVPRA AAPAHVLVML GGVNATSLPS LLASGHRAVV LVTDQSIAAQ TALAQPALLT PTVPPALPVA TQNNYISSGH LRCAAEERPL QQSEISAWEV SDLVERLTQR NQLAQLPDSP TWQPVANLQQ RSDIYATLHP TTISDIPVQI VDHEGTFLAE LDSVTVERRL FSGASVLGGR VIGWNDDGSL GLRLQDVAPT LAEHRCSVAV REQFGQRPLD GARAEIELMI GRVVATEEIV ARRSLADDGS IRRVPFEPPI QLQWNAPALW LAAPEAGAGL GEILLGVLPL LLHCQPDAMV ACVSEQHLYL VEAQPGGRGI VEQLYSQFEA WLHLAGLAAR TLSKDPLYAS YAQAELRWLE KILVPLAAPL RADMPPEPAQ VAPPRVERAS RQSMVISTND LNARRRGRGN VFALPRSLPK QGEALKRSQT NPVPNQPALP AQPMRLAAQQ PPANKPLTNQ RPAVRNEAPP APPSPEKAKA NLTRPSRRKA NVGRNEAQRS TQPLVQPKTP APSQPLPPER GSVVMPVANE PPPYERPPFQ QRNPPEKPAA QRPSSRPVQR ENQPQQRPIQ REQQPTRPYQ RNDQPTKPMP RESQSQQRPI QREQQPARPY QRNDQPTKPM PREIQPPQRP LQPEQRPVQP NLAEPVRPYQ RNEQPAKPVE ATADPQTMLE KARRLREQRE AEARVAQPIT RPSTNQAAEP SESRFKQGDR VHCVPYGEGV VQKTRIRDGR ELLLVQFPEL GDLRVDPAVN AVRILRPEIQ AEDDE
|
| |