Gene Information |
Locus tag | Haur_2540 |
Symbol | |
ID | 5734418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3257625 |
End bp | 3262463 |
Gene Length | 4839 bp |
Protein Length | 1612 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641279680 |
Product | hypothetical protein |
Protein accession | YP_001545306 |
Protein GI | 159899059 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.316751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTTG ATTTCGATTC CAATGATATT CAATTGAACA AAGGCGATAT TCGTCAATTG CAAAATGCCG ATGCGTTGGC GGGCTTTTTG GCGCGTTTGC AGTATGCGAT CGATGTGCGC ACAACGATCG ATTCGTATGC TCAGGTTGGC CTCGATAGCG CCTATTTGCG CCAAGAAATT AAGCATCTGG AGTTGTTGGC AACCGATCCC CACGATCAAG AAATAAAAAT TTATTTGTTA GAAGTGCGCT CACTCACCGC TGCTTTACGC AATGCTATCG CCCGCGCGTT TCGCGATCGG CCTGAGTTGG CGTTGGTTAT TTTGACCAAA GATTATGAGA GCTTTGAGTT TGTGATCTTG CTGCGCGAGT TGGCGCAAAA TACCCAGCGT GGCGCGGCGA TTCGCCAAAT GTTGCGGCCT GTGCCACTCA CGATCAATCG TTTGCATGTT TCGGATGTAG CGTTGCGCGT GCTCAAACGC TTTACCATGA CCGAACCCGA TGCGCTGTAT CAGTGGGATA AGCTGCGCTC GGCCTTTGTG TTGGCCGAGT GGAGCGAGCA ATATTTTAAT AATCGGGCGC TGTTCTCGGA TTATTATTTG AAACATCGTT TGACTGATGC CAAATTAAAT GCTGAATGGA ATGAAGATGT GTTGCCAATT GGCCGCAAAA TCAACCCGCT GCTCAAAACT GCCCGTCAAC AATTAACTGA TCAGCCTCTT GCGACGATGC AAAAGCAATT TTATGCGCCA ATCTTGGCGG ATTTGGGCTT TGTGCTGCAA GCGGTCGCGG GTGAGCCAGC CTATCGCTAC GATTTGGCCT TGCCCAATAA TCCCCAACCA GTCGCTGCTT TTTTTGGCTA TGTTTGGAAT CGTAATTTAG ACGATCAAGA TAGCCAGCGC GATCCCGCTA CGCCGCTGAT CATTCCTGGG GCCGAGGTGG TTTCGACGCT TGAGGCTGGC ACGGTGCCAT GGGTGATTGT GAGCAATGGC AAATATTGGC GCTTATATTC AACCACTGCC AGCAATAAGG CTACCAACTA CTACGAGGTT GATTTAGAAG AGGCTTTCGG GGCGCAGGAT GCGCTGGTTG CACTCAAATA TTGGTGGTTG TTTTTTCGTG CTCAGGCGTT CGGCGGATTT TTGGAGCGCT TGCTCAAACA ATCGGCGGAG TATGCCAAGG GCTTGGGCGA ACGCTTGAAA GATCGGGTGT TTACTGAGAT TTTTCCCCAA TTTGCCCAAG GTTTTATTGC CGACATGCGC CGCCAAGGTG CTAACGACGT TTCTGAGCAG CAACTCAACC CGGTTTTTGA AGGTACGTTG ACCTTTTTGT ATCGATTGAT GTTTGTGTTG TATGCCGAAA GCCTGGATTT GCTGCCGCTG AATGAACATA ACGGCTATCG CGAACGCAGC TTGTACACGC TCAAACGTGA GATCGCTGAG ATTGCTGGTA GCTTGCTCGA TCAGCGGGCG GCCAATTTGC AAGCCCACTA CAGCGCCACT TCAACCGCGC TTTATCAGCG CATCCTCGAT TTGTGTGCGG TGATCGATCG TGGCTCGCCC GATTTGAATA TGCCAACCTA CAACGGTGGT TTGTTTAGCG CAACTAGCAC GAGCGGCCAG TTTTTGCAAC GCTACGCCGT ACCCGATCGC TACTTGGCCT TGGGCTTGGA TCGGCTTGCC CGCGATCTTG ATGATCGGAG CCAAGCATTA GTATTGATTG ATTTTAAGTC GTTGGGCGTG CGCCAGCTTG GGAGCATTTA CGAAGGTTTG CTGGAATTTA AGTTGCACAT CGCCAGCGAA 
TGCTTGGCCG TGACTAAGGA GAAAGGCAAA GAGGTTTATC AGCCAGCCGC TAAAGTTGCC AAACCACTGG CAATTATCGA GCGGGGCATG GCCTATTTGG TCAACGATAA AAAGGAGCGC AAGGCCACGG GCAGCTATTA CACGCCCGAT TATATTGTGA AATATATTGT GCAGCAGACG GTTGGCGCGG TGCTTGATCA GAAATTTAAG GCCTTGGCTC CGCGTTTGCA CGAGGCCCAG AAACAATATC GCAATTATGC TAATTTGGTC GCTGCTCGGG CCAAATCGAG CAAGCGCCCC GAAAATCCGG CGGTGTTTTG GACTGACCCG AATGGTGCCA TGGGCCAGTT GCTCGACGAT TGTTTGAATC TGCGGGTGCT TGACCCAGCT ATGGGCAGCG GCCATTTTTT GGTTGAAGTC GTCGATTTTA TTAGCAATCG CTTGATTCAC TTTTTGAATG CCTGGAGCGA AAACCCCGTT TGGGCGATGA TCGACCGCAC CCGCAGCGAA ATTGTGGCCG ATATGGAGCG CCAAGGGGTG ACGATTGACC CCGAACGGCT TACGCGGGTG GCGCTGCTCA AACGGGCGGT GCTCAAACGC TGCATCTACG GGGTTGACCT GAATGCGATG GCGGTGGAGT TGGCTAAGGT CAGTTTGTGG CTCGATGCCT TTACCCTTGG CGCTCCCTTG AGTTTTCTCG ACCATCACTT GAAGCATGGC AATAGCCTGA TTGGGGCGCG GGTTGCCGAG GTGCAAACCT ACCTTGATGT TGGTGGCCGC CAAAGCCATA TGTTGGCGGG CAACGAGTTT GCAGGCCTGA GCCTGGCCAC CGATTTGATG CGCCAAGTCA GTTTTTTGAG CGATAACACG GTTGAACAAG CGCAACAAAG CGCGGTGGCT TTTCGCGATG CCGACCAGCA TCTCGCGCCC TTCAAGCGTA TGCTCGATGT TTATACCTCG CGTTGGTTTG GCAATCAGCC CGCCAAAAAA AGCAAAAGCG ATTGGGTCAA CATCTTTTTG CGCATGCCCA GCATCAAGCC TTGGTTGCAC GATTCCACAG TTAAATTAGA TGATAGCTTG ATTGAGGCTA CCAAGATTGG CCAGATTGCG CTTGAAGCCG CTAGCAGCAA GCACTTTTTC CATTGGGAGC TTGAGTTTCC CGAAATCTTT TTTGCGCCCA GCACGCCTGG CGGTCAAGAT GTGCAGCTCA ATCCCAACGG TGGCTTCGAT GCGGTGGTGG GTAATCCGCC GTATATTAGA ATTCAATTTC TAGATAAAAG TGATGTTGAT TATTTTAATA ATATATATCT CTCTCCTAAT GGTTCATATG ATATATATAT ATTATTCATT GAAAAAAGTA TTGAATTATT AAATATTAAC GGAATTAGTG GATATATATG TCCTAATAAG TTTATGACAA ATGCATATGG GGATAAAATT AGGAATATAA TTGGAGAAAG TAGAAATCTT TTTCGTTTAG TTGATTTTGG CGATTATCAA TTATTTGAAG GAGCTACTAC GTATTGCTGT TTAGTATTTT TATGTAAAAA TAATAATCGT ACATTAGATA TTCCTGTTAT TTCTGTAAAA GATTACATTA ATATAGAAAA TAATAAAGTT ATTAAATTTA ATATTGATAA TTTGATTAAT AATAAATGGT ATTTTGGTGT AAATAATGAA TTAAATGAAA AAATTAAAAA AGTGGCAGGT AGAGATTTGG GAGAGATTGC TCATCCTCAT TATTGTCTTT TTACGGGATT AAATGATGCT TTTGTTGTGA GTGAAAGTGA TATATTTAAT TATGAACTTG 
AGAGAGATAT ATTAAAACCT TTATTAAGAG GACAAGATGT GAAAAGATGG GAGGTTTTGC ATGAAAAACT ATATATTATC TATCCATACG AATTATATAA TAATAAAATG TCTGTAATTG ATATAAATAA ATACCCAAAT GTTAAAAATT ATTTATTTAA ATTTAAAGAT CTATTAATAA ATAGAGTGAA ATTTATTCAA AAAGTAAAAA ATATACACGA ACGAGAATCT AGATGGTATG AATATATTGA TCCAAGATCA ACTTATCAAT TTGATCCATT GAAGATAATT ACTCCTGAGA TTGCCGGATA TGGATCATTT GTTGTTGATA CCGAAAGTTA TTACTGTCTT AATAAAGTAT ATGTTATAAA CCTTGAAAAT GTTGTCGAAA ACCCCTATTA TATTACATCT ATCTTAAATT CATCTATTTC TTTCTATATG TTCAAAGATA TATCTACAAA GCTTGCAAAT GGTTATTATG AATATATAAC TCAATATTTG AAACAAATAC CCATACCACG CATCGACTTC ACGACCGAGC CAAGCGAACG GGAGCAATTA ACGCAGATCG CTATTGATTC GTATACGCAA AAAAATGATA ATAATGTATT GACGCTCGTC AATACATTAT TCACCGCTGA GAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CGCTATGGCG ATGTTGTCCA CGATCTATTA GCCTACCTCG CCCAACAAAT GATCGAGCTG AACAAAGCCA AGCAACAAGC CAGCAAGCGC TTCTTGAATT GGCTCGAAAG CCAACTACGG ATTCAGCCCA AAAAAGGCGA AACTGGCCTC GACAGCCTGA CGGGCAAAAC GATCATCCAA GGCTACCTTG GCGACTACCA AAAAGGCCAA CGCGCCGTCA GTTGGAGCGA CTTCTGGTAT CGGCTGCAAC AAAACCGCAA TCGTTTTGCC GCCAATCTCA GCGAAATCGA AGCCAGCATT GCCCAAGCCT ACCGCCAATC GCTCGATGA
|
Protein sequence | MTLDFDSNDI QLNKGDIRQL QNADALAGFL ARLQYAIDVR TTIDSYAQVG LDSAYLRQEI KHLELLATDP HDQEIKIYLL EVRSLTAALR NAIARAFRDR PELALVILTK DYESFEFVIL LRELAQNTQR GAAIRQMLRP VPLTINRLHV SDVALRVLKR FTMTEPDALY QWDKLRSAFV LAEWSEQYFN NRALFSDYYL KHRLTDAKLN AEWNEDVLPI GRKINPLLKT ARQQLTDQPL ATMQKQFYAP ILADLGFVLQ AVAGEPAYRY DLALPNNPQP VAAFFGYVWN RNLDDQDSQR DPATPLIIPG AEVVSTLEAG TVPWVIVSNG KYWRLYSTTA SNKATNYYEV DLEEAFGAQD ALVALKYWWL FFRAQAFGGF LERLLKQSAE YAKGLGERLK DRVFTEIFPQ FAQGFIADMR RQGANDVSEQ QLNPVFEGTL TFLYRLMFVL YAESLDLLPL NEHNGYRERS LYTLKREIAE IAGSLLDQRA ANLQAHYSAT STALYQRILD LCAVIDRGSP DLNMPTYNGG LFSATSTSGQ FLQRYAVPDR YLALGLDRLA RDLDDRSQAL VLIDFKSLGV RQLGSIYEGL LEFKLHIASE CLAVTKEKGK EVYQPAAKVA KPLAIIERGM AYLVNDKKER KATGSYYTPD YIVKYIVQQT VGAVLDQKFK ALAPRLHEAQ KQYRNYANLV AARAKSSKRP ENPAVFWTDP NGAMGQLLDD CLNLRVLDPA MGSGHFLVEV VDFISNRLIH FLNAWSENPV WAMIDRTRSE IVADMERQGV TIDPERLTRV ALLKRAVLKR CIYGVDLNAM AVELAKVSLW LDAFTLGAPL SFLDHHLKHG NSLIGARVAE VQTYLDVGGR QSHMLAGNEF AGLSLATDLM RQVSFLSDNT VEQAQQSAVA FRDADQHLAP FKRMLDVYTS RWFGNQPAKK SKSDWVNIFL RMPSIKPWLH DSTVKLDDSL IEATKIGQIA LEAASSKHFF HWELEFPEIF FAPSTPGGQD VQLNPNGGFD AVVGNPPYIR IQFLDKSDVD YFNNIYLSPN GSYDIYILFI EKSIELLNIN GISGYICPNK FMTNAYGDKI RNIIGESRNL FRLVDFGDYQ LFEGATTYCC LVFLCKNNNR TLDIPVISVK DYINIENNKV IKFNIDNLIN NKWYFGVNNE LNEKIKKVAG RDLGEIAHPH YCLFTGLNDA FVVSESDIFN YELERDILKP LLRGQDVKRW EVLHEKLYII YPYELYNNKM SVIDINKYPN VKNYLFKFKD LLINRVKFIQ KVKNIHERES RWYEYIDPRS TYQFDPLKII TPEIAGYGSF VVDTESYYCL NKVYVINLEN VVENPYYITS ILNSSISFYM FKDISTKLAN GYYEYITQYL KQIPIPRIDF TTEPSEREQL TQIAIDSYTQ KNDNNVLTLV NTLFTAENPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPLWRCCP RSISLPRPTN DRAEQSQATS QQALLELARK PTTDSAQKRR NWPRQPDGQN DHPRLPWRLP KRPTRRQLER LLVSAATKPQ SFCRQSQRNR SQHCPSLPPI AR
|
| |
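The coordinate and length fields in this record agree with one another, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are inferred from the values above, not stated by the record. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Haur_2540.
start_bp, end_bp = 3257625, 3262463

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 4839, matching "Gene Length"
print(protein_length(length_bp))  # 1612, matching "Protein Length"
```

Because the gene lies on the minus strand, the listed gene sequence corresponds to the reverse complement of the replicon between these coordinates. The length calculation is unaffected by strand.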