Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4643 |
Symbol | |
ID | 5736490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5932500 |
End bp | 5935553 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281807 |
Product | hypothetical protein |
Protein accession | YP_001547402 |
Protein GI | 159901155 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.162315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACAAC GTTGGTTAAT TGGCTGTTTG GTTGTGTCGC TGATGTTGAT TCGACAGCCC GTGATTGCCC AAATTACCAC GGCGACACTC ACGCTACAAC GCCACGATCA GCAACTGCAT ATCAACGTGC AATTACCCAA ACCAACCCTG CAACCCAATA GCATCAACAT TGCAGGCTGG CAAAATGATG CCACACCTGA TCAGCCGGCC TTGCCGCGTT CAAGCCACTG GTTAGTTGTG CCAGCAGGCT ACCAACTCAG ACTAAAATCG GTTAATCCTC AACAACTACA ACACTATCAG CAACAACTTA GCCTCACTCC TAGCAGTGGT TGGCAGGTTG ATCCACTCGC GCCAAGCAAG GCCATAGCGC TAAGCGCACC AAGTGTAGCC GTCAAACAAG CCCAATATCC AACCACATGG GCCAACCTTG GTCAGAGCGT CCAAGTGCGG GAGCAACAAC TTGTGCCATT AACTATTTTC GGGGCGCAAT GGCAACCAAG CAAACAGCAA ATAGTTGTAC CAAGCTCAAT CGACATAGCG CTAGAATTTG TGGCAAGTAC CGAGCAACCC AGCTTGCGAG CTGATCCATT TTGGAACGAA CTGCTACGTC AGCAAGTGCT CAATCCCAGC GATCTACAAA ATCCAGCATT ACGCCCAGCC TTTGCCACAA CCACGCCAGT AACCAATGGA GTGCGAGTGA GTTTTGCCAA CCCAGGCATC AGCGAAATTC GTTGGAGCGA TTTGCAGGCG GCGGGCGTGC CAAGCCAATG GCTCAATCAA TCGGCTAATT TACAACTATG GCAAGGGCGC AATCAACTGC CACGGTTGCT GACTGCCACG GGCATGATTT TTTATCTACC GCCCTACAAT CGTGATCAAA GCCTGCAAGG GAGCGTGATT GTGCGCTGGA ATGGACAGCA ACCAGGCAAT GTGTTGGTTA GCGAATCAGT CAATTCGGCC AACCCAAGCC TGAGCTACTA TAGCGAGACC TTGCGCTTGG AAGAACAAAA ACTCTATCTG AGCGCCTTTC CGGCCAGCGG CACAAATCGT TGGTGGTGGC AATATTGGTA TAGCCCAGGC TCCGGCCAAA GCGCTCAGCC CTTGCAAATT AATTGGAATT TGGATAATGC AACTCGCTTC GATCAGCCAG CCCGCTTGCG GTTGCGCTTG CATGGCGGCA AGCTTGGCAA TCGGCATCAA GCCGAAATTC GCCTGAACAA TCGTTTGCTC ACAACCGTCA CAATAACCGG CTTTCAACTG CTTGAGTCAA CGATTAACCT GCCAAGTGGC TGGCTTAGCG CAACCAACCA ACTCACAATT ACCCCAATGA GTACCGAGCG CGAAACCAGT TTTCTCGATT GGGTTGAGCT AGATTACCAG CGTCAAGCGC AGGCAGTTGC GGGCCAATTA CAATGGTCAA GCAGCCAAGC CAACCAAAGC ATTAGCAATA TCATCAGCGA AAATCCGTTG CTATTCGATG TGCAAACGCC CTTGGCTCCG CGCCGCTTGA TTGGCTGGAA TTTGCAGCAA GGCCAATTAA GCTGGCAAAC CAGTGGCAAT CGCCGCTACC TTGTGCAGAG CCAACGCCAA ACACCGTTGA GCAGCGTTTG GTTTAGTCAG CCCGATTTGA GCAGCACCAG CCAGCAAGCC GATTATTTGC TGATTAGCTA TAACCCAGCC AACTCCTCGA GCTGGAGCGA TGCACTGCAA CCATTGATTA CCCAACGCGC CAGCCAAGGC CTCAAGCCAT TATTAATTGA TGTGCAGCAG ATTTACGATC AATTTGGCGA TGGGCGGGTT GATCAACAGG CGATCGCTGA TTTTATCAAG TATGCCTATC ATAATTGGCA AGCACCAGCG CCTAGTTTTG TGGTGTTAGT TGGCGATGGC ACGGCAGATC CGCACGATTA TGCTGATATT ATTGGACAAC CCGTGACCAA TTTTATTCCA CCCTATTTGG CCGATGTTGA CCCGTGGTTG CGCGAAACAG CCGCCGACAA TCGCTATGTA ACGGTTGCTG GCAACGATAC TTTGCCCGAT TTGTTCTTGG GGCGCATTCC AGCGCGTTCG CTGAGCGATG TTGAACATGT AGTTGCGAAA ATATTAAGTT ACGAAGCCAC GCCCAGCAAC GCCGATTGGC TGAACAAGCT GTTGTTTATC GCCGATGATC CAGATGTATC GGGCGATTTT CCGTGGCTTT CAAATGAGGT GGTGGAAATT TTACCGCCAA CGGTTGATGA TCAGCAGTTG TACTACACAG CGAATACCAA TCTGACGAAT TTTCGGGCAG AAATCGTTAA TCAGATCAAT AATGGTCAAT TTTTGGTCAA TTATGTTGGC CATGCGGGCA TCGATGTTTG GGCTGATCCG ACGATTTTCA ACCAGCAATC GGTGGCAAGT TTAAGCAATA GCGCCTTGCC GTTGATGCTC TCGTTGAGTT GCTATGCTGG CCATTATCAA CAAAATGACC TTGAATCGTT GGCCGAAATG CTGGTGTTGC AGCCTGAGCA TGGAGCAGTT GGTATGTGGG CGGCTAGTGG TTTGGGCATC GCCCATGGCC ACGATTACCT GAATCGTGGG TTTGTAAACT CGATTATCAA CGATGGTTGG CGCTTGGTTG GGCCAGCAAC AATTCAAGGC AAGCTTGATT TAGCAGCGGC CAATATCTCG CCCGATTTGC TCGACACCTT CAGCTTTTTT GGTGATCCGG CCTTGCGTTT GCCCTTACCA ACCAACAATG CTTGGCAACC ACAGGCCGAT TATTACGAAG TTTTGCAATA TTCACAGGCT AATCGGCTAA CTCCATTGGC GAATGATCAA GCCGATTTTA GCCAAATTAT CAGCCTTGAA CAACCACAAC ATGGCCAAGT ATGGCTCGAT GCAGATCAAC GCAGCGTTCG CTACACGCCT GATCCCGTTT ATAATGGGCT TGATTCATTT AACTATCAGG TGCGCAATTT GAGCTTGAAT CAAACCCTAA GTGCGACAGT GACGATTAGC GTAACCGCGA TTGCGCCGCA GCTTTACCTA CCATTGACCA TCGCAGATTA TTAA
|
Protein sequence | MLQRWLIGCL VVSLMLIRQP VIAQITTATL TLQRHDQQLH INVQLPKPTL QPNSINIAGW QNDATPDQPA LPRSSHWLVV PAGYQLRLKS VNPQQLQHYQ QQLSLTPSSG WQVDPLAPSK AIALSAPSVA VKQAQYPTTW ANLGQSVQVR EQQLVPLTIF GAQWQPSKQQ IVVPSSIDIA LEFVASTEQP SLRADPFWNE LLRQQVLNPS DLQNPALRPA FATTTPVTNG VRVSFANPGI SEIRWSDLQA AGVPSQWLNQ SANLQLWQGR NQLPRLLTAT GMIFYLPPYN RDQSLQGSVI VRWNGQQPGN VLVSESVNSA NPSLSYYSET LRLEEQKLYL SAFPASGTNR WWWQYWYSPG SGQSAQPLQI NWNLDNATRF DQPARLRLRL HGGKLGNRHQ AEIRLNNRLL TTVTITGFQL LESTINLPSG WLSATNQLTI TPMSTERETS FLDWVELDYQ RQAQAVAGQL QWSSSQANQS ISNIISENPL LFDVQTPLAP RRLIGWNLQQ GQLSWQTSGN RRYLVQSQRQ TPLSSVWFSQ PDLSSTSQQA DYLLISYNPA NSSSWSDALQ PLITQRASQG LKPLLIDVQQ IYDQFGDGRV DQQAIADFIK YAYHNWQAPA PSFVVLVGDG TADPHDYADI IGQPVTNFIP PYLADVDPWL RETAADNRYV TVAGNDTLPD LFLGRIPARS LSDVEHVVAK ILSYEATPSN ADWLNKLLFI ADDPDVSGDF PWLSNEVVEI LPPTVDDQQL YYTANTNLTN FRAEIVNQIN NGQFLVNYVG HAGIDVWADP TIFNQQSVAS LSNSALPLML SLSCYAGHYQ QNDLESLAEM LVLQPEHGAV GMWAASGLGI AHGHDYLNRG FVNSIINDGW RLVGPATIQG KLDLAAANIS PDLLDTFSFF GDPALRLPLP TNNAWQPQAD YYEVLQYSQA NRLTPLANDQ ADFSQIISLE QPQHGQVWLD ADQRSVRYTP DPVYNGLDSF NYQVRNLSLN QTLSATVTIS VTAIAPQLYL PLTIADY
|
| |