Gene Information |
Locus tag | Haur_3824 |
Symbol | |
ID | 5735688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4798090 |
End bp | 4800270 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641280976 |
Product | hypothetical protein |
Protein accession | YP_001546588 |
Protein GI | 159900341 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
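The coordinate and length rows above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the Gene Length includes the terminal stop codon; both assumptions agree with the listed values but are not stated in the record. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information rows for Haur_3824.
length = gene_length_bp(4798090, 4800270)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values match the Gene Length (2181 bp) and Protein Length (726 aa) rows.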
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.510661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAATACC CAAGGCTCCA CCAAGTTCAT CACAAGCTGG TGCCCCAATT GAAGGTTCAT GGAACAAGCT TCATAATCAG CATCCTGCAG CATAATCCCA AATCACAAGC TGGTGCCCCA ATTGAAGGTT CATGGAACCC CAACCTTGCG AGCCATCTAC GCAATCCAGC GCTGATTCGC CATGCTATTG TCGGCCCAGA CCAGAATTTG TATATTGCCG GAGCCTTTAG CAATATCAAC AATACCGCTG CTAATGGGTT AGCTCGTTGG GATGGTAGCC AATGGCATAG TTTGGCAACC TCGGGCGCTG ATGTTGATCG GGTGCAGAGC ATGGCTTTTT TTAATAACAA GCTGACCGTA GGCGGCGCAT TTCGCACATG GGCTGGCCAA CCATTTGCCC AGCTTGTGCA ATGGGATGGG GCAGATTGGA TGCAGCTTGG GAGCGGATTT CAAGGGTCTT TTAATAATAG CCCAACCCAG ACCACGGCAG TTAATGCCCT GACCGTGCTT AACACTATGC TGATCATCGG CGGCAATTTT ACTCAATTTC ATGGTCAACC AGCAAACGGT GTGGTTGGAT GGAATGCAAC CGATGCAATC CCATTTGGTT CTGCTAATGG TCAGATTAAT ATGACGGTTG CAAGTACCGA TACCTTAATG ATCCATGGTG ATTTTCGAAC ATTTAATAAT CAAACAGTTC CATATGGTAC GATCCCAAGC TGGAAAGCTG GTATATGGAA AATTCTCGTG CTCCCAGCCA TTCCCAGTGG ATTTATCTAT AAGGCCAATT TAATAAGTAT TGATCAGACC ATTTATCTCT TGGCGAATGA ATCTTATTTC AATGAAACGT TTGTATTTCG CTGGCAAAAT GAACGTTGGG TTCAACTCGG AACAGGCCTG CCTGGTCAAT TCACCAAACT CACCAACGCT AATGGCTCGC TCTATCTGGC GCAAGCTGAT GGTGATGGAA ATGCCAACGA TAGCTATGGG GTGGTGCTTC GACTAGTGGA TAATCAATGG CAAACAGTTA ATCTGCCGCA TAGTTACACA AGCATTAGCC AATTAGTAGC GATTGGCTCT GATGTATATA TTATTGGGCT GCCAGCGGAA AATCAGCAAT GCCCAAATCT TGTCTGTACC TTTACCGTTG AACGCTGGAA TGGCACAACC CTTCAGCTGA TTGGTGAAGC TTGGCAAGCA CCAAGCATTG TGTCGCTGGT TGGCGACGTA GATCATGTTT GGGCAACCAG TCGGCCTACC TATCTTGATC GGCAGGCAGC GCCGACAGTG TTGTTTTGGA ATGGTCAATT GTGGCAAGGC TCTTCCAATA CAGAATCGTT TACCACTACG ATCGTTCCAA CGCTGTTCAA AACAGCAGAT AATAGCGTCT ACTATACGAC GCGCTTCGAA GGGAGTATTG ACCGTCAGGT TTGGGGTAAT GTCTGGCGCT TGGATCGCCT TACCCGCACA TGGAACCCCA ATATTGACAT TGGTGGTTGG TTTGGTGGCT GGAACACAAG TGGCAAGGAT CTCTTGGGGT CTGCTGGTAG CGTCGTGATG TATAGTAGGC CAGTCGATGG AGTTCTCCGG TTGCGGAGCA ACGTTTGGTC TGAAGAAACC AGCGAGTTCG AGGTGGCTGG CGCGTGTGAT ATCTGTGTGC CATTTGAAGT TAATGGTGAG TTTTATCAGC TTGTGGTTAG AGCGCAACTT CAGCTTATCC ATTGGAATGG CAGTAGCTGG GATACACTCA ATAGTTGGGA GAATAGCTAT CCTGTCCAGC TGACATCCTA CCCAGTTGTC 
GTATGGCGTG GCGATTTTTA CTTGATCAAT GGTCGCAAGC TCCAACGCTA TAACTTAACA ACCCAAATGG TCGAAGATAT TGCCCTGCTT GATGGTGATG GCTATAGTTT AGCTACCTTC ACTGATCAAT ATTTGTATGT TGGGGGCGCT TTTTCCAGCG TCAATGGGAT TGCCGCCCAG AACCTCGCCC GCTGGAATGG CACGCAATGG CAGGCGCTGA GCCAAGCCCC AAATGGGCCA GTCTACGTCA TTGCCACCTC GCCAAATTAT CTGTATATTG CAGGAAACTT CAGCCAAGTT GGTACAACCA ACTCCCTCGG GGTAGGCGTA TATCACTTAA CCAGCTCGTA TCAAGTTTTT GCCCCGATAA GCAATAAATA A
|
Protein sequence | MQYPRLHQVH HKLVPQLKVH GTSFIISILQ HNPKSQAGAP IEGSWNPNLA SHLRNPALIR HAIVGPDQNL YIAGAFSNIN NTAANGLARW DGSQWHSLAT SGADVDRVQS MAFFNNKLTV GGAFRTWAGQ PFAQLVQWDG ADWMQLGSGF QGSFNNSPTQ TTAVNALTVL NTMLIIGGNF TQFHGQPANG VVGWNATDAI PFGSANGQIN MTVASTDTLM IHGDFRTFNN QTVPYGTIPS WKAGIWKILV LPAIPSGFIY KANLISIDQT IYLLANESYF NETFVFRWQN ERWVQLGTGL PGQFTKLTNA NGSLYLAQAD GDGNANDSYG VVLRLVDNQW QTVNLPHSYT SISQLVAIGS DVYIIGLPAE NQQCPNLVCT FTVERWNGTT LQLIGEAWQA PSIVSLVGDV DHVWATSRPT YLDRQAAPTV LFWNGQLWQG SSNTESFTTT IVPTLFKTAD NSVYYTTRFE GSIDRQVWGN VWRLDRLTRT WNPNIDIGGW FGGWNTSGKD LLGSAGSVVM YSRPVDGVLR LRSNVWSEET SEFEVAGACD ICVPFEVNGE FYQLVVRAQL QLIHWNGSSW DTLNSWENSY PVQLTSYPVV VWRGDFYLIN GRKLQRYNLT TQMVEDIALL DGDGYSLATF TDQYLYVGGA FSSVNGIAAQ NLARWNGTQW QALSQAPNGP VYVIATSPNY LYIAGNFSQV GTTNSLGVGV YHLTSSYQVF APISNK
|
| |
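The gene and protein sequences above can be compared by translating codons under translation table 11. The following is a minimal sketch, not the database's own method: it builds the standard amino-acid assignments (which table 11 shares with the standard code) and treats an initiating GTG, TTG, or ATG as methionine, a commonly used subset of the table 11 start codons. The function name `translate` and the `is_start` parameter are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate a coding-strand DNA fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final 21 bases of the gene sequence above.
print(translate("GTGCAATACC CAAGGCTCCA CCAAGTTCAT"))
print(translate("GCCCCGATAA GCAATAAATA A", is_start=False))
```

The first call covers the opening ten codons of the gene and the second covers its last seven, so the outputs can be compared against the start (MQYPRLHQVH) and end (APISNK) of the listed protein sequence.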