Gene Information |
Locus tag | Haur_3352 |
Symbol | |
ID | 5735222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4228127 |
End bp | 4230175 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280499 |
Product | hypothetical protein |
Protein accession | YP_001546116 |
Protein GI | 159899869 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
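The coordinate and length fields above are mutually consistent. A minimal sketch of the arithmetic follows, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function name `feature_lengths` is hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4228127
END_BP = 4230175


def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop
    codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 2049 682
```

The result agrees with the listed Gene Length (2049 bp) and Protein Length (682 aa).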
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0148489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATCG ATTCCCAGCA TTTAACCCAA GCCAATCAAT CAACTTGGCA ACGCTATCAA ACGGTTATCG CCAGCCTTAG TGCAGGGGTA GCCATCCTGC TTGCATTGTT CAGCTTTGGC CTTGCTGCAA GCTTCGACTG GCTCGTAGCT AATGCTCAAC CAATTATTGG CTTTGGTAGC TTTGTGCTTT TGGTGATTGG GCTACCTGGC TGGGCTACAA TTCGCTGGCT TACCCCGCAG CATGTGCTCA CTCGCAGCGA GCGTTGGGCT TTGAGCTGGG CCGTGGGCAT TGCCTTACCA CCAATTTTAT TCAATCTCTT TCACATCGTA GGCCTATCAA TTAATCGTTG GGTTGTGGTT GGCTATGCGT TGCTTGGGCT ATTGTTGACA ATCTGGCCTG AGCCACGGTC AGCTTGGAAT ACCCGCCTTG CCCAACTCAA ACAAATCCGC ATCAGCAGCC ATGCCTGGAT CTTGCTGAGC ATCACGCTAG TCAGTATCCT TCAGCGTTTG TTGGCTGTAC GCGAATTGAG CGTAGCCCAG TGGAACGATG CCTATCACCA TACGATCATC ACCCAACTTT TTTTAGATCA TGGTGGTATT TTTGAGACAT GGCAGCCCTA TGCTGATTTA AACACCTTCA GCTACCACTA TGGGTTTCAT GCCAATAGCG CCTTTTTGGC ATGGTGGAGC CAACTGCCCG CCACCACTAG CGTGCTCTAC ACAGGTCAAC TGCTGGGCAT TGCCACCGGC GTGATGGCCT ATTTACTGGG CCGTCGCTTG AGCAATCGGC CAAGTGTGGG CTTAATCGCC TTTGGCTTGA CCAGCTTCTA CAACCTCATG CCCGCATATT ATGTCAATTG GAGCCGTTTT ACCCAATTGA TTGGCCAAGT TATTTTAGTA GGCTTGGTGG TCATTTGGCT GCTAGTGTTG GAATATCCAC AGTTTAGTTG GAAATTGGTT GGTCTCGCCA GCGTGCTAAC GACAAGTTTG CTGCTGACTC ATTATTTAGT CACGATATTT GCGGTGGTTA TGGTTGGTTT CGGCATTTTA GCCTTATTAG CCCGCCAGCC AAGCTTAACC AATTTGAAGC AAATCAGCTT ACGTGCAACA GCAATTAGCC TTGCAAGCGC TCTGATCGCA GCACCATGGC TGTATACCAT TATTCAAAGT AAACTTACGG CGATTGCCCG TAACTATGTG ACTGGTTATA GCGTTGGCTA TGCCACAACG GTAGCAACCC TCGACCAAAT TGTGCCAACC TATATTAAAG CTCCAATTAT GCTGTTGGCC GTTGTCGGGA TTTGGTTGGC TTGTGCCCAA CGGGCTTGGC GTATGCTGTT ACTGGTGGTT TGGAGCCTTG GGCTTCAGAT TTTGGCAGTG CCCTATGTCT TTAACTTACC AATTAGCGGC ATTATTAGCG GATTCGCCGT TTCAATTATG CTCTATCTGA CGCTGATTCC ATTAGCAGCC TATCCCTTGG GCCTTGTGCT GGAGCGCTTT AACCAGCAAT GGTATGTCAA AGGCTTGGCT CTGATCGGCT TATATGGGCT GATTGTGTGG TCTACACCAT GGCAAACTGC GATTGTCAAC GAGCAAAATC GTTTATTGAC ACGGGCTGAT GAGCAAGCGA TGCACTGGAT TCGCACTACA ACTGAGTCAG AAGCTCGCTT TCTGATTAAT GGACTTTTTA GCTATGGCGA TGCATTAATT ATTGCCGATG ATGGTGGCAT GTGGATTCCC TTCCTGACTG GTCGCCAAAC TACCATTCCA CCACTGACCT ATGGCTCGGA AAAAGCGATT 
AATCCGCAAC TTGATCGCGA AGTCTACGCC TTATACGATG CATTACGCAC CACCAACCTA GAGACTGCCG CAGGCCTTGC CTTGCTGCAA CAGCATCAGG TTGATTATAT TTATACTGGG CCGCATATGG GCAAAAATGC TCAAAAAATT CAACTCAACA CCCAAGCACT CCGCTATCGT CCTGAACAAT TTCCAATTGT CTACGAGCGC GATGGAGTGG TGATTTTTGC AGTAAAGGCG CAACAATGA
|
Protein sequence | MQIDSQHLTQ ANQSTWQRYQ TVIASLSAGV AILLALFSFG LAASFDWLVA NAQPIIGFGS FVLLVIGLPG WATIRWLTPQ HVLTRSERWA LSWAVGIALP PILFNLFHIV GLSINRWVVV GYALLGLLLT IWPEPRSAWN TRLAQLKQIR ISSHAWILLS ITLVSILQRL LAVRELSVAQ WNDAYHHTII TQLFLDHGGI FETWQPYADL NTFSYHYGFH ANSAFLAWWS QLPATTSVLY TGQLLGIATG VMAYLLGRRL SNRPSVGLIA FGLTSFYNLM PAYYVNWSRF TQLIGQVILV GLVVIWLLVL EYPQFSWKLV GLASVLTTSL LLTHYLVTIF AVVMVGFGIL ALLARQPSLT NLKQISLRAT AISLASALIA APWLYTIIQS KLTAIARNYV TGYSVGYATT VATLDQIVPT YIKAPIMLLA VVGIWLACAQ RAWRMLLLVV WSLGLQILAV PYVFNLPISG IISGFAVSIM LYLTLIPLAA YPLGLVLERF NQQWYVKGLA LIGLYGLIVW STPWQTAIVN EQNRLLTRAD EQAMHWIRTT TESEARFLIN GLFSYGDALI IADDGGMWIP FLTGRQTTIP PLTYGSEKAI NPQLDREVYA LYDALRTTNL ETAAGLALLQ QHQVDYIYTG PHMGKNAQKI QLNTQALRYR PEQFPIVYER DGVVIFAVKA QQ
|
| |
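The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is illustrative only: it translates the first 30 and last 21 bases of the gene sequence shown above, with the grouping spaces removed. It assumes that the amino-acid assignments of translation table 11 match the standard code (the two tables differ in their permitted start codons, not in codon meanings). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
head = "ATGCAGATCG ATTCCCAGCA TTTAACCCAA"
# Last 21 bases of the gene sequence, including the TGA stop codon.
tail = "GCAGTAAAGGCGCAACAATGA"

print(translate(head))  # MQIDSQHLTQ
print(translate(tail))  # AVKAQQ
```

Both fragments match the start (MQIDSQHLTQ) and end (AVKA QQ) of the listed protein sequence.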