Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0175 |
Symbol | |
ID | 5732084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 204764 |
End bp | 206662 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277299 |
Product | hypothetical protein |
Protein accession | YP_001542955 |
Protein GI | 159896708 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000546306 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTC AACGAATGAT TGGGCTGGGT GGCCTGCTTG GGCTAGCACT ATTGATCTTA ATCCTGCCAA ACAGCCGTTT TTGGTTTGCG GCGCTGGTTT GTGCGTTGGC TCCAGGCTAC GTGCTCGAGC GCTGGCTGGA TTTGGATTTA GCGCCATTGG TGCGACCAAG CCTATGGATT GGGCTAAGCC TGGCAGTTTG GCCGTTGGGC TATTTATGGC TGACCACGCT TGGTTTATCG CTGAGCACTG GTATGATCAC GTTGATTGCC TTTGGCCTGC TGGCTGGCGT AGGTTGGCGT TTATGGCGCG AGGGCGAGCG ACCTTGGGCC TTGCCAGCGC CAGTGCCAAT TCTTGGATTG GCGTTATTAA TTGTGACCTT CGCGATTAGC ACTAGAATTA GCCATATTCG CAACGTAGCT TTTCCTCCGT GGGTCGATTC GCTGCACCAT GCCACGATTA TGCGAGTGAT TGCTGAAAGC GGCCAAGTGC CCTACTCGTT GCGCCCCTAC ATGCCAGTTG ATAATTTTGG CTATCACTGG GGCTTTCACG CCACGGCAGC CACGATCTAC AATCTGAGTG GCATGAGCAT TCCCCAATTT ATGCTGTGGT ATGGTCAATT CTTGGGTGTG TTGGTGGTGA TTTCGGTTGG CAGCGCGACG ATTGGCCTAA CCAAAAGCCC GATTGCTGGC CTAGCCGCCG CCACCATGAC GGGCTTTATC TCGATTATGC CCGCTTATTA CCTGAGTTGG GGCCGTTACA CCTTGCTTTC GGGTTTGGCG ATGGTTCCAG TGGTGTTGCT GTTGGCGTGG GTTGCGCTTG ATCGACCTGA TCGCAAAGGC CTTATTTTGC TAACGCTCGT GGTTGGTGGG CTACTGCCAA CTCACTTTGT GGCGGCTGGC TTTGCGTTGT TATGGTGTGT CGCAGTTTGG TTGGGCCGCG ATGTTTGGAC CGAGCAGCGT TGGCAAATCT TGGGCAAGCA AGCGGCATCA GTCGGCATGG CGATTTTGTT GATGTCGCCA TGGCTGGCCC TATTGATTCG TGAAATTCAG CCTGCTGGCA GTGGCACACC CAAGCAATTG ATTGGCGGCG GCTACAACAC CTATGAAGCT GCCAAAGGCT TGTATTGGAC GTGGAATAAC CTTTTGCTCT TCTTAGTAGG TTTGTTGGCG GCTTGGATTG GCTTGTTTCA ACACTGGCGT TTAGTGTTGA TCAGCTTTTT ATGGGCTAGC CTTGTCATGC TGTTTGCTAA TCCAGTGGTA ATTGGCTTGC CCTACCTCTC GTTTTTCAAC AACAACATTG TGGCCTTAGC AATCTTTTTG CCGATTAGTT TGTGGTTTGG CTTTGGGGTT GCTTCATTAG ACCAAGGCTT GAGCAAACAT CTCAAACAGG GAGTAGCCCG AGGTTGGCGG GCGATTCGCA CCGCAATTTT GGCAATAACC GTGCTGATTT CGGCTACCAA AATGCACAGC GTAATCAACG ATGGTACGAT TATCGCCAAA GCTGATGATT TAACTGCCCT GAATTGGATT GTGCAGCGCA TTCCCAAAAA TGCGCGGTTT GCGATTAATA CCGAAGGTTG GTTGTATAAC GTGGCCCGTG GCAGCGATGG TGGCTGGTGG ATTTTGCCCT ATGCTGGCTT GCAAGTGAGC ACACCGCCAG TTGTCTACAA CCAAGGTACA GCTGAGTATA TTGCGGCGGT TGAGGCTGAA ACCAGTTGGT TGCGCAATGC CAACGAAAAA AGTGCTGCTG AATTGGCTCA GTGGATGCGT GAACATAACT ATGATTACGC CTATGCTACC ACCAATGGCA AAATCTTCAA TCAAGCCAAA TTAGCCAATA CAGCTGAATT TGAGCTGGTC TATGAAAATG CAAGTGTGGC GATTTATCTG CGGAGATAG
|
Protein sequence | MNRQRMIGLG GLLGLALLIL ILPNSRFWFA ALVCALAPGY VLERWLDLDL APLVRPSLWI GLSLAVWPLG YLWLTTLGLS LSTGMITLIA FGLLAGVGWR LWREGERPWA LPAPVPILGL ALLIVTFAIS TRISHIRNVA FPPWVDSLHH ATIMRVIAES GQVPYSLRPY MPVDNFGYHW GFHATAATIY NLSGMSIPQF MLWYGQFLGV LVVISVGSAT IGLTKSPIAG LAAATMTGFI SIMPAYYLSW GRYTLLSGLA MVPVVLLLAW VALDRPDRKG LILLTLVVGG LLPTHFVAAG FALLWCVAVW LGRDVWTEQR WQILGKQAAS VGMAILLMSP WLALLIREIQ PAGSGTPKQL IGGGYNTYEA AKGLYWTWNN LLLFLVGLLA AWIGLFQHWR LVLISFLWAS LVMLFANPVV IGLPYLSFFN NNIVALAIFL PISLWFGFGV ASLDQGLSKH LKQGVARGWR AIRTAILAIT VLISATKMHS VINDGTIIAK ADDLTALNWI VQRIPKNARF AINTEGWLYN VARGSDGGWW ILPYAGLQVS TPPVVYNQGT AEYIAAVEAE TSWLRNANEK SAAELAQWMR EHNYDYAYAT TNGKIFNQAK LANTAEFELV YENASVAIYL RR
|
| |