Gene Information |
Locus tag | Haur_2600 |
Symbol | |
ID | 5734478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3337357 |
End bp | 3339750 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279740 |
Product | hypothetical protein |
Protein accession | YP_001545366 |
Protein GI | 159899119 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
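The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_2600.
length = gene_length_bp(3337357, 3339750)
print(length, protein_length_aa(length))  # 2394 797
```

Under these assumptions the computed values match the Gene Length (2394 bp) and Protein Length (797 aa) fields listed above.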
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000916945 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAGAA GTTCAAAATT AATCAGCATC GTTACCCTGT TTTTAATTAT GTTAAGTGCT ATCACCAAGC CAAGCGCTCC GGTGGCAGCC GTGGAGCCGC AAATTCCAAG CAGCCCTGGC ACATGGAGCC AAGATTTTGG CTTATATATA AAAACTGCCG CTAACAGCTA TACCCCGTTG CGCCCAATTG AATGGAATGG CACGCTGTAT GCAATTTTGA ATGGACATCG CAACTTTGAG TCGGGGGTTG GTTATTGGAA TGACCAACAA TGGCTGAAAA TAGATGGCCT GAACGGTCAA GTTGATGTGC TGACGGTGCA TCAAAATCGT TTGTATGTGG CTGGGCGCTT TACCATCGCT GGTAAGAACC TTAACCTTGC CTATTGGGAT GGCAACTTGT GGACGGCTAT GCCCATGCAT GTTGTTCAAG ATGGCCCGTT TTTGTTAGCA AGTTATGCCA ATCAACTGTA TCTTGGGAGC GAAGCCCTGT TGATCGATGA TCAAGCCTAT GGCAGCTTAG CCCGTTGGGA TGGTGAACAG TGGCATGCGG CGGCAGCTGG CATTGAAGGT GTTGTGTTTA CCATGCTTTC ACGCCCCGAT GGCCTCTATC TCGGCGGCTC CTTTTTATAT AATGGTCAAG CAACTAGTTT ACTGTATTGG AATGGTTCCC AATGGCAAAC TGTTGGTGGT GGGGTTTATG GCCATGTGGT TGATTTGAAA TGGGCCAATA ACCAACTCTA CATCAGTGGG AATTTCACCT CAACGCTTGT GCCAAGTATG CGCAATGTCG CCGCCTGGAA TGGCACGAAC TGGAATACCT TCGACACGGG TTTGACTGGC TGGGTTCATA ATGTAACCGT TATGGATGGT GAACTCTATG CACTCCATCG GCCAACTGAT AGTTCAGGCT ATGAGCATCT GCATTGGGAT GGGAGTGAAT GGATCTTCTT AGCTGATCTT GGTTCAATGG ATTATTCTTG GCTATTTTCC CCAAGAGCTG TATTTGTACA GTATAAGCAA GAGTTATTGC TGGTTGGTGA AATTAGCTAT TATAGTAGTG CCGTTGAACC AACCAGCCAA GGTGATTCAA CCTTACGTTG GAATGGCACA GAGTGGGAAC CAATGAATTC CCATGGCTTA TTGACCCGAA TGGCAACTGC GATTGCCACA GTTGATGAAA ATCTTTTGGT GGCATCGAGT AACTTTTACT GGAGCCAAGG TGTTGCTTCG CTAGCCCAAT TTGGCAGCAA TCAGCATTGG CAGAAGCTCG TTGATCGGCG ATCTACTTCA GACCCGGTGC AAGCACTCGA AGCCTTTCAA CAAAGCCATT TTATGGTTAG TTACTATACG CTGTACGAAG CTGCTAGCAC CACCTGGAAC CAAATTGGCG CGTTCCGTGT GGTTCGTGCA ATTGCCCAAT CCAATAACAA ATTGTATGTT GCTGGTGATT TTGAGCAATT CAATGGGGTC ACGGCGCATA ATTTGGTGAC TTGGGATGGC ACGCAATGGC AAGCCTTAAA TGCGCCAGCC TCATTCGATC AAGTGGTGCT AATTGAGGCC CATGGCAATC ATGTGTATAT CAGCGATGGC TTTCAATTGG CTCGCTGGGA TGGCACGCAG TGGACAACCC TCGCCACCAA TGTGGTCAAT ATTGGTTCAA TTGAAGCAAC TACCAATGGG GTCTATATCG CTGGCACCTT TAGCAGTGTT GGCGGCGTGA CAGCCCCCAA AATCGCCTAT TGGAATGGCA CGGCTTGGTC GGGCTTGAGT GGGGTGATTA GTGGTTCAAT TCTTGATCTG 
GAAATGGGAG CCGATGGCCT ATATGTGGCA GGCAACTTTT TCGGTTTGAA CAATGGCATC GTTAGCCCAG GGATTCTGCG CTGGGATGGG TCGGCATGGC ATGGAGTTGG CGGTGGTGTT CAGACGTATC GATCGTTTGG TATAGTGAAC AATGGGACGG TTCAGCGCTT AGCCGCCACT CCGACCCGTA TGTTTATGTA TGGTGACTTC AATCGGGTTG GCAACCAATA TGAGTCGTAT ACTTTGGCCG TGTGGGAATA TGGTGACGAA CCATTGATCA AAGCCAAACC TGATTATGCG TTTACCAATC GCCCTCAAGC GGTTACGGTC AATGTTTTAG CCAATGATTG GAGCTACCAT CCATCTCAAT TGCAACTGGT GAATCTCACA GCGCCAAGCC ATGGCACGGC TGTGATTAGC GGTAATTCGG TGGTCTACAC ACCGTATCCT CAATTTACCG GGATTGATAC TCTGACCTAT ACCGTGCGCG ATCCAATCAA TGCCGTAACA ACCACAGCTC AATTACGGCT CTATGTTTGG AATACCCCCA ATGTTGTTTT GAATGAGCTG TATTTACCAG CAGTTATCCG ATAA
Protein sequence | MSRSSKLISI VTLFLIMLSA ITKPSAPVAA VEPQIPSSPG TWSQDFGLYI KTAANSYTPL RPIEWNGTLY AILNGHRNFE SGVGYWNDQQ WLKIDGLNGQ VDVLTVHQNR LYVAGRFTIA GKNLNLAYWD GNLWTAMPMH VVQDGPFLLA SYANQLYLGS EALLIDDQAY GSLARWDGEQ WHAAAAGIEG VVFTMLSRPD GLYLGGSFLY NGQATSLLYW NGSQWQTVGG GVYGHVVDLK WANNQLYISG NFTSTLVPSM RNVAAWNGTN WNTFDTGLTG WVHNVTVMDG ELYALHRPTD SSGYEHLHWD GSEWIFLADL GSMDYSWLFS PRAVFVQYKQ ELLLVGEISY YSSAVEPTSQ GDSTLRWNGT EWEPMNSHGL LTRMATAIAT VDENLLVASS NFYWSQGVAS LAQFGSNQHW QKLVDRRSTS DPVQALEAFQ QSHFMVSYYT LYEAASTTWN QIGAFRVVRA IAQSNNKLYV AGDFEQFNGV TAHNLVTWDG TQWQALNAPA SFDQVVLIEA HGNHVYISDG FQLARWDGTQ WTTLATNVVN IGSIEATTNG VYIAGTFSSV GGVTAPKIAY WNGTAWSGLS GVISGSILDL EMGADGLYVA GNFFGLNNGI VSPGILRWDG SAWHGVGGGV QTYRSFGIVN NGTVQRLAAT PTRMFMYGDF NRVGNQYESY TLAVWEYGDE PLIKAKPDYA FTNRPQAVTV NVLANDWSYH PSQLQLVNLT APSHGTAVIS GNSVVYTPYP QFTGIDTLTY TVRDPINAVT TTAQLRLYVW NTPNVVLNEL YLPAVIR