Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0426 |
Symbol | |
ID | 5732325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 495956 |
End bp | 498265 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277552 |
Product | hypothetical protein |
Protein accession | YP_001543205 |
Protein GI | 159896958 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTGGA TACGCTGGTG TTGGCTGCTG TTGCTGATAG CCCTACCGAT CAGTGCAGCA TCGAGCCAAG AGCAATCAAA TAATGGGCTA GTTTGGCAGG TTGAGCCAGT TTTTGGCACA GTTTATCGCG CTGGTGAGTG GATGGCTTTG CGGGTTAAGG TCAGCAATAC TGGCGTTGAT CGCATGGTCG AAATTCATAT TGGCAACTAC AATGTTGAGC TTGATGTGCC AGGTGGCTCG CAAAAAGAAA CTTTGGTCTA TGGCCAATTT GATCAGGCCT TTCGGGTGCC AATGGCCATG TATTTAAATG GTGAGAAGCG CGAAACCGCT ACGCTGCAAC TACGTTCCAC CGATCGCCCA ATGGTCGCAA CCTTGCTGGA ACAGCCACTA CCCCAATTAA CCAAAGTGGT ATTAGTTGCC ACTGATAGCC AACAATTGCC CAACGTTCAG ATTGGCCTTG ATATGCTGAG TGGGATTGTG CTCAATCAGC GTTCATGGGG CGAACTCACT CCTGAGCAAC AGCAAGCGAT CAAGCAATGG GTCAATAACG GCGGGGTGTT GGTCGCTGAG CCAAATCTGC TCGAAGATTT ACCAGCCGAA CTCCAACCAG CTCAAGCTAC TGGCCAGCAA ACGATTGCCA CCACAACCCT AAGTCAAGTG TTTGGTCAAC CATTTAATGT GCCAAACTTG GATGTTGCCC TACTTCAGCC CAATCAACAA GCAGCGACCA GCCTCAAACA AGAGCAAACG CCGCTGTTGG TTAAGCGCTC GGTTGGCCGT GGCTTCGTGA TTGCCAGCGC CTTTAATTTT GATCAAGCTG AAATTACGGC GTGGCGGGGC TATGAAAACC TACTGGCGTC GCTCTTCCCC CAGCAACAAG ATCTAGGCTG GATGGGTGCT GGTAGCAGTG AAAGTATGTT ACGTGATAGT GCCGCGCCTG CCTTGCTGAA CAATTTGCCA GCCCTCGATT TGCCGCCACT GCAAACGCTG TTAATTCTTT TGGGAATTTA TATTGTTTTG GCCGGGCCAG TCACTTATCT TGTGTTACGG CGGCTTGATC GCATGGCCTT AGCTTGGCTG ACGATTCCGA CATTAACCTT AATTTTTGCC GTGCTAGCCT ATGGAATTGG CTCGAAACAA CGGGGCACTG ATATTTTGGT TCACGATGTG GCCTTGATTT ACCCTGACGA GCAAGCGAAT GCCACGGTCT TAGGCTATGC GGGTATTTTC TCGCCAGTAC GCGAAGACTA TCAAGTTACA ATCAACCATG GCGCATACCC TCGCCCATTG TTGGTTAACC CCAATCTGGG CACTAATGGT GGCATTAACG CCACCAGAGC GCTCTACAGT CATCAGCCAG GCGATGTGCG TAATCTCACG ATCAATCAAT GGTCGATGGA AATGTTTGCC TACGAACGTG ATATCAGCGA TGCGCCGCAT GTGCAAATCG AAATCAGCCT GCAAGGTAAA AAAATTATTG GCAAGGTCAA AAATACCAGC ACCTATACGT TGAGTGATGC GGCGTTTATT ATGTTGGGCA ATGCTCAAAA GTTTGGCACA ATCGAGCCAG GCGCTGAAAA ATCGATCGAA TTGCAAATCA ACAATAATCC CTCGATGTCT GGGGCAAGTT ATTTGTTGTT TCAAAAAGAG CTTGATGAAG GCTATCAGAC TGAATTTGGC CCGCCACGTG AGCTTTTGGC CAAAACCCAG GCGCTCGATA TTGCCCTTCC TACCACCTAT GGCAGTATCG ATCGGAATGC AGTATTTGTA GCATTTGTCG ATTTCAATCA AGCAGTAATC GACTTGCCGG AAAAACGCTT TGCTGCGACC CAACGTAGTC TGCTCATTCA ATATCCGCAA CTGCGCTATA GCCAAAATGC CGTTGAACTG ACCAATAATT GGTTTAGCTA TATCTTAGAA GACGACCCAA ATATGGGCAT GGGTTCAACC TGCACAACCA GTGCTCGTGG TGCTGGCACC AACATCTCAC AAAAAACTGG CATTATTCAA CTTAGCCTAC CCACAACATT TAGCTCAATC GAAGCCCAGT CGTTACGGCT CTTGCCAACG ATCGATGGTG GCATGGTTGC GCCCAACAAA TTAACCTACG AACTATACAA CTGGGCTACT AGTCAATGGG ATGTCGTTGA ATTCAAAAAT GGAGCGATTG ATCTGACCAC CAGCCCCAGC ACCTATTTGC AACAAGGCAA CTTGCGCACC CGCTACACCC TCGATCCAAC GGTTGATACC ACGGCAGGGC TATGGGGTTG CTTCTCGGTT GGGCCAGTGC TTTCGGGAGT TGTCAAATGA
|
Protein sequence | MRWIRWCWLL LLIALPISAA SSQEQSNNGL VWQVEPVFGT VYRAGEWMAL RVKVSNTGVD RMVEIHIGNY NVELDVPGGS QKETLVYGQF DQAFRVPMAM YLNGEKRETA TLQLRSTDRP MVATLLEQPL PQLTKVVLVA TDSQQLPNVQ IGLDMLSGIV LNQRSWGELT PEQQQAIKQW VNNGGVLVAE PNLLEDLPAE LQPAQATGQQ TIATTTLSQV FGQPFNVPNL DVALLQPNQQ AATSLKQEQT PLLVKRSVGR GFVIASAFNF DQAEITAWRG YENLLASLFP QQQDLGWMGA GSSESMLRDS AAPALLNNLP ALDLPPLQTL LILLGIYIVL AGPVTYLVLR RLDRMALAWL TIPTLTLIFA VLAYGIGSKQ RGTDILVHDV ALIYPDEQAN ATVLGYAGIF SPVREDYQVT INHGAYPRPL LVNPNLGTNG GINATRALYS HQPGDVRNLT INQWSMEMFA YERDISDAPH VQIEISLQGK KIIGKVKNTS TYTLSDAAFI MLGNAQKFGT IEPGAEKSIE LQINNNPSMS GASYLLFQKE LDEGYQTEFG PPRELLAKTQ ALDIALPTTY GSIDRNAVFV AFVDFNQAVI DLPEKRFAAT QRSLLIQYPQ LRYSQNAVEL TNNWFSYILE DDPNMGMGST CTTSARGAGT NISQKTGIIQ LSLPTTFSSI EAQSLRLLPT IDGGMVAPNK LTYELYNWAT SQWDVVEFKN GAIDLTTSPS TYLQQGNLRT RYTLDPTVDT TAGLWGCFSV GPVLSGVVK
|
| |