Gene Information |
Locus tag | Haur_1986 |
Symbol | |
ID | 5733875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2443419 |
End bp | 2445467 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279130 |
Product | hypothetical protein |
Protein accession | YP_001544757 |
Protein GI | 159898510 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
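The coordinate and length fields above are related by simple arithmetic. The following minimal sketch reproduces the reported lengths. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2443419
end_bp = 2445467

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2049 bp) and Protein Length (682 aa) fields.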
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATACG TGGATCTGGT GATCAAGGGC TATCTTAATG CTTCAAATTA TTTTCAGATG AATGTTCAAA TGCGCGGTTT GGTTTCCGAT GGTGGTGCGG TTGATAACAT TCCAATTAGT TTAGAAACGA TTACCAAATT GCGCGAACGC TATCAACGCT TGATCAAAGA CTTTCATCAC AGCATCTACA ACCATGCGAA GCCTGAAGAT CCGAGTATGT CCAGGTTGCT CACACAGGCT GCGGCTGTGG CTGTCTACCA CTCTGCCGAT TCGATCGAGT CCTTGCGGGC CTTGCGATCG GCGTGCGGAG AGTTTATGAC GGTATGGGAT ACGTCAATTA AGCCACCCGC CTTGTTTCAG GAACACTATC ACCAGCTTTG GCAACTTGGT AATCAAATGC TGCAAATGCT TCCTGGTACA GCAAAAACTA TTCTGCGTGA CAGTATTTGG CAAACCCAAT CACGCCAAGC AAAACAGGGT TTACGGGTTA TTCTCGATGT TGCTGAGAAT GCCCGTAGCC TGCTTGACTT ACCATGGGAG TTGTTGGTGA TTCCCACAGC GCATCAAACA CTGGCGGCCC AGGGTATCGA GCCGCTAAAT GAAGCTGAAT TTTCAACCCG TTTTTTGTTT TTGCACCATC ATATGGTGTT AATTCGCCAA GTTTCGAGCA TGCTGCCCTA TCAGCCAATT ACAATTGACC GTAATTTAGC GTTACAAGTT ATTGCGGCAC CGTTAGTCCG CTCGCCAATC GATACCAGAT CGTTTATTGC TGAATTAGCC CCCTTGTTTG CTGGCGAAAC ACTTGAGCGC TGGTGGAGCA GTGATCCAAA TACGATTGAA ACATTGCATA AGCGCTTGCT TGAACATCAA CCTCAAATTG TCCAATTGTT GTGTCATGGG CATTCGCCCA AGCCCGAAGC CAAACTACAG CGTCATGATA TGCTTGTAAC CTATATTCTG AATAATCAAC AGGTGGTTTA TCGAGTTAGC TCGCATGATC TTTGGCCGAT TCTGAGTGCT TCGGCACGGC TTCAGTTGGT TGTCTTAACG GTGTGTCATA GCAGTGGTGC GCAACAAACC GAAGAAAATA CGGCAACTGT TAGCAATATT GCCTACGATT TGGTGCGGGC TGGCGTGCCA ATGGTTATTG GAATGCAAGG AGCGATTGCC CAACATGCCG CCGCACGCTT CTGTGGAGTT TTATATACGG CCTTGCGTGA AGGCCATACG ATCGAATGGG CGATCACGGC GGCACGGGCG GCGCTGAGTG GCAATCGCTG GTTTATCGAT TGGACAATTC CCGTGGTGTA TCGCCAAGCT GATCAACGTG AACGCCCAGC ATGGCATACC CGTTTGGCCG ATTTTCTTGA TGCGCGGTTA CTTTCCCCTT CCTATCGCCG TGGTTTTCGA GCTGCGGTGA TTGTGCTCGC TTTGGGCTTG ATCATTGGTG GCTTGAGTCG CGCAATCTTT TGGCCTAGTC AATTGAGGGT CAATCTTGAA CTATTGCGCA CTGGAGCGTT TCTGTGGGCC ATAATTGGGG TGACTTGTAC CCTACTGGTT GATCACTTTA TGACCAGTTG GCGGCCCCCT CATTTAGCGC CGCATGAAAT TGTTGCTCGT CGTTATGCTA GCCGTGGTGG AATGTTGTTA GGCTATGCGA TTGGCGGCTG GGCTGGGGCA TTATTGCTTG GTGGGCTGTT TTTGAGCATT GGCGAATTAA TCAGTCCACC GATTTGGCAA GCGCTTTTTC TGGGGCTGGT TGGTTGGTCG AGCTTGTGGG GCTATGTGGT TGCCCGCTCG 
GAAAGCCGTG CTGCGCATAA CAATTGGCGG CTTTATCCGC AACTGTATGT AGCCAAGAAT GGCTGGTTGT CGGTGTATCT TGGGATGCTC CTGCTGCTCT TTATTGTGCC ATTGGGTCTG TTGACCGCCT ATGGTCAAAG TCTGGTTGGG CTTATCTTGA GTTCAGGTTT GGGCAGCGCA GCGATGGGGG TAACTGCGCT GACCATGATC TACAGTTTTG ATCGTGAACG GCGGGCTAAG CTGGGTTAG
|
Protein sequence | MQYVDLVIKG YLNASNYFQM NVQMRGLVSD GGAVDNIPIS LETITKLRER YQRLIKDFHH SIYNHAKPED PSMSRLLTQA AAVAVYHSAD SIESLRALRS ACGEFMTVWD TSIKPPALFQ EHYHQLWQLG NQMLQMLPGT AKTILRDSIW QTQSRQAKQG LRVILDVAEN ARSLLDLPWE LLVIPTAHQT LAAQGIEPLN EAEFSTRFLF LHHHMVLIRQ VSSMLPYQPI TIDRNLALQV IAAPLVRSPI DTRSFIAELA PLFAGETLER WWSSDPNTIE TLHKRLLEHQ PQIVQLLCHG HSPKPEAKLQ RHDMLVTYIL NNQQVVYRVS SHDLWPILSA SARLQLVVLT VCHSSGAQQT EENTATVSNI AYDLVRAGVP MVIGMQGAIA QHAAARFCGV LYTALREGHT IEWAITAARA ALSGNRWFID WTIPVVYRQA DQRERPAWHT RLADFLDARL LSPSYRRGFR AAVIVLALGL IIGGLSRAIF WPSQLRVNLE LLRTGAFLWA IIGVTCTLLV DHFMTSWRPP HLAPHEIVAR RYASRGGMLL GYAIGGWAGA LLLGGLFLSI GELISPPIWQ ALFLGLVGWS SLWGYVVARS ESRAAHNNWR LYPQLYVAKN GWLSVYLGML LLLFIVPLGL LTAYGQSLVG LILSSGLGSA AMGVTALTMI YSFDRERRAK LG
|
| |
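The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the first few codons for comparison with the protein sequence. It uses a hand-built codon table rather than a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard assignments are used here. The function names are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; only the set of start codons differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to print 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an unformatted nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping before the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence as printed above.
prefix = clean("ATGCAATACG TGGATCTGGT GATCAAGGGC")
print(translate(prefix))
print(gc_fraction(prefix))
```

The same functions can be applied to the full gene sequence once its blocks are joined. The short prefix is used here only to keep the example compact.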