Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3236 |
Symbol | |
ID | 5735104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4096464 |
End bp | 4099337 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280382 |
Product | hypothetical protein |
Protein accession | YP_001546001 |
Protein GI | 159899754 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGATG TTGAGCGAAT CCGTCAGAAA ATTATTAATT TACAAGCCCG CCGTGCCCAA GAAAGCGACC AAGATTTGAT CGAGGCGCTT GATCAGATGA TTGGGCAACA TCAACACCAA CTAAGTTTGT TGGCTCAACA AGCTGCCGCT GTGCTCGAAC CACCAGCCCC CGCCGCTACC CAAGGCCAAG TTTATGGTTC CAGTGTGATG ACCAACCTTG GTCATGTTTG GATCAACCAA GTTTTTAATG CGCCCTTGAG CACTGGCGAA CGCACCGATC AATTGATTTT GCACTATCTA GCTCATGTTC AGGGCAATGC TGATCGCTTA GTGCTGGGCA CTGATGTCCA AGATAGCGAT CAAGAGCGTA TGTCGTTGCA GCGGGTGTTT ACCATGCTGC AAGCCGATTT TTATGGGCCG ATTGCAGGCG ATGCCTTGTT GCAATCGCTG CTGCTGGGTA GCTCACAGCC ACGCCCGCAA TTGCAAGATT CGATTCTCAA GGCGCTCAAC CAAATTCAAC ATCAGCGTTC GGTGATTTTG GGCGTGCCTG GTGGCGGCAA AACGACGGTG GTGTCGTATC TGGCTTCGGC TCAAGCGGCG GCATTGCTCG ACCCAAGCAA AGCTGAATTT TTGCATAGCC AAGGCTGGAT GCACACCAAA CTTGTGCCAG TGCGGGTGCG ACTGAAACAT GTGCAACCAC CAGCCGACCC TGAGCAACAA ACTGCCGACG CTTTTTGGGA AGTCGTGACC GAATTGCAAT TGGCTGGACG GTTTGGCCAA CGTGCCATCA ATAGCGATGT TGGCTTTCAG CGGATCGAAG TTGAGCGCCA ACAACGGCTG CAAGATGCAA TTGAAGAATT ATTGACCAAT GGCAATGGCT TGTTGCTGCT CGATGGACTT GATGAAGTGC AGCCCGAACA TCTGGCTGCC GTCAAACGCT GTATCGAGCA TGCTCAACGC ATGTTTCACA AAAGCCGAAT TATTGTCACA TGTCGGGTGT TTGATTATGA GCATCCGCTG CCACCGCCGT TTCCCTCGCG CCAGCTCTAC GATTGGCCGA CAATTCAATT GATGCCGTTT GACCTAACCG CGCAGCATGT GTATATCACC AATTACTATA GCGAATTGGG ACGCTTGCAT GCCTCACACG CTGATATTGT CAGCATTCGT AAAAAGCATG CCAAACTGCA TGCCGAGTTG ACGAAATCGG GGATGTTGCA TGAACTGACA CGCACGCCGT TACTCTTGGC GCTGACCGTG CACGTGAATA TGGTGCACAC TGATTTGCCT GAAAGCGAAG GCAAATTGCT ACACATTTGT ATCGACGAAT TGCTTAAACG CCGTGCGCCT GAGGCCATCG CCGTTTCACT TGAAGATTTG TATGACTTGG TTGCATTGTT AGGCTATTAT GCCCATAGTC AAGAAGAAAT TCAAGGCAAA GCGTTGCTTA GTTTGCAGCA AATCAATATG GTGGTGCAGC GCTATTATAG CGATCGCTAT CCGCATCCCA ATCAAATTCA TATGCTTGCG CAGGCGGTTG GCAACGCAAC CTTGCGCTTG ATCAATAGCA ATGGCTTATT GCAAGAGGCT AGCAATAATC AGCAAGATAA TCCACATTAT GATTTTGCCC ATCGCTTGTT TCAGCAATTT TTGGCAGGCA TGTATTTGCT CAATCAAGAG CGCCACGACG AATGTATTGA ACGGGCCGGC AGCGAGCATT GGCGGGTTTG CTTGCACTAC ATGGCCAATT TTGCCCCTTA CATTCAAAAG CAAAGCTTCA TTTTTGGGGT GATTCAAGAT TTATTGGGTG GTATGCCCGA TGAACAGGTG CAAGGCTCGC ACCTGCTTTT GGCGATTGGC AAAGCCAAAG TTGCTAGCGC TGGGCGGCGC AATTTATGGC AGCAAGCAGT CAATCATCTC AAAAATATTG GCGGCTTGAG TGGCAGTAGC CGAGGCGTGC GCCGTTGGGC ACCACCACGC ATCGCCTTCC CCACTCGTTT GCGGGCGGCC TTAGTGCTGG GCGCAATTGG CGATACCCGC TTTATTGGCG AAGATGGCGC ACTCATTGCG CTGACTGAGC GGGTTGTTTG CATTCATGCT GGCAGCGTTG AGCTGGAAGA TCCTCATTCA CGCCCCCAAA CCTATAGCGT TGAGCGCTTT TGGATGAGCC GCTATCTTCT GACCAACTTT GAGTATAGCC AATTTATTGC TGCCCGTGGG TATTTGGATG ATCAGTGGTG GCAGAACGAC GATGCGCGGC ATTGGCGACG CGGTGATCCA TCGTGGCTAC ATGGTTTGCC ACCGTGGGCG GCTCCGCGCC GCTTGCCCGA TTTATGGCAT AACGAGCGTT TCAATCACCC AACTCAGCCA ATTGTTGGAG TGAATTGGTA CGAAGCGAAT GCCTTTTGTG CTTGGCTCAC GGTGCAACTG CGGCCTCAAC TTGATGCGCT TGGGCCAAAT TTGGTGGTAC GTTTGCCGAG TGAGGCTGAA TGGCAACTCG CCGCCAGCCA ACATAGTGAG CGCGAATACC CGTGGGGCGA TGAATGGCGC AACGATCATG CCAACACCAG TGAAAGCGAG CTTGAGCAAC CAACTCCGGT TGGTATGTTT CCCTATGGCA CCTGGAGCGA TGGCCCAATG GATTTAGCCG GCAATGTTTG CGAATGGACT AATAGCATCA ATCGCGACCC TGAACTTGAA CCACAAACCA CGCGTAATCG CCGTTTGCAA CATCGCGATT TGATGGTCGC TGTGCGTGGT GGTTCGTGGT TTCATAATCG CTATTTTGCG CGTTGTCGTT CGCGCTTACA GCATCGCCCA TTTAGCCATG GCCCAAATAT TGGTGTGCGC TTGATTATCG GCCACGCCGA ATAA
|
Protein sequence | MHDVERIRQK IINLQARRAQ ESDQDLIEAL DQMIGQHQHQ LSLLAQQAAA VLEPPAPAAT QGQVYGSSVM TNLGHVWINQ VFNAPLSTGE RTDQLILHYL AHVQGNADRL VLGTDVQDSD QERMSLQRVF TMLQADFYGP IAGDALLQSL LLGSSQPRPQ LQDSILKALN QIQHQRSVIL GVPGGGKTTV VSYLASAQAA ALLDPSKAEF LHSQGWMHTK LVPVRVRLKH VQPPADPEQQ TADAFWEVVT ELQLAGRFGQ RAINSDVGFQ RIEVERQQRL QDAIEELLTN GNGLLLLDGL DEVQPEHLAA VKRCIEHAQR MFHKSRIIVT CRVFDYEHPL PPPFPSRQLY DWPTIQLMPF DLTAQHVYIT NYYSELGRLH ASHADIVSIR KKHAKLHAEL TKSGMLHELT RTPLLLALTV HVNMVHTDLP ESEGKLLHIC IDELLKRRAP EAIAVSLEDL YDLVALLGYY AHSQEEIQGK ALLSLQQINM VVQRYYSDRY PHPNQIHMLA QAVGNATLRL INSNGLLQEA SNNQQDNPHY DFAHRLFQQF LAGMYLLNQE RHDECIERAG SEHWRVCLHY MANFAPYIQK QSFIFGVIQD LLGGMPDEQV QGSHLLLAIG KAKVASAGRR NLWQQAVNHL KNIGGLSGSS RGVRRWAPPR IAFPTRLRAA LVLGAIGDTR FIGEDGALIA LTERVVCIHA GSVELEDPHS RPQTYSVERF WMSRYLLTNF EYSQFIAARG YLDDQWWQND DARHWRRGDP SWLHGLPPWA APRRLPDLWH NERFNHPTQP IVGVNWYEAN AFCAWLTVQL RPQLDALGPN LVVRLPSEAE WQLAASQHSE REYPWGDEWR NDHANTSESE LEQPTPVGMF PYGTWSDGPM DLAGNVCEWT NSINRDPELE PQTTRNRRLQ HRDLMVAVRG GSWFHNRYFA RCRSRLQHRP FSHGPNIGVR LIIGHAE
|
| |