Gene Information |
Locus tag | Haur_0564 |
Symbol | |
ID | 5732285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 653980 |
End bp | 655413 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641277691 |
Product | hypothetical protein |
Protein accession | YP_001543340 |
Protein GI | 159897093 |
COG category | |
COG ID | |
TIGRFAM ID | |
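
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_0564.
length = gene_length(653980, 655413)
print(length)                           # 1434
print(expected_protein_length(length))  # 477
```

Both results match the Gene Length (1434 bp) and Protein Length (477 aa) rows of the record.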
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAAAG CTGTCGAATT GGCAAAGTGC CATGCCGACC CTGCCTACTT CACCCACAAC TACGGACGAA TAGACGACGC GCAGGGTCTT GGTGATGGCA GCGGCGATAT GCCGTTTACG CTCTGGCCTG CGCAGATTGA AGTCCTGTGG ACGCTTCTGC TTCAGCGTCT AATTCTGATC TTGAAGGCGC GGCAGTTGGG TATTAGCTGG TTGTGTTGTG CCTATGCCCT GTGGCTCTGC CTGTTCCAAC CGGGCAAGGT GGTGCTGATC TTCAGTAAAG GTCAAGGCGA AGCGGACGAG ATGCTTCATC GGGTCAAACG GTTGTATGAA CGCTTACCCG ATTGGATGCG CGAAGCCTCG CCAGCGCTGG TGACGGACAA CACGACCGAA CTGGAATGGG CGAATGGCAG TCGGGTTAAA TCACTGCCCG CGACCAAAGG GGCAGGGCGT TCGTTCACTG CATCGCTCGT GATTTTGGAC GAAGCCGGAT TCTTGATTTG GGCTAAGCAG TTGTATACCG CGCTCAAGCC CACGATTGAC GGCGGCGGCC AACTGATTGT TCTCTCCACA GCCAACGGGA TTGGCAATCT GTTTCATCAA TTATGGGTCA AGGCACTCAG TGCCAAGAAT CGGTTCAAAA CTATTTTTCT GCCATGGTGG GCGCGACCAA CCCGTGATGC CCAGTGGTAT CAAGAGCAGC TTGAGGAGTA TACCGACCTT GATATGGTTC GGCAGGAATA TCCCTCAACC GCGCAAGAAG CCTTTTTGGT GTCAGGGCGC ACACGGTTCA AAATGCCGTG GTTGCTTCAG CAGACACCGA GTGACGGTTT AGCAGCGGAA TCGTTACCCG ATGCGCTGGA CAAGCTTGAC GGCGTAACGA TGTATCAATT GCCGCAACAA GGACGGCGCT ACATTCTGGC AGCGGACGTA GCCGAAGGGC TGGAGCACGG CGACTTCTGC GCTGCCACCC TAATCGACGC AGTGTCATGG GAGGAAATGG TCAGCGTCCA CGGCAAGTGG GAACCGGATG AATACGCTCG CATCTTGATG GCGTTGTCGG ATGGGTACGG GGCCACGGTT GCGGTCGAAC GGAACAATCA CGGCCACGCG GTACTGACGA CGATGAAGCT GGCCGGATTC ACGCGCATCG TGTATGGCCT TGATGGGCGA GCAGGCTGGC TGACCAACGC CCAGACCAAG CCACAAATGA TTGACCTGTT AGCAACGGCA CTGCGCGATG TGTTGGTAAA AATTCGCAAT CAAACGGCGC TGAATGAACT AGCGATTTAT CGGATTTTGA AGAACGGCGG GACAGGCGCA CCCGCAGGCT ATCACGATGA TTTCGTGATG GCATGGGCCA TCGCTCTCAT GGTTGCCAGT CAACCAACGG AAGTCGAAGA CGAAGCCATT GCTGGCTCGT GGGATAGCTA CTAA
|
Protein sequence | MSKAVELAKC HADPAYFTHN YGRIDDAQGL GDGSGDMPFT LWPAQIEVLW TLLLQRLILI LKARQLGISW LCCAYALWLC LFQPGKVVLI FSKGQGEADE MLHRVKRLYE RLPDWMREAS PALVTDNTTE LEWANGSRVK SLPATKGAGR SFTASLVILD EAGFLIWAKQ LYTALKPTID GGGQLIVLST ANGIGNLFHQ LWVKALSAKN RFKTIFLPWW ARPTRDAQWY QEQLEEYTDL DMVRQEYPST AQEAFLVSGR TRFKMPWLLQ QTPSDGLAAE SLPDALDKLD GVTMYQLPQQ GRRYILAADV AEGLEHGDFC AATLIDAVSW EEMVSVHGKW EPDEYARILM ALSDGYGATV AVERNNHGHA VLTTMKLAGF TRIVYGLDGR AGWLTNAQTK PQMIDLLATA LRDVLVKIRN QTALNELAIY RILKNGGTGA PAGYHDDFVM AWAIALMVAS QPTEVEDEAI AGSWDSY
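
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, plus a simple GC percentage calculation of the kind summarized in the GC content row. The helper names are hypothetical, and only the first two blocks of the gene sequence are used here as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence from the record.
sample = clean_sequence("ATGTCGAAAG CTGTCGAATT")
print(sample)       # ATGTCGAAAGCTGTCGAATT
print(len(sample))  # 20
print(gc_percent(sample))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` would give a value to compare with the GC content reported in the record.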
|
| |