Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0654 |
Symbol | |
ID | 5732554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 751114 |
End bp | 752532 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641277783 |
Product | hypothetical protein |
Protein accession | YP_001543430 |
Protein GI | 159897183 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.485589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGCCCT ATGCCCATGA TCCGGTGGCC TATGCGCGGG AGGTGTTAGG GGAAGTTTGG TGGACGAAGC AGGAACTGAT CGCGCGGTCG TTGCTCACGC CGCCGTATCG CACGTTGGTC AAAGCGTGTC ATAAGGTTGG CAAAACCCAC CTTGGCGGCG GCTTAGTCAA CTGGTGGTAC GACAGTTTTG ACCCGGGACT GGTGCTGACA ACCGCCCCCA CCGATCGCCA AGTGCGTGAC CTGCTTTGGA AGGAAGTGCG CATGCAGCGG CGCGGACGCG CGGGCTTTAC TGGCCCTAAG TCGCCGCGCT TGGAGAGTAC GCCCGACCAT TTCGCCCATG GCTTCACGGC CAAAGATGGT GATTCGTTTC AAGGCCATCA CTCGCCGCAT ACCCTGTTCA TCTTTGATGA AGCGGTTGGC GTGGCCAGTG TGTTTTGGGA AACCGCAGAG TCGATGTTCA ACGAGGGCGG CGCATGGCTG GCGATTTTCA ATCCGACCGA TACCAGCTCG CAAGCCTATG CTGAGGAATT GAGTGGGGGT TGGCATGTCA TTTCGATGAG CGTGCTCGAG CATCCTAACA TTCTGGCCGA ATTGCAGGGC TTGCCGCCGC CGTTTCCCTC GGCGATTCGC CTGAGCCGCG TCGATACCTT GCTCAAAAAG TGGTGTCGCG CACTCAGCCC CGAAGAACCA AAACGCGCCA CCGATATTCA CTGGCGGGAT GCCTGGTATC GGCCTGGACC GATCGCCGAA GCGCGGTTGT TGGGCCGTTG GCCATCCCAA GCGACCAACA ATGTTTGGAG CGATGGCGCG TTTCAGGTGG CAGAAAGCCT GCTGTTGCCT GCGAGTGATG AACCGTGCGA GCTGGGCTGT GATGTCGCCC GCTATGGCGA TGACTTTACC GAGATTCATG TGCGCCGTGG CGGCCACAGT CTCTATCACG AAGCGGCCAA TGGCTGGAGC ACGGTCGAAA CCGCAGGGCG CTTGAAGCAA TTGGCCAACG AATATGGGCG ACGATGTGGG GTTGATGGTC GCGCCGTTGC GGTCAAAATC GACGATGACG GCATCGGCGG CGGCGTGGTC GATCTCGCCG ACGGCTATAC CTTCCTCGGA GTCAGTGGGG CACGCACGGC CTACGATCCC GAGAAGTATC CCAATCGCCG TAGCGAATTA TGGTTCAGTG TGGCCGAACG GGCGATGGAG CAACGTCTGA GCTTCGTTGC GCTCGATGCG GAAACCCGCC GCGAGCTGCG TCGCCAAGCG ATGGCTCCCA CGTGGAAGCA AGATAGTCAG GGACGGCGGG TCGTCGAGCC AAAAGCCGAC ACCAAGAAAC GGATTAAGCG CAGTCCCGAT GGCATGGATG CGGTCAATTT GGCCTATGCC CCCGCCCCGA TCGTCAGTTT TGGCCCAAGT CTTTGGTAA
|
Protein sequence | MLPYAHDPVA YAREVLGEVW WTKQELIARS LLTPPYRTLV KACHKVGKTH LGGGLVNWWY DSFDPGLVLT TAPTDRQVRD LLWKEVRMQR RGRAGFTGPK SPRLESTPDH FAHGFTAKDG DSFQGHHSPH TLFIFDEAVG VASVFWETAE SMFNEGGAWL AIFNPTDTSS QAYAEELSGG WHVISMSVLE HPNILAELQG LPPPFPSAIR LSRVDTLLKK WCRALSPEEP KRATDIHWRD AWYRPGPIAE ARLLGRWPSQ ATNNVWSDGA FQVAESLLLP ASDEPCELGC DVARYGDDFT EIHVRRGGHS LYHEAANGWS TVETAGRLKQ LANEYGRRCG VDGRAVAVKI DDDGIGGGVV DLADGYTFLG VSGARTAYDP EKYPNRRSEL WFSVAERAME QRLSFVALDA ETRRELRRQA MAPTWKQDSQ GRRVVEPKAD TKKRIKRSPD GMDAVNLAYA PAPIVSFGPS LW
|
| |