Gene Information |
Locus tag | Haur_4021 |
Symbol | |
ID | 5735882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5131741 |
End bp | 5133042 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641281171 |
Product | hypothetical protein |
Protein accession | YP_001546781 |
Protein GI | 159900534 |
COG category | |
COG ID | |
TIGRFAM ID | |
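
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record, not something it states, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 5131741
END_BP = 5133042
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)
```

Under those assumptions the fields agree: 5133042 - 5131741 + 1 = 1302 bp, and 1302 / 3 - 1 = 433 aa.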
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGAGTA TGGTTGCTGC TTCGCCGAAT CCTTTGTATG CTGCTTCAGC CTTTGTTGAT GTGTATCCAT TTGTGCGTCA GCAAGAGAGC GAAGAAGAGT TATTAATTGG GCGAGTTGAT ACTAATAATT TTATTATGCT GCCGAAAGAG GCAGTCGAGG TTTTGGATGA TTTGGCGCAG GGCAAGAGTG TTGGTGAAGC CCAAGCCCTC TATGCCGAAC GCTACGGCGA AATCCCCGAT CTGGCCGATT TGCTTGAACA ATTAGAATCT GAAGGTTTTG TTCAGCCACT GCATTCCGAT ACAGTGCGTT TTGGTCAACA ATCCCCCGTA ACTGCTGCTA CTGCTAGCGC CAACCCCAAT CAACCACGTG CTGTACGCTT TCACTTTACC TTCTTTCCAA TCCGTTTGGC TCAAGTGTTG TTCAGCCCGA TCTTACTGGT TTGCTATGCG CTCTTTATTG GTGGGGCGGC GGCAATCGTT GTAGCTCAAC CATCAATTGT GGCCGGCTGG CGAGCCATGG TCGTCGATCA ACAAATGGCG CTCTTTACCT TGATCATCAT GCTGCATGGG TTTGTGATCA CCTTTTTCCA TGAGCTTGGG CATGCGGTGG CGGCCCGTTC GCGGGGAGTC GATGTGCGCT TTGGCATTGG GCGACGTTTA TGGGTGATTG TGGCTGAAAC TGATATGTCA GGCATTTGGT CGATTCAGCG CAACCTGCGA TTTTTACCAA TTTTTGCCGG CATGATTGTC GATTTGCTCA GTGCAGCAAT TATGGTCTAC CTCGCATTTA TGCATCAACG CCAAATCATT AATCTTTCTG ATTTTGGCTA TATCTTGGTG CGGGCGTTTA TGTGGAGCTA CCTGCTGAAT TTGCTGTTTC AATTTTATTT CTTTGTGCGC ACCGACATCT ACTATGTCCT CTCGACATGG CTTCGTTGCT CAAACTTAAT GGGCGATACA GCAAATTATA TGATCAATCG TTTCAATCGT TTATTGGGAC GCGCTGAAGT TCATAATCAG GCAGCTATTC CTGAACGTGA GCGCAAGATT ATCAAACGAT ATGCCTTTTT CTGGCTAATT GGGCGGATGC TGGCGTTTTA TTCACTCTTC TTCTTAACCC TGCCAATTTT ATGGAGCTAT GCCAGCATTT TATTTGAACG AATGTTTGGT AGCGCCAGCG CTGGTATGCA GGTGCTCGAT TCAATTTTGG CAGCCATTTT GATCTTTATT AGCCAAGCTG TTGGAATTTT CCTCTGGCTT TGGAGCTTGA TCCGCAGAAA GGTTAGCGTC GATGACATCT AA
Protein sequence | MTSMVAASPN PLYAASAFVD VYPFVRQQES EEELLIGRVD TNNFIMLPKE AVEVLDDLAQ GKSVGEAQAL YAERYGEIPD LADLLEQLES EGFVQPLHSD TVRFGQQSPV TAATASANPN QPRAVRFHFT FFPIRLAQVL FSPILLVCYA LFIGGAAAIV VAQPSIVAGW RAMVVDQQMA LFTLIIMLHG FVITFFHELG HAVAARSRGV DVRFGIGRRL WVIVAETDMS GIWSIQRNLR FLPIFAGMIV DLLSAAIMVY LAFMHQRQII NLSDFGYILV RAFMWSYLLN LLFQFYFFVR TDIYYVLSTW LRCSNLMGDT ANYMINRFNR LLGRAEVHNQ AAIPERERKI IKRYAFFWLI GRMLAFYSLF FLTLPILWSY ASILFERMFG SASAGMQVLD SILAAILIFI SQAVGIFLWL WSLIRRKVSV DDI
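
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch of how the two relate, the snippet below strips the block spacing and translates DNA codon by codon. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code, so a single lookup table is built here. The helper names `strip_blocks` and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = strip_blocks(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACGAGTA TGGTTGCTGC TTCGCCGAAT"))
# Last twelve bases of the gene sequence above, which end in the stop codon TAA.
print(translate("GATGACATCT AA"))
```

The first call yields the first ten residues of the listed protein (MTSMVAASPN) and the second yields its last three (DDI).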