Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0226 |
Symbol | |
ID | 4709289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 260467 |
End bp | 261507 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639854685 |
Product | OmpA/MotB domain-containing protein |
Protein accession | YP_001001822 |
Protein GI | 121997035 |
COG category | [N] Cell motility |
COG ID | [COG1360] Flagellar motor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.174216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGAGG AGAACAAGAC CCGCCCGGTC GTCATCAAGA AGGTGGCGAA GCACGGCGAC GATCACCACG GTGGCTCGTG GAAGATCGCC TTCGCTGACT TCATGACGGC GATGTTCGCC ATCTTCCTGG TCCTGTGGCT GCTGCTCGCC CTGGATGACG ATCAGCGCCA GGGCATCGGG CAGTACTTCC GCGACCCGCA GGCGGCGCAT CCGCCCGCCT CGCGGGATAT CATCGACTTT GAGGGCGAGC GCCGCGCTCC CATTGATCTC AGCGGGCTGC CCATGGGTCA GGGCGGTTTT ATCCCCACCG AAGAGATGCA GGAGCTGGCC GAGCAGTTCC AGGACGCCGT GCTTGACGAC CCGGACCTGG CCGAGTACGC CGATCAGATC CTGCTGGAGA TCACCGACGA CGGGCTGCGC ATCCAGCTTG TCGACCACGA CGGGCGGCCG ATGTTCGAGC TCGGCAGCGC CGACCCCAGG GAGCATACCG AGGAGATCCT CCGCGCCCTG GCCCGGGTGC TGGAGGATGT GCCGAATCCG GTCTCGCTCT CCGGCCACAC CGACGCCCGA CCCTTCGCCC GCGACGATTA CGACAACTGG TCGCTGTCCA CCGATCGGGC CAACGCGGCC CGGCTGACCC TGCTCGACGG CGGGTTGCCC GCCGAGCGCA TCGGTCAGGT GGTCGGCTAT GCCGATACCG TACCCTTCGA CCCGGACGAC CCCCGCGCCG ATATCAATCG CCGGATCTCC GTGGTGCTGC TCAGCCGCGA GGCGGTGCAG GGCATCGCCG AGCGCGAGCG GCGCATCGAC CCCGATGAGC AGACTCTCGA CGCCCTGCCG CGCCGTCCGC GGGAGCTGCT CACCCCGGAG GAGCGGCGGA TCGAGGAGGG GCTCGATGAG GTCGAGGAGA CCGTCCCCGA GACCGGGGAC GAGGCGCCGG AGGCCGACGA CGCGGCCGAG GAAGCGCCGG ACGACGACGA GGCGGCCCCG GAGGTGGAGA TGCCCGACCT GGAGCCGCCG GCCGAGCCGG AGACCTGGTA A
|
Protein sequence | MVEENKTRPV VIKKVAKHGD DHHGGSWKIA FADFMTAMFA IFLVLWLLLA LDDDQRQGIG QYFRDPQAAH PPASRDIIDF EGERRAPIDL SGLPMGQGGF IPTEEMQELA EQFQDAVLDD PDLAEYADQI LLEITDDGLR IQLVDHDGRP MFELGSADPR EHTEEILRAL ARVLEDVPNP VSLSGHTDAR PFARDDYDNW SLSTDRANAA RLTLLDGGLP AERIGQVVGY ADTVPFDPDD PRADINRRIS VVLLSREAVQ GIAERERRID PDEQTLDALP RRPRELLTPE ERRIEEGLDE VEETVPETGD EAPEADDAAE EAPDDDEAAP EVEMPDLEPP AEPETW
|
| |