Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0504 |
Symbol | |
ID | 4710310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 572518 |
End bp | 573948 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854962 |
Product | flagellar hook-associated 2 domain-containing protein |
Protein accession | YP_001002093 |
Protein GI | 121997306 |
COG category | [N] Cell motility |
COG ID | [COG1345] Flagellar capping protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCTCAC CACTGGATCA GATGCCGAAT ATGCCGAGCC AGATGGACGT CGGCTCCGGC ATCGATACCA ACAAGATGGT CCAGGATCTA GTGCGAGCCG AGCGGGCTCC CACCGAGCAG CGCTTGGATC GGCGTGAGCA AGAGCTCCAG GAGAAGCTCG AGGCCCTCGG GCAGATGCGC GGGACCATCG GCGAACTGCA GGAGGCCGTG CAGGGGTTGG GGGATCCGAG TGCCTACTCC GGCATCGACG CCGAGTCGAG CAATGCCGGT GTGGCGGCGG TCTCGGCCAG CGAAGAGGCG CGCCCCGGTC AGTACGACGT GGAGGTCGAG CAGCTGGCGC GGACCCAGCG CCTCGCCACG GCCAGCGGTG CCTTCGAGGA CAGCGCCGAT GCGGTGGGCA CTGGCCGGCT GGTGATCACC GACGGCGAGG GCAACGAGCA GGCAGTGACC ATCGACGAGG AGTCGGGCAC GCTGCTCGGC ATCCGCGACG CCATCAACGC CCAGGCCGAA GGGCTTCGCG CCTCGGTGGT GGACGACGGG GCGGGGCCGC GACTGGCGAT CGCCACCGAG CAGACCGGCC GGGCGAACGC CATCGCCCAG ATCCGTGCCG AGCAGGACCC GGAGGACGAT CAGGGCAACC TGTCAGCTCT GCAGTACAAC GTCGCGGACC CCCAGAGTGG CGAGCCCATG GGCGCCTTTC AGGAGGTCCG GCCAGCCAGT GATGCCGTTG TGACCATCGA CGGCATGCAG ATCACGCGAC CCGAGAACCG CATCGAGGGG GCCATCGAGG GCGCCACCCT CAGCCTCAAG GAGGAGGGCC GCAGCCGCGT CTCCATCGAG CAACAGACGG GGCTGGCCGA AGAGAACATC CAGCGTCTGG TGGACTCGTT CAACCAGGTG CGGGCCCAGC TCAACCAGCT CTCCGACTAC GACCCCGAGG CCGAGAAGGC GGGTCCGCTG CAGGGGGATC ACACCCTGCG TAACCTCCTC TCGCAGCTCA GTCGGGCCGT CAACGAACCA GTGGAGGCGC TGGACGGGGC GCCCATTTCC TCCCTCGGCG ACCTCGGTGT GCGCACCAAC CGTGACGGCA CCCTGGATCT TGATGGTGAG CGCATGCAGC AGATGGTCGG TGAGCACTCA GAGCTGGTGA CGCGCATGAT GACCGACCCG GAGAGCGGGG TGATGTCGCG GCTTGAGGGG GTGCTTGAGA ACGCCCTCGG CCGGGATAGC GTGATCGACA TGCGGACCGA CGGGGTCGAG AGTCGGCTCG ACCGCATCGC CGATGATCGC GAGCGCCTGG ATCGGCGCAT GGAGCGGCGC GAGGACCAGC TGCGGAGCGA GTTCTCGCGC ATGGACTCGA GGGTGGCTGA GCTCAATCAG ACCTCGGAGT TTCTTGAGCA GCGCCTGGCT GCCATGAACA GCAGGGATTA A
|
Protein sequence | MVSPLDQMPN MPSQMDVGSG IDTNKMVQDL VRAERAPTEQ RLDRREQELQ EKLEALGQMR GTIGELQEAV QGLGDPSAYS GIDAESSNAG VAAVSASEEA RPGQYDVEVE QLARTQRLAT ASGAFEDSAD AVGTGRLVIT DGEGNEQAVT IDEESGTLLG IRDAINAQAE GLRASVVDDG AGPRLAIATE QTGRANAIAQ IRAEQDPEDD QGNLSALQYN VADPQSGEPM GAFQEVRPAS DAVVTIDGMQ ITRPENRIEG AIEGATLSLK EEGRSRVSIE QQTGLAEENI QRLVDSFNQV RAQLNQLSDY DPEAEKAGPL QGDHTLRNLL SQLSRAVNEP VEALDGAPIS SLGDLGVRTN RDGTLDLDGE RMQQMVGEHS ELVTRMMTDP ESGVMSRLEG VLENALGRDS VIDMRTDGVE SRLDRIADDR ERLDRRMERR EDQLRSEFSR MDSRVAELNQ TSEFLEQRLA AMNSRD
|
| |