Gene Information |
Locus tag | Hhal_0506 |
Symbol | |
ID | 4709959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 574515 |
End bp | 575993 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639854964 |
Product | flagellin domain-containing protein |
Protein accession | YP_001002095 |
Protein GI | 121997308 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAGG TGATCAACAC CAACGTTGCA TCGCTGAACG CGCAACGGCA CTTGAATTCT TCCCGGGGCG ACCAGGAGGT GGCCCTGGAG CGGCTCTCCT CGGGGCTGCG TATCAACAGC GCCCGGGACG ATGCCGCCGG TCTGGCGATC AGTGAGCGCT TCACCGGCCA GATCAACGGC ATGGATCAGG CCGCCCGCAA CGCGAATGAC GGCATCTCGT TTGCCCAGAC GGCGGAAGGC GCCATGGAAG AGATGAGCAA TCTGCTCCAG CGGGTCCGCG AGCTGGCGGT GCAGTCGGCC AACGACACCA ACTCGCCGTC GGACCGTGCC GCTCTGGACC GCGAGGTGCA GGCTGCCGTG CAGGAGATCG GCCGGATCGC CGAGAGCACC CAGTTCAACC AGCAGAACGT GCTCAACGGC ACCCTGCGCG AGCTGGTCTT CCAGGTGGGG CCGAACCGCG GGCAGACGAT CAATGCCGGG GGCGTTGACG TGCGTGCCGA GAACCTCGGG GCCAATGTGG CCGAGGGTCG TGCCGTGCAC CAGACCGCCG GCGGCGAGGG AGTGCAGCTC CCGAGCGGCC TGCAGGTCAA CGGCCAGGAA ATCGATCTCG GCGACGCCCG CGAGCTTAAC GACGTGGCGA GCGAGATCAA CGAGCGTCAG GCCGAGACCG GGGTCTCCGC CATGCGCGCC GACCGTGCCG AGACCCAGGC GGTGGAGTTC GACGGTCTCG CGGAAGGGGA GCGTGCCCAG CTGCGGATCA ACGACCACGC CATTGAGCTC GATGGCGACA TGGAGGACAT GAGCGACTTT GCCGCTCGGG TCAACGACCA GGCCTCGGAG ACCGGGGTGC GTCTGGAGAA CGGCGAGAAC GGCTGGTCCT TCGTCTCCAA CAGCGACTTC GAGCTGGAGT ACATCTCGGA CGATGCCGAG GGAGCGCTCT CCGTCGGTGG CACCACGGTC GGACAGGGCC TGGATCGCAC CGACGAGGAG AGCACCGGGC TGATTGTCGA GCGGGGCATC ACCCTGTCCA CGGAGATCGG TGGTGAGCTC CGGGTCGATC CGCTGGAGGG GGACGACGAC GCCGACCTCG GGGCGATCGG GCTCAAGAAC TGGCAGCCCG GCGGCGACTA TGAGGATCTT CAGGCTGAGG CGTACACGGT GGGTGGCGTG GATCCGGTGG ATGTGCGCAC CCGGGAGACT GCGTCGGATA CCATCGTGGC GGTGGACTTC GCCCTGCAGC AGATCAACAA CACCCGGGCT GATCTGGGTG CGATCCAGAG CCGGTTCGAT GCCACGATCA ACAATCTGAA CATCTCCTCG GAGAACCTGA GCGCATCCCG CTCGCGGATC CTGGATGCCG ACTTCGCCGA GGAGACGGCC GAGATGACCA GGACGCAGAT CCTGCAGCAG GCGGGCACCT CGGTGCTGGG TCAAGCCAAC GAGATCCCGC AGCAAGTGGC ACAGCTGTTG CAGCAGTAA
|
Protein sequence | MAQVINTNVA SLNAQRHLNS SRGDQEVALE RLSSGLRINS ARDDAAGLAI SERFTGQING MDQAARNAND GISFAQTAEG AMEEMSNLLQ RVRELAVQSA NDTNSPSDRA ALDREVQAAV QEIGRIAEST QFNQQNVLNG TLRELVFQVG PNRGQTINAG GVDVRAENLG ANVAEGRAVH QTAGGEGVQL PSGLQVNGQE IDLGDARELN DVASEINERQ AETGVSAMRA DRAETQAVEF DGLAEGERAQ LRINDHAIEL DGDMEDMSDF AARVNDQASE TGVRLENGEN GWSFVSNSDF ELEYISDDAE GALSVGGTTV GQGLDRTDEE STGLIVERGI TLSTEIGGEL RVDPLEGDDD ADLGAIGLKN WQPGGDYEDL QAEAYTVGGV DPVDVRTRET ASDTIVAVDF ALQQINNTRA DLGAIQSRFD ATINNLNISS ENLSASRSRI LDADFAEETA EMTRTQILQQ AGTSVLGQAN EIPQQVAQLL QQ
|
| |
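The coordinate and length fields in this record can be cross-checked against one another. The sketch below is a minimal, non-authoritative illustration. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; neither convention is stated in the record itself. The function names span_length and expected_protein_length are hypothetical and belong to no database API.

```python
# Values copied from the record above.
START_BP = 574515
END_BP = 575993
GENE_LENGTH_BP = 1479
PROTEIN_LENGTH_AA = 492


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are mutually consistent")
```

The strand field does not affect these checks: a gene on the minus strand spans the same number of bases, and only the reading direction of the sequence changes.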