Gene Information |
Locus tag | Hhal_0507 |
Symbol | |
ID | 4709960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 576316 |
End bp | 577725 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854965 |
Product | flagellin domain-containing protein |
Protein accession | YP_001002096 |
Protein GI | 121997309 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
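The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminating stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 576316
end_bp = 577725

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 1410
print(protein_length_aa)  # 469
```

Under these assumptions the computed values agree with the listed Gene Length (1410 bp) and Protein Length (469 aa).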
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAGG TCATCAACAC CAACATTGCG TCCCTGACCG GTCAGCGGCA CCTGAGCAGC AGCCAGGCCG AGCAGCAGCA GGCCCTGGAG CGGCTCTCCT CGGGGCAGCG GATCAACTCC GCGGCCGACG ACGCCGCCGG CCTGGCGATC AGCGAGCGCT TCACCTCGCA GATCGGTGGC ATGAACCAGG CGGAGCGCAA CGCCAACGAC GGCATCTCCT ACGCCCAGAC CGCCGAGGGG GCCATGGAGG AGATGGGCAA CCTCCTGCAA CGGGTCCGTG AGCTGGCGGT GCAGTCGGCC AACGACACCA ACACGGCCGA AGACCGTCAG GCCCTGGAGG CCGAGGTGCA GCAGGCGGTG CAGGAGATCG ACCGGATCGC CTCCAGCACC CAGTTCAACA ACCAGAACAT CCTGGACGGC TCGCTGGATG AGCTGGTCTT CCAGGTGGGC GCCAACCGTG CGCAGAGCAT CAACACCGGC GGTGTCGATG TGCGCGGCCA CAACCTGGGT GCCGAGATCG GTGAGGGGCA GGCCGTGCAG CGGGCCCTGG ACGAGAACGG TGACTACGGC GATCTCGACC TGGACGGCTC GATCAACATC AACGGGCTGG ATGTTGATGT CAGCGGCTCG CGGAGCGTCT CCGACGCCAT GGACGCCATC AACGCCCAGT CCCGTGCCAC GGGCGTGACG GCCTTCCGGG CTGACCGCGC TACCACCGAG GCGTTCGACT TCAACAACGA CGGCGGCTCC AGCCTGGAGA TCAACGGCAC CACCGTCAGT GTGGGCGAGG ACGCCGGGGT GGGTGAGTTC GTCGACGAGG TGAACGCGGC CTCGGGCAAC ACGGGTGTGC GGGCCGAGAT GGTCGGCGAT GACCAGGTGC GCTTCGTCTC CGAGTCCGAC TTCCGCATCG AGCCGGGTGA CAACAGCCCG ATCGGTGATC TGGGTCTCGA GGCTGAAGAA TCGGGGATGC GCTTCGAGCG GGGTGTCCAG CTCTCCACCG ATCTGGGGCA GCGCCTGGAT GTCAATGGGG ATGCGGACAC ACTGGCGGCT CTGGGCATGA GCGACGAGCA GATGGACATG AGCCGTCACC GGGTCAGCGG GCCGGATGCG CTGAGCGTGG CCACCCGCAC CGATGCCGAT GACGCCATCC GCACGGTGGA CTTCGCCCTG GGGCAGATCA ACGACGCCCG GGCCGACCTG GGTGCGGTGC AGAACCGCTT CGAGGCCACC ACCAGCAACC TGCAGAACGT CTCCGAGAAC ATGGAAGCCT CCCGTTCCCG GATTCTGGAT GCGGACTTCG CCGCCGAGAC CGCCGCCATG ACCCGCGCCC AGGTGCTCCA GCAGGCCGGC ACCTCGGTCC TGGCCCAGGC CAACGAGGCA CCGCAGAACG TCCTGACCCT GCTGCAGTAA
|
Protein sequence | MAQVINTNIA SLTGQRHLSS SQAEQQQALE RLSSGQRINS AADDAAGLAI SERFTSQIGG MNQAERNAND GISYAQTAEG AMEEMGNLLQ RVRELAVQSA NDTNTAEDRQ ALEAEVQQAV QEIDRIASST QFNNQNILDG SLDELVFQVG ANRAQSINTG GVDVRGHNLG AEIGEGQAVQ RALDENGDYG DLDLDGSINI NGLDVDVSGS RSVSDAMDAI NAQSRATGVT AFRADRATTE AFDFNNDGGS SLEINGTTVS VGEDAGVGEF VDEVNAASGN TGVRAEMVGD DQVRFVSESD FRIEPGDNSP IGDLGLEAEE SGMRFERGVQ LSTDLGQRLD VNGDADTLAA LGMSDEQMDM SRHRVSGPDA LSVATRTDAD DAIRTVDFAL GQINDARADL GAVQNRFEAT TSNLQNVSEN MEASRSRILD ADFAAETAAM TRAQVLQQAG TSVLAQANEA PQNVLTLLQ
|
| |
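The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), and it relies on translation table 11 assigning sense codons the same amino acids as the standard code. The `translate` helper and the `head` and `tail` names are hypothetical. Only the first 30 and last 12 nucleotides of the record are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
# Assumption: translation table 11 matches the standard code for sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the gene sequence in this record.
head = "ATGGCACAGG TCATCAACAC CAACATTGCG"
tail = "CTGCTGCAG TAA"

print(translate(head))  # MAQVINTNIA
print(translate(tail))  # LLQ
```

The two outputs match the start (MAQVINTNIA) and the end (LLQ) of the listed protein sequence. The helper does no validation, so any character other than A, C, G, or T raises a `KeyError`.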