Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1530 |
Symbol | |
ID | 4710430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1659845 |
End bp | 1662466 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639855997 |
Product | TPR repeat-containing protein |
Protein accession | YP_001003099 |
Protein GI | 121998312 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein [TIGR03504] FimV C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0304158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGGTT GTGAAGCGTG GCAAGGGTTG TCCGAAGAGG AGTATCTGGA GCGCGCGGCG GAGCAACTAG AGGCTGGGGA GTACCGTGCG GCGGTTCTTG ACTATAGGAA TGCCCTTCAA AAAGTAGAGA CGCCCGAGAC CCGCGGTCAG CTCGGGTTGG CGTACGCCGG CGATGGCGAT ACCGACGCAG CGATTAATCA CCTCACCCGC GCGCTGGAGC AGGGGGCGGA GCCCAAGCGC TACGCCCCGA CCCTCGCCCG CCTTTATCAC GAGACAAGGC AACACGGCGA GCTGGTCGAC TTCGAATACG AAGCGCTTGA GGGGGAGGCC AAGGCCCGGG TATTTGCCTA CCGCGCAGTG GCGGCCTACC ACCGGGGGGA TGACGAAGCG GGTAGCGAGT TGTTGACGGC CGCGGTGAGC GAAGCAGCGG ACCTTCCGGA GCTGGAGTTG GCAAAGGCCT TTGGTGCCCT CGTTGCCGGT GATCGGGCGC AAGCTGCGAG CCACCTGGAG GCCGCTGTGG CGCTCGATGA AGACTACTAT CCGGCATGGA CGCTTCGCGG CAGCCTGGCT GAGGTCGTTG GTGAAAGGCA GAGCGCACGG GAGGCCTTCG ATCGAGCCGT CGATCTACGA CCCGACTATG TGAGCGATCG TTTCTCGCGA GCGATGGTTC GGCTTCGCGA AGAAGACTTT GAAGGGGCCC GAGCTGACGC CGAGCAACTG GTCGAAGGGG CGCCGGGTTT TGCCGGCGGC TACTTCTTGA TGGGGCTCGT AGAACTCGAG GGTGATTCCC CCGATGAAGC CCGAACGCAG TTCGAGGAAG CGCTGGCACG CCAACGGGAA TATCGCCCGG CGCGCCCCTA TCTCGCAGGG CTGGAGCTTG AGCGTGGGAA CCTTGCCCAG GCCGAGCACC ACCTGAATCG ATACCACGCG AGTGGTCCGG GGTCTGTGAC CAGCTACACC TTGCGGGCAA GGTTGTACAT GGCGCAAGAT CAGCCCGAGG AGGCGCGGGA GGTCCTCAGT GAGGCGCTCG CGGAACGGCC GGAATGGGTT TCTGAGCTGG GCGATCGGCT AGGTGCGCTC TATCTGGATA CCGGCGATGT GGAGTCCGGT ATCCAGACCC TGAGGCGTGT TTTGGACGCG CAACCCGATG CCCTGGAGAC GCGCGAGGTG CTCGGAACGG CGCTGCTGCT GGCCGGAGAC GAAGAGAAGG GCCTTGAGGA GTTAGAAGCC GTGGCGTCCG CTGATGGGGC CCTGCGTGCA GCGGATTTGA CATTGGTGCA GGCTTATATC GAGGCTGGGC GTTTCGAGGC GGCGGCGGTC GCTGCTGAGC GGACCCAGGA TAAGGCCCCG GAGGATCCCG TAGGATATAA CCTCCGTGCC GCGGTCCTCC TGGGTCAAGG GGATCGCCCG GGGGCGCGGC GTGTGTTGCG AGAAGGGCTG GAGGCGGTTC CGGACAGTGC CGATCTGGCG ATGAATCTTA GCAGCCTCGA GCGCGCCCGC GGTGAGGCCG AGGCGGGGAT CGAAGTGCTT CGTGGGGTGC ATCAAGCAGT ACCCGATGAG CCGCGGGTGG CGATGCGGCT GGCTGAGTGG CTGATCCAGA TGGATTCCGC AGCCGAGGGG CTGTCCGTGC TTGAGCAAAC CCTGGAAGAA CGTCCGGAGG ATGCACAGTT TCTAGGCGAG GCCGCACGGA TCTACGGCAT GGCCGGTGAG GACCGAGAGG CGCGGAGGCT GCTCGAGCGC GCCACGGAGT TGGACCCGGA TGCCGCCGAC CTGCATTACC TGCTCGCGCT GGCGCGTGCG GCCGCGGGGG ATGAGCAAGG GTCGGTGGAG GCCTTGGAGT CGGCGCTTTC GGCTGATCCC AGCCACTACT CGGCGGCCCA TGCCCGGTCA CGGCAGTTGG CTCGCCAGGG GCAAAGGGAG GAGGCTGAAG AGGTTTTCGC ACCGGTGGCC GAAGCCAACC CGGATGCACC GGAGGTGCAC GCCCATCAGG GTTGGTTGGC GTATCGCGAT GGCCGGTTTG CGCGTGCGGC CGAGCATTAT GGACACGCCC TCTCAGGTCG TATGGAGCGG TCCTGGGTGC TCGAGGGCTA CACCGCCAAG CGCGCCGCAG ATGACCTGGA TGCCGGTATC GATCGTCTAA ACGACTGGCT GAGCGCCAAT CCGGCAGATG CGCAGATGCG CCATGTGCTG GGCTCTGCAC TACTGGAAGC AGGTCGCGAC GAGGATGCCA TCCAGGTTTA TACGCACCTG TTACAGCAGC GTGAGCAGGA CCCGCTGGCC TTGAATAACC TGGCGTGGCT TCTGCGCGAG GACGCGCCGG ACCAAGCGCT GGAATACGCC CGGCGCGCCT ATGACCTGGC CCCGGGGGAG GCTGCCGTGC TGGACACCCT CGGCGTGGTG CTCCTTCGCA ACGACCGGAC GTCCGATGCG GTTGAGTATC TCGATCAGGC CGTGGCCCTT GCCGGCGGCG ATCCGGACGT GCAACTCAAT CTTGCCAGGG CGCTTCGTGC TGACGGTCAG GTGAGCCAAG CCCGTGATAT CCTCGAGCAG TTGCTGGAGG CGCATGCGGA CTTTTCGGGG CGCCAGGACG CAGAGGAAAT GTTGACGAAC CTCGGCGATT GA
|
Protein sequence | MVGCEAWQGL SEEEYLERAA EQLEAGEYRA AVLDYRNALQ KVETPETRGQ LGLAYAGDGD TDAAINHLTR ALEQGAEPKR YAPTLARLYH ETRQHGELVD FEYEALEGEA KARVFAYRAV AAYHRGDDEA GSELLTAAVS EAADLPELEL AKAFGALVAG DRAQAASHLE AAVALDEDYY PAWTLRGSLA EVVGERQSAR EAFDRAVDLR PDYVSDRFSR AMVRLREEDF EGARADAEQL VEGAPGFAGG YFLMGLVELE GDSPDEARTQ FEEALARQRE YRPARPYLAG LELERGNLAQ AEHHLNRYHA SGPGSVTSYT LRARLYMAQD QPEEAREVLS EALAERPEWV SELGDRLGAL YLDTGDVESG IQTLRRVLDA QPDALETREV LGTALLLAGD EEKGLEELEA VASADGALRA ADLTLVQAYI EAGRFEAAAV AAERTQDKAP EDPVGYNLRA AVLLGQGDRP GARRVLREGL EAVPDSADLA MNLSSLERAR GEAEAGIEVL RGVHQAVPDE PRVAMRLAEW LIQMDSAAEG LSVLEQTLEE RPEDAQFLGE AARIYGMAGE DREARRLLER ATELDPDAAD LHYLLALARA AAGDEQGSVE ALESALSADP SHYSAAHARS RQLARQGQRE EAEEVFAPVA EANPDAPEVH AHQGWLAYRD GRFARAAEHY GHALSGRMER SWVLEGYTAK RAADDLDAGI DRLNDWLSAN PADAQMRHVL GSALLEAGRD EDAIQVYTHL LQQREQDPLA LNNLAWLLRE DAPDQALEYA RRAYDLAPGE AAVLDTLGVV LLRNDRTSDA VEYLDQAVAL AGGDPDVQLN LARALRADGQ VSQARDILEQ LLEAHADFSG RQDAEEMLTN LGD
|
| |