Gene Information |
Locus tag | Haur_4693 |
Symbol | |
ID | 5736540 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5995222 |
End bp | 5996808 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641281857 |
Product | leucine-rich repeat-containing protein |
Protein accession | YP_001547452 |
Protein GI | 159901205 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
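The coordinate and length fields above can be cross-checked with a short calculation. The following is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5995222, 5996808)
print(length, protein_length(length))
```

With the values in this record the calculation gives 1587 bp and 528 aa, which agrees with the Gene Length and Protein Length fields.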
|
|
Plasmid Coverage information |
Number of covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Number of covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGC AACTTTGCCG GGCGCGGATT GCCCAAAATG CCCAAACCCG TGAACCAACC CTCGATCTTT CATCACTCAA CCTCACCAAC CTACCAGAAA CGATTGGCGA ATTAACTCAT CTTGAAGCAT TAAATCTTGC CTGCAATCGT CCGCTGCAAC TACCGCCTGA ACTTGCAAAC CTAACCAAGT TGCGCAAATT GGATCTCAGC TTTCCCCACC AATCTATAAT TCCAGCATGG CTCGATCAAT TAACCAGCCT CGAAGAATTA GATATTCGGG CAAACCCGAC CACTGGCATT CCCGAGGTTT TAACGCGATT ACCACGTTTG CAGAAACTTA ATCTTTATCT TGATGGATTT GAAGCATTGC CGAGCGAATT GCTTAATCTA TCAACGCTAC ACAATATTAC GATTGGTTCA ACAAAACTAA CCAGATTACC TGATTGGTTC AGTGATTTGC GCATCACAGC TTTAGAATAC TATCTCAATT CTATTCCTAA CGAGCACTTA ATTGGGGCGA TAGACTCGCT TCAAGTGCTG GATCTACAGC TTTACTATGG GAAACCTACA GCTTTTCCCG CATGGCTGAG GCAGATGCAT CATTTACGCT GGCTTCGTTA TAACAGCCAA GCACTGGTGC CACCTTGGTT AATCGAATTA CCGCAACTCT CGTATTTGGA AAGCAACGCC GATTTTAGCG AAATCAGCCA AGTTTGGCAT TCATGGGAAT CACTTGAACA GCTCAAGTGT GGTTATGTTG ACGCGACGAC CTTGCCACCA AACCTCAAAA CCTTAGAAAT ACCCATCAGC GGATCAGCAA TTCCTGAATC AATTCGCCAA GTGCGCCAGC TTGAAGTGCT TCATTTATCG GGTAAAGGGT TTCGCGAGTT GCCTGGCTGG GTTTTAGCAT TGCCCAACTT GCACACCTTG GATCTTATCA GCACAGAGAT TGATTACATC CTAGCGCCTG ACCAGCCGAA CAATAGCCTA CGAAAACTTA TGATGCATAC CCTCTACTGT GGTCGCAATC ACCGCTTGGA TGGTCTACGC AGCCTGCATA GGCTGGAAGA ATTAAGCCTG AGTAATCATC GTCTCGGCCA GCTACCCGCA TGGCTCTCCG AATTGCAGCA TTTACGCGAG TTGTCGATTG ATGATTGCGA GTTGACCGAT CTTGATCCAA GTTTGGGACA ACTTCATCAG CTTGAAGCAC TCTATCTTCA TGGCAATGCT ATTCCGGTCG CGAGCTTAGA GTTAATGTTC CCCCGATTAA CCAAACTCCA GCATTTATCA TTTGGGGTCG CAAACGATGA GTCATTTCCC GCTAGCCTTC GCCAATTGCA TCAACTGCGT AGCTTGTATC TGAGAATCGG GCCAGAGCAC AGCATCCCTG AATGGTTGAA TCAATTGACC AAACTCGAAA GCATTATGCT TGGCTATAAC ATTCAGCCAA CACAGATCCC TTGGATCGAA GGCTGGCTGG CACTGCCGAA ATTACGCGAG ATCGATATTC ATATCAAGCC AGAATTGTTT GATCCTGAGT TACTACAACG CTTTACCCAA CGCGGCGTAA AAGTCAATCT TGGCTAA
|
Protein sequence | MSEQLCRARI AQNAQTREPT LDLSSLNLTN LPETIGELTH LEALNLACNR PLQLPPELAN LTKLRKLDLS FPHQSIIPAW LDQLTSLEEL DIRANPTTGI PEVLTRLPRL QKLNLYLDGF EALPSELLNL STLHNITIGS TKLTRLPDWF SDLRITALEY YLNSIPNEHL IGAIDSLQVL DLQLYYGKPT AFPAWLRQMH HLRWLRYNSQ ALVPPWLIEL PQLSYLESNA DFSEISQVWH SWESLEQLKC GYVDATTLPP NLKTLEIPIS GSAIPESIRQ VRQLEVLHLS GKGFRELPGW VLALPNLHTL DLISTEIDYI LAPDQPNNSL RKLMMHTLYC GRNHRLDGLR SLHRLEELSL SNHRLGQLPA WLSELQHLRE LSIDDCELTD LDPSLGQLHQ LEALYLHGNA IPVASLELMF PRLTKLQHLS FGVANDESFP ASLRQLHQLR SLYLRIGPEH SIPEWLNQLT KLESIMLGYN IQPTQIPWIE GWLALPKLRE IDIHIKPELF DPELLQRFTQ RGVKVNLG
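The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and sanity-checked against the GC content and Translation table fields. All function names are hypothetical. The stop codons are those of translation table 11; the start codon set is an assumption that covers only the most common bacterial starts.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons in translation table 11
START_CODONS = {"ATG", "GTG", "TTG"}   # assumption: common bacterial starts only


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Only the first two blocks of the gene sequence are used here for brevity.
excerpt = clean_sequence("ATGAGTGAGC AACTTTGCCG")
print(excerpt, gc_fraction(excerpt))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field, and `looks_like_complete_cds` can confirm that the sequence begins with a start codon and ends with a stop codon.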