Gene Information |
Locus tag | Lcho_4093 |
Symbol | |
ID | 6159941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | - |
Start bp | 4588782 |
End bp | 4591622 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641666871 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001793110 |
Protein GI | 171060761 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000000114159 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTGCTGGC CTGTCATCCC GAATTCCCGC ACCGGCCCCC CACCGGCACG CGTCGCCGCG CTCGCGCTGG CCGTGCTGGC GCTGCTCGGC GCACCCGCCG TCCGCGCCCA GACCAACCTG CCGGTGCCGG CGACCACCGC GCCGCAGAAC GCCACCATCA GCACGCCCGC CGTCGGTCAG ATGGTGATCG TGCAGAACGA GAGCGCGCCA CGCGCCTACA TCGAGTGGCG CGATTTCTCG ATCGGCGCGC AGGCCGGCGT GACGGTGCAG CAGCTCAACG GCAGCTCGGT GCTGCTCAAC CGCGTGGTGG GCGTGGGCGG CGCGCCGCCG ATCAGCCGCA TCGACGGCAC GCTCGGCGCC ACCGGCCAGG TCTTCATCCT CAACCCGGCC GGCATCGTGT TCGGCGCCGG CGCGCAGGTG CAGGTCGGCA GCCTGCTGGC GGCGGCGCTG GACACCTCGA CCGGCGGCGC CGACTTCCTG TCCGGCGCGC CGCTGGTGCT CGATGCCTCG CTCGGCGGCG GCGCGCTGAG CGTCGCGCCG GGCGCCCGCA TCGCTGCGGC GGCGGGCAAC CCCAACCTGC CCGGGGGCGA GATCGTGCTG ATCGGCGCCG GTGGCGCAGC CGGGGCGGGC GGCCTGACGA TGGACGGCCG CGTCGTCGCC TCGGGCGGCC AGGTGCTGCT GGCGGTGAGC GACGGCACCA GCGTGCCGGC CGCCGGCATC CCGGTCGGCG CCAGCGGCTT CATCACGCTG CAGCTGACCC CGGCCGCGGC CGGCACGCTG GCGATGCACG GCAGCGTCGA CGTGGCCGGC GCCGTCAGCA CGCCCAGCGG CAACCTGCGC ATCGAGGCGC CGACGGTGCA GATCGGCCGC CGCGCGGGCC TGGCCGCCAC CTCGGTGTCC TTGAGTGGCG CCAGCGTCAC CGCGCTCGGC AGCGCCGGCC CGGGCAGCCT CTCGATCAGC GGCAGCGCCG ACGGCGGCTT CGGCATCGAC CTGAACAACG TCACGCTCAG CGGCGACCGC ATCGAACTGC GTGGCACCTC GCTGCTGACC GATGGCAATG CCGACCTGCT GCCGATCGGC GTGCGCCTGA ACGGGGTGTC GATCGACATC GGCGACGGCA GCCTGCTGAT CGCCGGGCGC GGCGAGGTGG CGTCGACCTT GTCGCCCGAC ACGTCCCCGG CGTTCGGCGT GGGCCTGTCG GATCTGTTCG TCACCAGCAA CGCCAACGCC GGGCGCTCGG TCACGATCGT CGGCGAGGCG GTCAACTCAT CGACCGGCGC GGGCATCGGC GCGGTCAACG ACGGCTTCTT CATCATCGCC AGCAACAGCG AGGTCGACCC GTCGGCCGCG AACGTGGTGC TGGCCGGGTA TGCCGGCGCG CAGGGTCGCG CCTACGACCT GTTCGGCGCG CCCGACGTGT TCACCACCGG GCGCGTCAAC CTGCGCCCCG CCGGTGTCGA CGCCGACGGC ATCGTGCAGG AACGCCCGGC GGTGCCGATC ACGCTCGGCG GTGCCGTCTC GAGCGTGCCG CTCGGCTTCA ACCTGCCGTC GGCCTGGCTG CTCGATCCGC GCATCAACCC GGGTGGCAAC ATCGAGAGCG CCGGCATCGT GGTCGGCTCG AGCGGCCATG TCGGCGCGAT CGCGGTGGCG GCCGACGCCC TGCAGGCCGG CATCACGCCC GAGCTGACGC TGCACAACGG CGGCGCCGGC GCGCAGGGCA TCCAGCTCGA CGGCGGCCTG ACGACGACCG GCGCGGTGCG CCAGCTCACG CTGGTCAGCG CCGGTGCGGT GACGCAGACC 
GGCCCGGTCG CGGTGCAGCA GCTGCTGCTC GCCGGCACGG GGGCGGATGC CACGGTGCAG CTGCTCGACC CCGGCAACAC GATCGACCAG ATCGGCTTCA CCGGCCTGCG CAGCGTGCAG GTGGCGAGCG CCGGGCCGCT GTCGGTGGCG GGTGGCAGCG TGGCGGCCTA CGACAGCGCG AGCGGCAGCT TCACCCCGCA GGCCTTCACC ACCAGCCTGG CCAGCGATCG CGTGCTGCTG CGTGCCGACG ACGGCGACCT GACGCTGCAG CAGGGCATTC GCGCCAGCGC GGCCGGTGGC CAGATCGACC TGGTGGCCGG CGTGCGGTTC CAGAACCCGG CCAACGCCAC GCTGGAGGTC GGCGCGGGCG GGCGCTGGCG GGTCTGGTCC GACAGCTGGG TCGACAGCCA GCGCGGCGAG CTGCCCGGTC GCGCCAGCCT GCCCACGCTC TACGGTTGCA GCTACGGCGA CGCCAGCCTG TGTTCGGTGT CGCAGATCGC CTTGCCCGAG GCCGGCAGCG GCTTTCTCTA CAGCGCGCAG CCGGACCTCG TGATCGTGCC CGACCCGGTC AGCGCGGCCC AGGGCACCTT CCTGCCGCCG ATCCCCTACA GCGCGACCGG CCTGGTCAAC GGCGACGTGC AGCCGCAGGC CGTCACCGGC CAGCTGGCCA GCAGCGCCGG CCTGCTCAGC CCGCCGGGGC TCTATCCGGC GACGGCGGGC ACGCTGCAGT CGCCCACCGG CTACCGGCTC AGCCTGGCGT CGGGCACGGT GCTGCCGGTG CGCATCACGC CGCCCGACCC GCTGCGCAGC GCCTATCCCG ACGGCATGGC GCAGCTGCTG GCGGCGTCGG CCTCCGAGAC CCACGGCCGC AACCTCGCCA CGCCACGCAT GTGCCTGGCC AGCGGCCCGG CCCGCACGGC GGCGATCGAC GACGCCACGG CCGATCTGCT CGGCCTGGAA TGGGGCCGCG TGCGCCAGCA GCCGCAGTTC TCGAGCTGCC TTGAGCTCGA TCGTGGTGGC GGCGCCTGTG CCGGTTTCTA G
|
Protein sequence | MCWPVIPNSR TGPPPARVAA LALAVLALLG APAVRAQTNL PVPATTAPQN ATISTPAVGQ MVIVQNESAP RAYIEWRDFS IGAQAGVTVQ QLNGSSVLLN RVVGVGGAPP ISRIDGTLGA TGQVFILNPA GIVFGAGAQV QVGSLLAAAL DTSTGGADFL SGAPLVLDAS LGGGALSVAP GARIAAAAGN PNLPGGEIVL IGAGGAAGAG GLTMDGRVVA SGGQVLLAVS DGTSVPAAGI PVGASGFITL QLTPAAAGTL AMHGSVDVAG AVSTPSGNLR IEAPTVQIGR RAGLAATSVS LSGASVTALG SAGPGSLSIS GSADGGFGID LNNVTLSGDR IELRGTSLLT DGNADLLPIG VRLNGVSIDI GDGSLLIAGR GEVASTLSPD TSPAFGVGLS DLFVTSNANA GRSVTIVGEA VNSSTGAGIG AVNDGFFIIA SNSEVDPSAA NVVLAGYAGA QGRAYDLFGA PDVFTTGRVN LRPAGVDADG IVQERPAVPI TLGGAVSSVP LGFNLPSAWL LDPRINPGGN IESAGIVVGS SGHVGAIAVA ADALQAGITP ELTLHNGGAG AQGIQLDGGL TTTGAVRQLT LVSAGAVTQT GPVAVQQLLL AGTGADATVQ LLDPGNTIDQ IGFTGLRSVQ VASAGPLSVA GGSVAAYDSA SGSFTPQAFT TSLASDRVLL RADDGDLTLQ QGIRASAAGG QIDLVAGVRF QNPANATLEV GAGGRWRVWS DSWVDSQRGE LPGRASLPTL YGCSYGDASL CSVSQIALPE AGSGFLYSAQ PDLVIVPDPV SAAQGTFLPP IPYSATGLVN GDVQPQAVTG QLASSAGLLS PPGLYPATAG TLQSPTGYRL SLASGTVLPV RITPPDPLRS AYPDGMAQLL AASASETHGR NLATPRMCLA SGPARTAAID DATADLLGLE WGRVRQQPQF SSCLELDRGG GACAGF
|
| |
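The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon (the listed gene sequence ends in TAG). The function names `gene_length_bp`, `protein_length_aa`, and `gc_content_percent` are hypothetical helpers introduced here for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
length = gene_length_bp(4588782, 4591622)
print(length)                     # 2841
print(protein_length_aa(length))  # 946
```

Both results match the Gene Length (2841 bp) and Protein Length (946 aa) fields. `gc_content_percent` can be applied to the Gene sequence field to compare against the listed GC content; it accepts the sequence with its 10-base spacing left in place.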