Gene Information |
Locus tag | Msil_2664 |
Symbol | |
ID | 7091133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2912062 |
End bp | 2915238 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643465978 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_002362948 |
Protein GI | 217978801 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
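The coordinate and length fields in the Gene Information table are consistent with one another, and the relationship can be checked in a few lines of Python. This is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record for Msil_2664
length = gene_length_bp(2912062, 2915238)
print(length)                     # 3177
print(protein_length_aa(length))  # 1058
```

Both results match the Gene Length and Protein Length fields reported in the table.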
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCAA CTTCAGTTGG AACGTCGAAC AAGACGCGCG CCGTCGCCCC GCCCCGCCGC CAGCGCCTTT CGGCTCGGTT GCGCGCTTTG CACGGCCCCT TCGCCACGAT GACCGCGATC GCGGCCTTGC TGGCGCTGCC GCGTCCCGGG CTTGCGGGAC CTTCGGGCGG CGTGGTGATC GATGGCTCGG CGAACATCGG CCAGGCCGGC AGCGTCACCA ACATCAATCA ATCCTCCAAC CGGGCCATTA TCAATTGGCA AGGCTTCTCC ATCGGCTACG GGGAGACCGT CAACTTCAAT CAACCGGGCG CGTGGGGGGT CACGCTCAAC CGCGTCGTCG GCAATGAGGC GAGCATTATC TCGGGCGCCC TCAACGCCAA TGGCCAGGTC TTCCTCGTCA ATTCCGCCGG CGTCCTCTTC AGCAAGAGCG CGCAGGTCAA TGTCGGCGGG CTCGTCGCCT CGACGCTCGA TATTTCCAAC GCGAATTTCA TGGCGGGGAA CTACGTCTTC TCCGGCTCGT CGGCGGCGTC CGTGGTCAAC CAGGGCTCCA TCCATGCGAG CGATGGCGGC TATGTCGCGC TGCTCGGCAA GACCGTCTCG AACGAGGGCG TGATCACCGC GACGCTCGGC GCGGTGGCGA TGGCCTCGGG CAACAAGATC ACCCTCAATT TCGCCGGCGA CTCGCTTTTG GACGTCACGA TCGACGAGGG AACTCTCAAT GCGCTGGTCG AGAACAAGCG GGCGATCCGG GCGGACGGCG GCCGGGTCAT TCTGACGGCG AAGGCGGCCG ACGCCGTCTT GTCGGCGCAG GTCAACAATA CGGGGATCAT CCAGGCCCGC ACCATGGCGG CGCTGAAGGG CGGCGCAACC ACACGTGGAA CGGCGCGGAT CGGCTCGATC AAGCTGATCG CCTCCGGCGG CACGGTGAGG GTGGGCGGCA CGCTCGACGC CTCGGCGCCC AGGGGCGGCA AGGGCGGCAA GATCGAAACC AGCGGCGAGA AGGTCAAAAT TGCCGATAGC GCCTTCGTCA CGACCAAAGC GTTGAGCACG GCGAACGGAA GCTGGCTGAT CGACCCTGAC GGCTTCTCCA TTGCGGCGAC GGGCGGCGAC ATCACCGGCG CGCGGCTCGG CGCCTTGCTC GACGAAAACA ACATCACCTT GCAATCGACG AGCGGCAAAG GCAAAAGCGG CAATATCAGC GTCAACGACG CCGTGAGCTG GGCGGCCAAC ACGCTGCTGA CGCTGGACGC CACCAAAACC ATCCATATCA ACGCGCCCAT TACCGCCACC GGCGACGGCG CCGGGCTTGT GCTGAATTAT GGCGGCTACG CGACGACAGG GAGCGCCGCG GCCGGGACCG ATTATCGAGT CAACAGGAAC GGCGGCGCTT CGATCACGCT CAGCGGCGCC AACGCCAGCC TGAGCATCAA CGGAAACGCC TACACGCTGA TCCACAGCAT GGCGGATCTG ACGGCGATCA CGCCCCTTCT CTTCGACGCC AACGGCAATC CCGTGTACGA CCCGGACACA CTGCTCCAGG CCTATGGGCC CGGGGGGACG GGCTATTATG CGCTGGCGCA AAACCTCGAC GCCGCGGGCG TCACCTATAA CGGCCCCTTG ATCTCCAACC TCTCCGGCGT CTTCGCAGGG CTCGGCCATA CGATCAAAAA CCTCAAGATC GACTCCACGG CGCCGAACGC GAACGGTCAA TACGCCGACG CCGTTTTAAT CGACCAGATC GGGGAAGGCC GCAACTCCCC CGCCGTGGTC CGCGACATCA AGCTGGCGAA TGTCAATATC 
GCGGGCTTCC AGGAGGCAGC CGGCCTGGCC GGCAATAACC TCGGTACGAT CAGCAACGCC TATGTGTCGG GAAAAGTGCG GGTGACGGAC GGCGGCTCCG CCGCCGGGCT GGCCGCAAAC AATGGCGGAC TGATCACGAA CGCCCACACC AATGTCGCCG TGACGGCCAC ACATGGCGGA TCCAACGTTG GCGGGCTGGT CGGATTCAAT TCCAGCAGGG GCGTCATTCG CTATTCTTCC GCCGATGGGT CCGTGCGGGC CGCCGGGTAC AGCACCTCCC CGGACAGCGG CCTTCTTTTC AGCAGCGGCA TCGGCGGCCT TGTCGGAACC AATATCGGAA CCATCGCCTA TTCGAACGCC AATGTGACGG TGACGACAAA AGACAGCGTC AACGTCGGCG GTCTCGTCGG CGTCAATTAC AACTTCAATA CGCCGGGGGA CGGCACGGCA GGCGTCATCA TCAACTCCTC CGCAACAGGC AATGTGACGG CGAACTACAC CAGCAGCCAG TTGCTCGGTC AGCCTGGATT TGGCGTCGGC GGATTGGTCG GGAGCAACAG CGGAGGCACG ATCACGGGCG GCTTCGCCAG CGGGGACGTC AAAGTGCACG CGACGGCCGC TTCCGGCGCC TATGATATCG GCGGACTGGT CGGATATAAT GAATTCGGCA CGATCTCCCA TTCATCCGCG ACCGGCGATG TTTCGGGCAT AGGTAAAAAT GTTTCCGAAA TCGGCGCCCT GGTCGGAATG AATCTGGACG ACATCGCAAA TGGCTTTGGC GGCATCGATC ATTCCACCGC CAGTGGAACC GTGACAGGAA ACGACGCTGG CGGACTGTTC GGAGCTGGAA ACCTGCAGGC GGTATCGGAC TCCGTCTTCA CCGGATCTGT GAATGGCGTC GGACCGGCCG CGGACGCCGC CGCGGCGGCG CAAGCCAGAG CGCAGACCGC TGCGGAGCAG GCCACGGCGG ACGAAGACGC TCGGCAGATC GCAGCGATCG CTCAAGCGGC GGCGAGCTCA GCAACGGTCG TCGCGACGAC GGACGCGGAA GAGTCAGCGA CGCCTCCGAA CCCGGTGAAG GCGACCGCGG CGGGCAAGCG CGCGACGGCG GCGATCGCGG GACCGAAGAC CGAGGATAAT GTCAAGGTCG AGCAGCCCGC GCCGCGCGTC GCATCGACCG AGGAGACGTC CGCCGCGTCA AGCGAACCGG CGCCCAGCCA CCGCAAGGCG GAGACCAGGA CAGCGCAGAA ATCCGCTGTC AAAGGCAAGG GCGCGGGGTT CGGCGCCGCG ATCCGCAGCA TCGACATCGA TGGTCAGCAT TACGATCTAC AGGACGACGC CTCGAAGAAA AACGCGCCGG GCCGGAAAGT CCAATAG
|
Protein sequence | MPSTSVGTSN KTRAVAPPRR QRLSARLRAL HGPFATMTAI AALLALPRPG LAGPSGGVVI DGSANIGQAG SVTNINQSSN RAIINWQGFS IGYGETVNFN QPGAWGVTLN RVVGNEASII SGALNANGQV FLVNSAGVLF SKSAQVNVGG LVASTLDISN ANFMAGNYVF SGSSAASVVN QGSIHASDGG YVALLGKTVS NEGVITATLG AVAMASGNKI TLNFAGDSLL DVTIDEGTLN ALVENKRAIR ADGGRVILTA KAADAVLSAQ VNNTGIIQAR TMAALKGGAT TRGTARIGSI KLIASGGTVR VGGTLDASAP RGGKGGKIET SGEKVKIADS AFVTTKALST ANGSWLIDPD GFSIAATGGD ITGARLGALL DENNITLQST SGKGKSGNIS VNDAVSWAAN TLLTLDATKT IHINAPITAT GDGAGLVLNY GGYATTGSAA AGTDYRVNRN GGASITLSGA NASLSINGNA YTLIHSMADL TAITPLLFDA NGNPVYDPDT LLQAYGPGGT GYYALAQNLD AAGVTYNGPL ISNLSGVFAG LGHTIKNLKI DSTAPNANGQ YADAVLIDQI GEGRNSPAVV RDIKLANVNI AGFQEAAGLA GNNLGTISNA YVSGKVRVTD GGSAAGLAAN NGGLITNAHT NVAVTATHGG SNVGGLVGFN SSRGVIRYSS ADGSVRAAGY STSPDSGLLF SSGIGGLVGT NIGTIAYSNA NVTVTTKDSV NVGGLVGVNY NFNTPGDGTA GVIINSSATG NVTANYTSSQ LLGQPGFGVG GLVGSNSGGT ITGGFASGDV KVHATAASGA YDIGGLVGYN EFGTISHSSA TGDVSGIGKN VSEIGALVGM NLDDIANGFG GIDHSTASGT VTGNDAGGLF GAGNLQAVSD SVFTGSVNGV GPAADAAAAA QARAQTAAEQ ATADEDARQI AAIAQAAASS ATVVATTDAE ESATPPNPVK ATAAGKRATA AIAGPKTEDN VKVEQPAPRV ASTEETSAAS SEPAPSHRKA ETRTAQKSAV KGKGAGFGAA IRSIDIDGQH YDLQDDASKK NAPGRKVQ
|
| |
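The gene and protein sequences above can be cross-checked by translating the DNA. The sketch below is a hedged illustration rather than the database's own method. It translates the first three and the last three ten-base blocks of the gene sequence, copied verbatim from the record, and compares them with the start and end of the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in the record
head = "ATGCCCTCAA CTTCAGTTGG AACGTCGAAC"
tail = "AACGCGCCGG GCCGGAAAGT CCAATAG"

print(translate(head))  # MPSTSVGTSN
print(translate(tail))  # NAPGRKVQ
```

The two outputs agree with the first ten and last eight residues of the protein sequence. The tail fragment is in frame because the full gene is 3177 bp long and the fragment ends at the final base.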