Gene Msil_2664 details


Gene Information

Locus tag: Msil_2664
Symbol:
ID: 7091133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2912062
End bp: 2915238
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 64%
IMG OID: 643465978
Product: filamentous hemeagglutinin family outer membrane protein
Protein accession: YP_002362948
Protein GI: 217978801
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
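
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2912062
end_bp = 2915238

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the remainder should be zero.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the computed values agree with the listed Gene Length (3177 bp) and Protein Length (1058 aa).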


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTCAA CTTCAGTTGG AACGTCGAAC AAGACGCGCG CCGTCGCCCC GCCCCGCCGC 
CAGCGCCTTT CGGCTCGGTT GCGCGCTTTG CACGGCCCCT TCGCCACGAT GACCGCGATC
GCGGCCTTGC TGGCGCTGCC GCGTCCCGGG CTTGCGGGAC CTTCGGGCGG CGTGGTGATC
GATGGCTCGG CGAACATCGG CCAGGCCGGC AGCGTCACCA ACATCAATCA ATCCTCCAAC
CGGGCCATTA TCAATTGGCA AGGCTTCTCC ATCGGCTACG GGGAGACCGT CAACTTCAAT
CAACCGGGCG CGTGGGGGGT CACGCTCAAC CGCGTCGTCG GCAATGAGGC GAGCATTATC
TCGGGCGCCC TCAACGCCAA TGGCCAGGTC TTCCTCGTCA ATTCCGCCGG CGTCCTCTTC
AGCAAGAGCG CGCAGGTCAA TGTCGGCGGG CTCGTCGCCT CGACGCTCGA TATTTCCAAC
GCGAATTTCA TGGCGGGGAA CTACGTCTTC TCCGGCTCGT CGGCGGCGTC CGTGGTCAAC
CAGGGCTCCA TCCATGCGAG CGATGGCGGC TATGTCGCGC TGCTCGGCAA GACCGTCTCG
AACGAGGGCG TGATCACCGC GACGCTCGGC GCGGTGGCGA TGGCCTCGGG CAACAAGATC
ACCCTCAATT TCGCCGGCGA CTCGCTTTTG GACGTCACGA TCGACGAGGG AACTCTCAAT
GCGCTGGTCG AGAACAAGCG GGCGATCCGG GCGGACGGCG GCCGGGTCAT TCTGACGGCG
AAGGCGGCCG ACGCCGTCTT GTCGGCGCAG GTCAACAATA CGGGGATCAT CCAGGCCCGC
ACCATGGCGG CGCTGAAGGG CGGCGCAACC ACACGTGGAA CGGCGCGGAT CGGCTCGATC
AAGCTGATCG CCTCCGGCGG CACGGTGAGG GTGGGCGGCA CGCTCGACGC CTCGGCGCCC
AGGGGCGGCA AGGGCGGCAA GATCGAAACC AGCGGCGAGA AGGTCAAAAT TGCCGATAGC
GCCTTCGTCA CGACCAAAGC GTTGAGCACG GCGAACGGAA GCTGGCTGAT CGACCCTGAC
GGCTTCTCCA TTGCGGCGAC GGGCGGCGAC ATCACCGGCG CGCGGCTCGG CGCCTTGCTC
GACGAAAACA ACATCACCTT GCAATCGACG AGCGGCAAAG GCAAAAGCGG CAATATCAGC
GTCAACGACG CCGTGAGCTG GGCGGCCAAC ACGCTGCTGA CGCTGGACGC CACCAAAACC
ATCCATATCA ACGCGCCCAT TACCGCCACC GGCGACGGCG CCGGGCTTGT GCTGAATTAT
GGCGGCTACG CGACGACAGG GAGCGCCGCG GCCGGGACCG ATTATCGAGT CAACAGGAAC
GGCGGCGCTT CGATCACGCT CAGCGGCGCC AACGCCAGCC TGAGCATCAA CGGAAACGCC
TACACGCTGA TCCACAGCAT GGCGGATCTG ACGGCGATCA CGCCCCTTCT CTTCGACGCC
AACGGCAATC CCGTGTACGA CCCGGACACA CTGCTCCAGG CCTATGGGCC CGGGGGGACG
GGCTATTATG CGCTGGCGCA AAACCTCGAC GCCGCGGGCG TCACCTATAA CGGCCCCTTG
ATCTCCAACC TCTCCGGCGT CTTCGCAGGG CTCGGCCATA CGATCAAAAA CCTCAAGATC
GACTCCACGG CGCCGAACGC GAACGGTCAA TACGCCGACG CCGTTTTAAT CGACCAGATC
GGGGAAGGCC GCAACTCCCC CGCCGTGGTC CGCGACATCA AGCTGGCGAA TGTCAATATC
GCGGGCTTCC AGGAGGCAGC CGGCCTGGCC GGCAATAACC TCGGTACGAT CAGCAACGCC
TATGTGTCGG GAAAAGTGCG GGTGACGGAC GGCGGCTCCG CCGCCGGGCT GGCCGCAAAC
AATGGCGGAC TGATCACGAA CGCCCACACC AATGTCGCCG TGACGGCCAC ACATGGCGGA
TCCAACGTTG GCGGGCTGGT CGGATTCAAT TCCAGCAGGG GCGTCATTCG CTATTCTTCC
GCCGATGGGT CCGTGCGGGC CGCCGGGTAC AGCACCTCCC CGGACAGCGG CCTTCTTTTC
AGCAGCGGCA TCGGCGGCCT TGTCGGAACC AATATCGGAA CCATCGCCTA TTCGAACGCC
AATGTGACGG TGACGACAAA AGACAGCGTC AACGTCGGCG GTCTCGTCGG CGTCAATTAC
AACTTCAATA CGCCGGGGGA CGGCACGGCA GGCGTCATCA TCAACTCCTC CGCAACAGGC
AATGTGACGG CGAACTACAC CAGCAGCCAG TTGCTCGGTC AGCCTGGATT TGGCGTCGGC
GGATTGGTCG GGAGCAACAG CGGAGGCACG ATCACGGGCG GCTTCGCCAG CGGGGACGTC
AAAGTGCACG CGACGGCCGC TTCCGGCGCC TATGATATCG GCGGACTGGT CGGATATAAT
GAATTCGGCA CGATCTCCCA TTCATCCGCG ACCGGCGATG TTTCGGGCAT AGGTAAAAAT
GTTTCCGAAA TCGGCGCCCT GGTCGGAATG AATCTGGACG ACATCGCAAA TGGCTTTGGC
GGCATCGATC ATTCCACCGC CAGTGGAACC GTGACAGGAA ACGACGCTGG CGGACTGTTC
GGAGCTGGAA ACCTGCAGGC GGTATCGGAC TCCGTCTTCA CCGGATCTGT GAATGGCGTC
GGACCGGCCG CGGACGCCGC CGCGGCGGCG CAAGCCAGAG CGCAGACCGC TGCGGAGCAG
GCCACGGCGG ACGAAGACGC TCGGCAGATC GCAGCGATCG CTCAAGCGGC GGCGAGCTCA
GCAACGGTCG TCGCGACGAC GGACGCGGAA GAGTCAGCGA CGCCTCCGAA CCCGGTGAAG
GCGACCGCGG CGGGCAAGCG CGCGACGGCG GCGATCGCGG GACCGAAGAC CGAGGATAAT
GTCAAGGTCG AGCAGCCCGC GCCGCGCGTC GCATCGACCG AGGAGACGTC CGCCGCGTCA
AGCGAACCGG CGCCCAGCCA CCGCAAGGCG GAGACCAGGA CAGCGCAGAA ATCCGCTGTC
AAAGGCAAGG GCGCGGGGTT CGGCGCCGCG ATCCGCAGCA TCGACATCGA TGGTCAGCAT
TACGATCTAC AGGACGACGC CTCGAAGAAA AACGCGCCGG GCCGGAAAGT CCAATAG
 
Protein sequence
MPSTSVGTSN KTRAVAPPRR QRLSARLRAL HGPFATMTAI AALLALPRPG LAGPSGGVVI 
DGSANIGQAG SVTNINQSSN RAIINWQGFS IGYGETVNFN QPGAWGVTLN RVVGNEASII
SGALNANGQV FLVNSAGVLF SKSAQVNVGG LVASTLDISN ANFMAGNYVF SGSSAASVVN
QGSIHASDGG YVALLGKTVS NEGVITATLG AVAMASGNKI TLNFAGDSLL DVTIDEGTLN
ALVENKRAIR ADGGRVILTA KAADAVLSAQ VNNTGIIQAR TMAALKGGAT TRGTARIGSI
KLIASGGTVR VGGTLDASAP RGGKGGKIET SGEKVKIADS AFVTTKALST ANGSWLIDPD
GFSIAATGGD ITGARLGALL DENNITLQST SGKGKSGNIS VNDAVSWAAN TLLTLDATKT
IHINAPITAT GDGAGLVLNY GGYATTGSAA AGTDYRVNRN GGASITLSGA NASLSINGNA
YTLIHSMADL TAITPLLFDA NGNPVYDPDT LLQAYGPGGT GYYALAQNLD AAGVTYNGPL
ISNLSGVFAG LGHTIKNLKI DSTAPNANGQ YADAVLIDQI GEGRNSPAVV RDIKLANVNI
AGFQEAAGLA GNNLGTISNA YVSGKVRVTD GGSAAGLAAN NGGLITNAHT NVAVTATHGG
SNVGGLVGFN SSRGVIRYSS ADGSVRAAGY STSPDSGLLF SSGIGGLVGT NIGTIAYSNA
NVTVTTKDSV NVGGLVGVNY NFNTPGDGTA GVIINSSATG NVTANYTSSQ LLGQPGFGVG
GLVGSNSGGT ITGGFASGDV KVHATAASGA YDIGGLVGYN EFGTISHSSA TGDVSGIGKN
VSEIGALVGM NLDDIANGFG GIDHSTASGT VTGNDAGGLF GAGNLQAVSD SVFTGSVNGV
GPAADAAAAA QARAQTAAEQ ATADEDARQI AAIAQAAASS ATVVATTDAE ESATPPNPVK
ATAAGKRATA AIAGPKTEDN VKVEQPAPRV ASTEETSAAS SEPAPSHRKA ETRTAQKSAV
KGKGAGFGAA IRSIDIDGQH YDLQDDASKK NAPGRKVQ
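
Both sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below is one hedged way to do that and to spot-check the translation; it is not the method used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons.

```python
# Codon table built from the compact standard layout (base order T, C, A, G).
# Translation table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCTCAA CTTCAGTTGG AACGTCGAAC AAGACGCGCG CCGTCGCCCC GCCCCGCCGC"
last_line = "TACGATCTAC AGGACGACGC CTCGAAGAAA AACGCGCCGG GCCGGAAAGT CCAATAG"

print(translate(first_line))  # MPSTSVGTSNKTRAVAPPRR
print(translate(last_line))   # YDLQDDASKKNAPGRKVQ
```

The two printed strings match the start and end of the listed protein sequence. Passing the whole gene sequence to `gc_fraction` would give a value to compare with the listed GC content.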