Gene Msil_2322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2322 
Symbol 
ID7090306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2517000 
End bp2519807 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content62% 
IMG OID643465645 
ProductTPR repeat-containing protein 
Protein accessionYP_002362615 
Protein GI217978468 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0205826 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGC CAAAATATGA AAGCGCGGCT GTTTCGGGTC AAGGCCATAG TCGCGACAAA 
GCGCTGCTGC AGGTCGCGAT TACGCGGGCG TTCGCCTTTC TGAATTCCGG CCAACCGGAC
GAGGCGCTGG CGGAGCTCGG CGGACACGCC CAGCGGGCGG CGCGGAGCGA TCTCGCCTGT
TATGTCTTCG GGCTGATTTG CTTCAACGCC GGGGATCTGC GCGAGGCGCT GATCTGGTTC
GAGCGCGCGC TCGCCTTGAA GCCCGATTAT TTCGAGGTGC TCAGCGCGCG CGCCATCGTG
CTGCAAAGAC TCGGCCAGCC CGAGGATGCG CTCGAGGCTT TTGAGGACAT CCTGAAACTG
CGCCCGAACG ACGCCGACGC GCTGTTCAGC ATCGGCGTCA TTTTACAGAG TCTTGGCCGC
ATGAACGAAG CGCTCGTCTC CTATGAAGGC GCTTTGCGGG CGCAGCCCAA GCATTGCGAG
GCGTTGACCA ATCGCGGCGC TCTGCTCGAA CGATTTGGCC GTCTTACTGA GGCGCTGTCC
TGTTTCGAAG CGATCATCGC GCTGCGCCCC AACAATGGCG GAGCCCTCTT CAACAAGGGC
TCGGTGCTGC AAAAGCTCGG CCGCAACGAA GACGCGCTCG CCGCCTATGA GGCGGCGGCG
CAATCCGGGC CGCCCGATCC CGAGACCGAG CTCAATCGCG GCAATGTGTT GCAAAAACTC
GGACGGCTCG ATGAAGCGAT CGTCTGCTAC GACCGCGCAG CGCGCCGGCC TGGGGGTTAT
CCGCAGGCGC TCTACAACAA GGGCATTGCT TTGCAGGCGC TGGGCCGGCG GTCGGCCGCC
CTTGCCGCTT ATGACGCGGC TCTCGTGCTC GACCCCCGCT ACTGCGAGGC GATCTGCAAT
CGCGGCAATC TGTTGCATGA ACTCGGCCGG CTCGAGGACG CCTATATGGC CTATGCCGCG
GCGCTGAAAA TCAGGCCCGC GTTCCTGCCG GCGCTGACCA ACCGCGCCAA TATCTGCCTG
CAATGGGGCC GCCTCGACGA AGCGATCCGC CATTGCGACG AGGCGTTGCG GCATGATCCA
AAATACCCGC AGGCGTTGGG CTTGCGCGGC GCGGCGCTGC ACCGCCTTGG GCGGCTCGAG
GAGGCGCTCG TTTCGCTCGA CCTTGCCGTG TCCGTCCGAC CGGCCGCGCC GGAGGCCTGG
CTCAATCGCG GCAACGTCTT GCAGGAGATG GACCGGCTCG CCGACGCCGT CGCCTCCTAC
CATGAGGCGC TCCGGCTTTC CCCTCATTAT CCGGAGGCGC TGTCAAGCCT TGGCGTCGCT
CTGAAGGAAC AGGGGGATGT CGACGAAGCG CTTGCATGTT TCAACGAGGC CATACACTAC
AAGCCAGACT ATCCAGATGC GCGCAACAAC AGGGCCGGAG CGCTGTTGCT GATGGGGAGG
CTGAAAGAAG GTTTTCGCGA CTTTGAAAGT CGTTGGGATC GATCCAACGC GCCGCCGAGA
CCCATCATTC CCGCGGCGGC CCGGTGGACC GGCGAGGATC TGACGGGCAA AAAAATTCTC
GTTTACGATG AACAGGGGCT TGGCGATCTC ATCCAGTTCT GCCGCTACAT TCCCTTGCTT
GAGGAGCGCG GGGCTGAAGT CACCCTGTTG TGCCGCAGGA CCATGCAAAG GCTGCTGCGC
AGCCTGGATT CCCGCGTTCG GATGATCGAC TCCCTGGACC CTCAAGACCG GTATGATTTT
GCATCCGCCT TGCTCAGTCT GCCAGGCGGA TTCGGCGCGG AGCTCGAAAC GATTCCGGCG
CAGACGCCTT ATCTTTTCGC GGAGCCCCAG GCTGTCGCCC AGTGGTCGCA GCGCATCGGC
CCCGAAGGAT TTCGGATCGG CATATGCTGG CGCGGAAATT CCGCGATCAA TTTGAAGCGC
GGCTTTTCCC TGGACTGCCT CGGCCCGATC GCCGCGATCG AGGGCGCGCG CCTGATCGGC
CTGGTCAAGG GCGAAGGGCC GATGGAAATC GAGACGCCGC AGGGATCGGC GCGCATCGAA
GGGCCGGGGC CCGATTATGA CGCAGGGCCG GACGCCTTTA TCGATTGCGC CGCCGTGATG
GAATCTCTGG ATCTCGTCAT CACGTCGGAC ACCGCCATAG CCCATCTCGC CGGCGCGCTT
GGACGGCCTG TGTTCGTCGC CCTGAAACAT GCGCCGGACT GGCGATGGCT GCTGCATCGT
CTAGATTCGC CATGGTATCC GACGATGCGG TTGTTCCGCC AAAAGGAGCG CGATCAATGG
CGGCCCGTTT TCGATGAAAT GGCTGCGGCG GTCGGCGCGC TTGTCCGCGG CGTCGGCAAT
TCTATCCCGC CGCCCGATTT GTCTTCCAGC GATCAGAGCG TCGCGGCAGG ACCGCACGCG
CTCCAACCTG AAGACCCGCC GGCGCTCATC GCCATACCGG CAGGCGTTGG CGAACTCATC
GACAAGATCA CAATTCTTGA GATCAAGGAG CGCCGCGTCG ACGATCCGGC CAAGCTGCAC
AACATACGCT TCGAACTCGC CCTGTTGCGC AAGCTTCGAG ATGAGCACGA TCTGTCGGAC
CCTGCGCTCG CGCGTCTTGA GGCGGAGTTA AGAAAGGCCA ATGAATCTCT GTGGGATGTC
GAAGACGCAT TGCGCTCGTG CGAATCGAAG AACAAATTCG ACGAGGAGTT TGTCTCTCTC
GCGCGACTTG TCTACACCTG CAACGACAAG CGCGCTCATG TGAAGAAAGA GATCAATCTG
TTGTTCAATT CCGCCATTAT CGAGGAGAAA TCCTACGCCC GCGCGTGA
 
Protein sequence
MSVPKYESAA VSGQGHSRDK ALLQVAITRA FAFLNSGQPD EALAELGGHA QRAARSDLAC 
YVFGLICFNA GDLREALIWF ERALALKPDY FEVLSARAIV LQRLGQPEDA LEAFEDILKL
RPNDADALFS IGVILQSLGR MNEALVSYEG ALRAQPKHCE ALTNRGALLE RFGRLTEALS
CFEAIIALRP NNGGALFNKG SVLQKLGRNE DALAAYEAAA QSGPPDPETE LNRGNVLQKL
GRLDEAIVCY DRAARRPGGY PQALYNKGIA LQALGRRSAA LAAYDAALVL DPRYCEAICN
RGNLLHELGR LEDAYMAYAA ALKIRPAFLP ALTNRANICL QWGRLDEAIR HCDEALRHDP
KYPQALGLRG AALHRLGRLE EALVSLDLAV SVRPAAPEAW LNRGNVLQEM DRLADAVASY
HEALRLSPHY PEALSSLGVA LKEQGDVDEA LACFNEAIHY KPDYPDARNN RAGALLLMGR
LKEGFRDFES RWDRSNAPPR PIIPAAARWT GEDLTGKKIL VYDEQGLGDL IQFCRYIPLL
EERGAEVTLL CRRTMQRLLR SLDSRVRMID SLDPQDRYDF ASALLSLPGG FGAELETIPA
QTPYLFAEPQ AVAQWSQRIG PEGFRIGICW RGNSAINLKR GFSLDCLGPI AAIEGARLIG
LVKGEGPMEI ETPQGSARIE GPGPDYDAGP DAFIDCAAVM ESLDLVITSD TAIAHLAGAL
GRPVFVALKH APDWRWLLHR LDSPWYPTMR LFRQKERDQW RPVFDEMAAA VGALVRGVGN
SIPPPDLSSS DQSVAAGPHA LQPEDPPALI AIPAGVGELI DKITILEIKE RRVDDPAKLH
NIRFELALLR KLRDEHDLSD PALARLEAEL RKANESLWDV EDALRSCESK NKFDEEFVSL
ARLVYTCNDK RAHVKKEINL LFNSAIIEEK SYARA