Gene Mlab_0678 details


Gene Information

Locus tag: Mlab_0678
Symbol:
ID: 4794472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 648537
End bp: 651758
Gene Length: 3222 bp
Protein Length: 1073 aa
Translation table: 11
GC content: 52%
IMG OID: 640099338
Product: extracellular ligand-binding receptor
Protein accession: YP_001030117
Protein GI: 124485501
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF; [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
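
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 648537
END_BP = 651758


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 3222 1073
```

Under those assumptions the computed values match the listed gene length (3222 bp) and protein length (1073 aa).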


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGATA TTGGCGGTCT TTTCGGCAAA AACGACAAAC AGAATCCGTG GATAGCACGC 
GGGGAACAGG CATACGAAAA AGGAATGTAC GACGACGCAG CCCAGCATTT CACAAAAGCT
CTGGAACTAG AGCCGGGTTC GGCAAAGCTC TGGACAAAAC TTGCCGCCTG CCAGAAATTT
ACTCAAAAAT ACGATGCAGC CTCCAGATCA TACGCCAAAG TTGTGGAACT CAACCCGGAT
GATTTCCATG CCTGGACCTC GCTCGCGGTT ATTCTGGGAG ATTCTGGAAA ATACGAAGAG
GCACTCTCTG CCCTCGGCCA TGTCGTTCTC CCGGAAAGCG AGGTATACCT CAAAGACAGA
AAATGCGAGT GGCTTTTGCG TTCCGGAAAA TACCGGGAAG CTGCCGCCGT CTGCTCGCAG
CTCATCTCCC ATTTCCCGGG AAATATCCAA TACCGGATCC GTTATGCCGA TCTTTTGATG
CGAAGCGGAA TGTTTGCCGA AGCACGATCG GTTTACGACG ATCTCTCCTC CTCTCTGGGG
CAGACGGACC TGATCTCCAA CGCCGCATTC TGCAGCGAGA TGACGGGAGA TGTCGAAGGA
GCGCTGAAAC GGTATGCCCA TCTTGCCGAA AACAACATTG TCGGCTGGTA CCGCAGAGCC
CGGCTCGAAG AGAGTCTTGG AAACTTTAAA AGCGCCGCCG CAGCATACGG TATGATCCAG
CATTATGCGA CCGGCGATGA TATTACCGTC ACAATCCGCC GGGCTTTATC ATTTTACTGG
GACGGAAACG GGAAAGAAGC CGCGGTCCAG CTGGAAAAGA TCCTGGCAAA AGGCTACGCA
AACGCTGAAC TCTGGCATCT GCTTGGTACA ATCTCGTTTT TGAACGGAGA ATTCAAACGT
GCCGTCGAAG CATTCGTCGA ATGCATTCAT CTGAACCAGA CGAATTCTGC GGTTTGGTAT
ATGAAAGGAT GTGCCGAGTA TCTCTCCGGA AAATACAAAG AGGCAGTCGA AAGTTTTGAG
AAGATGGGAA AAATTGGAAC CGCTGGACCA TCGCCTGCGA AAATGAAATG GTTCGAGGAC
AGCGACCTTG ATCTGTTCGA TAACGCTCCG GCCCAGAAAG AACAGATTAA AGTCGAGGTC
GGCGTCGTGA ATGAAGGGCT TTTGGCTATG CAGGGATATG CCCTTGCGGC ACTCGGCCGA
TACGCAGACG CAGACAAAGC CGCCGGCGTT GTTTTAAGCC ATGCTCCCTC AAGGCTCGAC
ATGGAACTTC TGCACAGTAG GTGTCTTGCC GGACTTGGGC GTTATCAGTA TGCTGGAGAC
GCGGCAGCAC GCGTACTTGC AAAATCTCCC GATAACATCG CGGCTTTGGA ACGGCACGCC
GAGTCCATGA TGCTTGCAGG GAAATATAAG GAAGCCGCCG GATCATGGCA AAAACTCGTT
GAAGCGGCGC CGGAAAACAC GCTTGCCTAT ATCGGCCTCC TCAAAGCATG CAGCGGGACC
GGGAAATATG CTGAGGCGGA GCAGCTTGCA GGCCATCTGT TAAAAGAGTA TCAGCCCCGC
GATCTTACAA TCACCCTGCT TGCGGGAGAT GCGGCCCTTT CCGCCGGAAA TTACGAAGAG
GCCGTTACCC ATTACCAAAA TGCCGTAGCC TTATCGTCCA ATACGCCGGC ACCATACATA
GGTCTTGGAT CCGCATATGA GATGCTCGGC GATTATGAAA AAGCCGTGGA AGCGTTTTCC
TCAGCAGACG AAATCCTGCC GGACCGGCCG GGAATCCTTC TCAATCTTGG AAGGGTTCAG
GCAGAGGCGG GACAAAATAA GGCTGCTGCA CAAACCTATC TTGGAATCAT GAACAACCAC
CCCGACATTG CCGGAGCGGC AGTTCATGTC ACGACACTCT GCGCCGATCT CGGACGAAAT
GAAGAGGCGG CAAATGCGGC TTTAGCTGCA TTATCCAAAG GAGAGGGGGG CAAAGCTCTT
CTTACTCTTG GAGGAGATCT TTGTACCTCG GCAGATATGC TGGATCAGGC GCAGGACTGC
TATACTGCAG CACTTAAATT AGATCCGGAC GATATCCACT CCCTTTACTC GCTTGGACAT
GTCCACTTGT TAAAGGGCGA GTTCAAGCTC TGTGTCGACT ACATGGATCG GTGCCTCAGC
CTTGATTCCG ATCACACCAA AGCTTTAGCT GACAAAGGAT CGGCCTATGT GAACCTTGGC
AGACTGGAAG ATGCGGAAAA AGTACTTCGG CATCTGACGG ATATCGACAC CGCCAACACC
AAAGCCCTGC TGGAACTCGC AGATGTTCTT GAGCAGCTGC AGAAGTATGA CGAGGTCCTT
GAGGTGTATG CAAAATACCT CCAGGCAGGC ATTCCAAATG CGGATGTCAT CAGAAAACTT
GCTTCGATAT ATATCATGAG GGGTGAATAT GACGAGGCAC TCTCCGGATA CGACCTCCTG
CTCGAATCAA ATTCAGACGA CATAGTCACC CGCCGTCTGA GAGCCGAGGC TCTGCACTTC
CTTGGGAAAG ACATTGAAGC GGCAGAAGCC TGTGCAGAAA TGCTGACTCT TCGCCCGCAT
GACCAGAGTA TAAGATCCCT CTATGCCGCA TCACTTGCCA ACATCGGAAA AACGGAGGAT
GCCCTTAAAC AGTATGCCGA GCTGACCCTC AAAGACCCGG AAAACACCGC AGCCCTCTTC
GGATATGCCG AGATGCTATC ACGCATGGGT AAATATCCGG AAGCAGTCAG ACATTTCGAC
AAACTGATCG GAAAATACCC GCGAAACAGT CTGCTCCATA TCGAAAAAGC CCTTGCTTCG
ATCAAGATCG GCGAACCGGC GGATATCACG AGCGACATGA CGACCGCGGC CCAGGCCGAT
CCGAAAAATC CCTATGTCCT TTCCGGTCTT GGATTCATGC AGATGGTCAC CGGCCACCCG
ACCGAAGCGC TCGCCGCATT CGACAAAGCG GAAACTGCGG GATGCAAAGA CCCTGACCTG
AACTTCTGCC GCGGGCTCAT CTATCTTCAG CAGAACAGAT TCGATATGGC GGAAAAAGCC
GCCGACCACA TCCTGAAAAA CGATCCTGAT CATATGCCCG CCATGCATCT CAAAGCTCGG
TCTCTCGAGT CTGTGGGCAG ACTCAAAGAA GCCATCGGGT ATTACGACAG AATCGTGCAG
CTCTCGGAGA TCGATAAAGA GATGGGACCA TCCGAGGACT GA
 
Protein sequence
MVDIGGLFGK NDKQNPWIAR GEQAYEKGMY DDAAQHFTKA LELEPGSAKL WTKLAACQKF 
TQKYDAASRS YAKVVELNPD DFHAWTSLAV ILGDSGKYEE ALSALGHVVL PESEVYLKDR
KCEWLLRSGK YREAAAVCSQ LISHFPGNIQ YRIRYADLLM RSGMFAEARS VYDDLSSSLG
QTDLISNAAF CSEMTGDVEG ALKRYAHLAE NNIVGWYRRA RLEESLGNFK SAAAAYGMIQ
HYATGDDITV TIRRALSFYW DGNGKEAAVQ LEKILAKGYA NAELWHLLGT ISFLNGEFKR
AVEAFVECIH LNQTNSAVWY MKGCAEYLSG KYKEAVESFE KMGKIGTAGP SPAKMKWFED
SDLDLFDNAP AQKEQIKVEV GVVNEGLLAM QGYALAALGR YADADKAAGV VLSHAPSRLD
MELLHSRCLA GLGRYQYAGD AAARVLAKSP DNIAALERHA ESMMLAGKYK EAAGSWQKLV
EAAPENTLAY IGLLKACSGT GKYAEAEQLA GHLLKEYQPR DLTITLLAGD AALSAGNYEE
AVTHYQNAVA LSSNTPAPYI GLGSAYEMLG DYEKAVEAFS SADEILPDRP GILLNLGRVQ
AEAGQNKAAA QTYLGIMNNH PDIAGAAVHV TTLCADLGRN EEAANAALAA LSKGEGGKAL
LTLGGDLCTS ADMLDQAQDC YTAALKLDPD DIHSLYSLGH VHLLKGEFKL CVDYMDRCLS
LDSDHTKALA DKGSAYVNLG RLEDAEKVLR HLTDIDTANT KALLELADVL EQLQKYDEVL
EVYAKYLQAG IPNADVIRKL ASIYIMRGEY DEALSGYDLL LESNSDDIVT RRLRAEALHF
LGKDIEAAEA CAEMLTLRPH DQSIRSLYAA SLANIGKTED ALKQYAELTL KDPENTAALF
GYAEMLSRMG KYPEAVRHFD KLIGKYPRNS LLHIEKALAS IKIGEPADIT SDMTTAAQAD
PKNPYVLSGL GFMQMVTGHP TEALAAFDKA ETAGCKDPDL NFCRGLIYLQ QNRFDMAEKA
ADHILKNDPD HMPAMHLKAR SLESVGRLKE AIGYYDRIVQ LSEIDKEMGP SED
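
The protein sequence can be re-derived from the gene sequence, as the following sketch illustrates. It is a minimal sketch and not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG), and that the sequence contains only the unambiguous bases A, C, G and T. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = (
        "ATGGTAGATA TTGGCGGTCT TTTCGGCAAA AACGACAAAC AGAATCCGTG GATAGCACGC"
    )
    print(translate(first_line))  # MVDIGGLFGKNDKQNPWIAR
```

The output for the first line agrees with the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.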