Gene Information |
Locus tag | Mlab_0678 |
Symbol | |
ID | 4794472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 648537 |
End bp | 651758 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640099338 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001030117 |
Protein GI | 124485501 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF; [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGATA TTGGCGGTCT TTTCGGCAAA AACGACAAAC AGAATCCGTG GATAGCACGC GGGGAACAGG CATACGAAAA AGGAATGTAC GACGACGCAG CCCAGCATTT CACAAAAGCT CTGGAACTAG AGCCGGGTTC GGCAAAGCTC TGGACAAAAC TTGCCGCCTG CCAGAAATTT ACTCAAAAAT ACGATGCAGC CTCCAGATCA TACGCCAAAG TTGTGGAACT CAACCCGGAT GATTTCCATG CCTGGACCTC GCTCGCGGTT ATTCTGGGAG ATTCTGGAAA ATACGAAGAG GCACTCTCTG CCCTCGGCCA TGTCGTTCTC CCGGAAAGCG AGGTATACCT CAAAGACAGA AAATGCGAGT GGCTTTTGCG TTCCGGAAAA TACCGGGAAG CTGCCGCCGT CTGCTCGCAG CTCATCTCCC ATTTCCCGGG AAATATCCAA TACCGGATCC GTTATGCCGA TCTTTTGATG CGAAGCGGAA TGTTTGCCGA AGCACGATCG GTTTACGACG ATCTCTCCTC CTCTCTGGGG CAGACGGACC TGATCTCCAA CGCCGCATTC TGCAGCGAGA TGACGGGAGA TGTCGAAGGA GCGCTGAAAC GGTATGCCCA TCTTGCCGAA AACAACATTG TCGGCTGGTA CCGCAGAGCC CGGCTCGAAG AGAGTCTTGG AAACTTTAAA AGCGCCGCCG CAGCATACGG TATGATCCAG CATTATGCGA CCGGCGATGA TATTACCGTC ACAATCCGCC GGGCTTTATC ATTTTACTGG GACGGAAACG GGAAAGAAGC CGCGGTCCAG CTGGAAAAGA TCCTGGCAAA AGGCTACGCA AACGCTGAAC TCTGGCATCT GCTTGGTACA ATCTCGTTTT TGAACGGAGA ATTCAAACGT GCCGTCGAAG CATTCGTCGA ATGCATTCAT CTGAACCAGA CGAATTCTGC GGTTTGGTAT ATGAAAGGAT GTGCCGAGTA TCTCTCCGGA AAATACAAAG AGGCAGTCGA AAGTTTTGAG AAGATGGGAA AAATTGGAAC CGCTGGACCA TCGCCTGCGA AAATGAAATG GTTCGAGGAC AGCGACCTTG ATCTGTTCGA TAACGCTCCG GCCCAGAAAG AACAGATTAA AGTCGAGGTC GGCGTCGTGA ATGAAGGGCT TTTGGCTATG CAGGGATATG CCCTTGCGGC ACTCGGCCGA TACGCAGACG CAGACAAAGC CGCCGGCGTT GTTTTAAGCC ATGCTCCCTC AAGGCTCGAC ATGGAACTTC TGCACAGTAG GTGTCTTGCC GGACTTGGGC GTTATCAGTA TGCTGGAGAC GCGGCAGCAC GCGTACTTGC AAAATCTCCC GATAACATCG CGGCTTTGGA ACGGCACGCC GAGTCCATGA TGCTTGCAGG GAAATATAAG GAAGCCGCCG GATCATGGCA AAAACTCGTT GAAGCGGCGC CGGAAAACAC GCTTGCCTAT ATCGGCCTCC TCAAAGCATG CAGCGGGACC GGGAAATATG CTGAGGCGGA GCAGCTTGCA GGCCATCTGT TAAAAGAGTA TCAGCCCCGC GATCTTACAA TCACCCTGCT TGCGGGAGAT GCGGCCCTTT CCGCCGGAAA TTACGAAGAG GCCGTTACCC ATTACCAAAA TGCCGTAGCC TTATCGTCCA ATACGCCGGC ACCATACATA GGTCTTGGAT CCGCATATGA GATGCTCGGC GATTATGAAA AAGCCGTGGA AGCGTTTTCC TCAGCAGACG AAATCCTGCC GGACCGGCCG GGAATCCTTC TCAATCTTGG AAGGGTTCAG 
GCAGAGGCGG GACAAAATAA GGCTGCTGCA CAAACCTATC TTGGAATCAT GAACAACCAC CCCGACATTG CCGGAGCGGC AGTTCATGTC ACGACACTCT GCGCCGATCT CGGACGAAAT GAAGAGGCGG CAAATGCGGC TTTAGCTGCA TTATCCAAAG GAGAGGGGGG CAAAGCTCTT CTTACTCTTG GAGGAGATCT TTGTACCTCG GCAGATATGC TGGATCAGGC GCAGGACTGC TATACTGCAG CACTTAAATT AGATCCGGAC GATATCCACT CCCTTTACTC GCTTGGACAT GTCCACTTGT TAAAGGGCGA GTTCAAGCTC TGTGTCGACT ACATGGATCG GTGCCTCAGC CTTGATTCCG ATCACACCAA AGCTTTAGCT GACAAAGGAT CGGCCTATGT GAACCTTGGC AGACTGGAAG ATGCGGAAAA AGTACTTCGG CATCTGACGG ATATCGACAC CGCCAACACC AAAGCCCTGC TGGAACTCGC AGATGTTCTT GAGCAGCTGC AGAAGTATGA CGAGGTCCTT GAGGTGTATG CAAAATACCT CCAGGCAGGC ATTCCAAATG CGGATGTCAT CAGAAAACTT GCTTCGATAT ATATCATGAG GGGTGAATAT GACGAGGCAC TCTCCGGATA CGACCTCCTG CTCGAATCAA ATTCAGACGA CATAGTCACC CGCCGTCTGA GAGCCGAGGC TCTGCACTTC CTTGGGAAAG ACATTGAAGC GGCAGAAGCC TGTGCAGAAA TGCTGACTCT TCGCCCGCAT GACCAGAGTA TAAGATCCCT CTATGCCGCA TCACTTGCCA ACATCGGAAA AACGGAGGAT GCCCTTAAAC AGTATGCCGA GCTGACCCTC AAAGACCCGG AAAACACCGC AGCCCTCTTC GGATATGCCG AGATGCTATC ACGCATGGGT AAATATCCGG AAGCAGTCAG ACATTTCGAC AAACTGATCG GAAAATACCC GCGAAACAGT CTGCTCCATA TCGAAAAAGC CCTTGCTTCG ATCAAGATCG GCGAACCGGC GGATATCACG AGCGACATGA CGACCGCGGC CCAGGCCGAT CCGAAAAATC CCTATGTCCT TTCCGGTCTT GGATTCATGC AGATGGTCAC CGGCCACCCG ACCGAAGCGC TCGCCGCATT CGACAAAGCG GAAACTGCGG GATGCAAAGA CCCTGACCTG AACTTCTGCC GCGGGCTCAT CTATCTTCAG CAGAACAGAT TCGATATGGC GGAAAAAGCC GCCGACCACA TCCTGAAAAA CGATCCTGAT CATATGCCCG CCATGCATCT CAAAGCTCGG TCTCTCGAGT CTGTGGGCAG ACTCAAAGAA GCCATCGGGT ATTACGACAG AATCGTGCAG CTCTCGGAGA TCGATAAAGA GATGGGACCA TCCGAGGACT GA
|
Protein sequence | MVDIGGLFGK NDKQNPWIAR GEQAYEKGMY DDAAQHFTKA LELEPGSAKL WTKLAACQKF TQKYDAASRS YAKVVELNPD DFHAWTSLAV ILGDSGKYEE ALSALGHVVL PESEVYLKDR KCEWLLRSGK YREAAAVCSQ LISHFPGNIQ YRIRYADLLM RSGMFAEARS VYDDLSSSLG QTDLISNAAF CSEMTGDVEG ALKRYAHLAE NNIVGWYRRA RLEESLGNFK SAAAAYGMIQ HYATGDDITV TIRRALSFYW DGNGKEAAVQ LEKILAKGYA NAELWHLLGT ISFLNGEFKR AVEAFVECIH LNQTNSAVWY MKGCAEYLSG KYKEAVESFE KMGKIGTAGP SPAKMKWFED SDLDLFDNAP AQKEQIKVEV GVVNEGLLAM QGYALAALGR YADADKAAGV VLSHAPSRLD MELLHSRCLA GLGRYQYAGD AAARVLAKSP DNIAALERHA ESMMLAGKYK EAAGSWQKLV EAAPENTLAY IGLLKACSGT GKYAEAEQLA GHLLKEYQPR DLTITLLAGD AALSAGNYEE AVTHYQNAVA LSSNTPAPYI GLGSAYEMLG DYEKAVEAFS SADEILPDRP GILLNLGRVQ AEAGQNKAAA QTYLGIMNNH PDIAGAAVHV TTLCADLGRN EEAANAALAA LSKGEGGKAL LTLGGDLCTS ADMLDQAQDC YTAALKLDPD DIHSLYSLGH VHLLKGEFKL CVDYMDRCLS LDSDHTKALA DKGSAYVNLG RLEDAEKVLR HLTDIDTANT KALLELADVL EQLQKYDEVL EVYAKYLQAG IPNADVIRKL ASIYIMRGEY DEALSGYDLL LESNSDDIVT RRLRAEALHF LGKDIEAAEA CAEMLTLRPH DQSIRSLYAA SLANIGKTED ALKQYAELTL KDPENTAALF GYAEMLSRMG KYPEAVRHFD KLIGKYPRNS LLHIEKALAS IKIGEPADIT SDMTTAAQAD PKNPYVLSGL GFMQMVTGHP TEALAAFDKA ETAGCKDPDL NFCRGLIYLQ QNRFDMAEKA ADHILKNDPD HMPAMHLKAR SLESVGRLKE AIGYYDRIVQ LSEIDKEMGP SED
|
| |
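The Start bp, End bp, Gene Length, and Protein Length fields in the Gene Information table are related by simple arithmetic, and the short Python sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 648537
END_BP = 651758


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 3222 1073
```

Both results match the table (3222 bp and 1073 aa). The strand does not affect the length calculation, since it only determines which end of the interval holds the start codon.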