Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_1217 |
Symbol | |
ID | 4796148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 1247194 |
End bp | 1249179 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640099893 |
Product | V-type ATP synthase subunit I |
Protein accession | YP_001030653 |
Protein GI | 124486037 |
COG category | [C] Energy production and conversion |
COG ID | [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000138447 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.236673 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCAGC CTAGACCGAT GACCCATTTG TTAATTGCCG CGAGCAAGGA GCAGATGGCG TCAGTTGTGA CCGAATTATA TCGTCACCAG ATTTTCCACA TCACCGATTT TGTAGATCAG GGGAAAGAAG GCTATGAAGG AGTTAAAATC GGCGTTCCTT TTGAAAACGC CGGAGAGCTC TCGAACAACC TCCTGAAAAT ACGCTCCATT GAGAGTGTGT TCCAGACCGC CCCGTCTACG CAAAGCACTG TCCAGATGCG TCCTGCAGAA ACGATTCGAG CAGCAATAGA TAGCGAGCTT CCCAAAATTG AGGAGGAAGT TAGCAGTCTC ACGGTAAAGC GATCGAATGC AGAGACCCGT ATCCGGGAAT GTGAACAGCG TATCAACGAA CTCAAACCAT TTGCCTTAGT GCCTCTAGAG CTTGATCTCT ACAGAGGTTA CGGCAGTCTG ACGGTTTTTG CAGGGTACAT CGAGCATGAC CTTGACATCT CTGTTCCGCA CGAAAAGACT TACGTCGCGG GAAAAGACGG AAACTTCATA GTCGTCTTTG CAGAGAGCAA AAACGCAGCA GACATCGAAA GGCAGCTGGC CGATGCAGGC TTCCAGAGCG TTTCCCTGCC GGTTGAATCC GGCACGGTCC AGGCAGCAGT GGAGAAATAC ACTGTCGAAC TGGGTAAAGA GACACAGATC CGTGAAGACT GTACCGCAAA GATCGTTGAA CTCAAAGCAA AATACGCAGA CTTACTCGCC GCTTGCGATG AAGTATTGTC CTGTGATGTG GAACGTGCCG AAGCGCCGCT TCGATTTGCA ACCACCGACG AAGCTTTCGT CATTGACGGC TGGATCCCGG CCGACACGGT GGATAAAATC ACAGCAGCAC TCAACCAGGC AACTGGTGAA CGCGTTTATG TAACCGTGGA TCCTTCGGAT TACGAAGCGA CCGCGGTACC GGTGGAATAT AATAATCCAT CATTTGCAAA GCCCGCCGAA CTGTTTATGG ACCTGTATTC CCGTCCAAAA TACAAAGAAC TGGACCCGAC CATTATCCTT GCGATCATGT TCCCCTTGCT CTTTGGATTC ATTGTCGGCG ATCTGGGATA TGGTCTTCTT TATCTGGCTC TTGCCCTCGT CCTTCGTAAG ACTTTACTTA AAATGGGCGA AACGGGATAC AAAGCCTTTA TCATCATCCT TGGAGCAGCT ATTTCAACGA GCGTCTTCGG ACTTCTCTAT AGTGAGTTCT TTGGTATGAG TCTCCCCTGG CACTCGATTA TTTTCTCCCG TCACCTTCCA ATTGGTGCCG GCATGGAACA CGCGATGCCC CAGGTTATTG AACTGCTGAT AGTATCAGCC CTTCTTGGTG TCATCTACAT CAGCGTTGGC CGTTTCTTCG GAATGATCAA TCACTACAGA ATGGATCACC CCGGAAGCCA CCGGACAAAA GCAGTCCTTG CACAGCTTGG ATGGAACTTT ATCTTGTGGG GCATTCTGGT GCTTATCCTG AGTGTGGCTG ACTTAAACGA ATACTGCCAG TTCCCCTTTG CACTTCCCAC ATTTGTCGCG TACCCGACCA TTGCTGGAAT TATAGGTGCA ATTCTGATCG TCGTTGGTTG TGTGTTCGTC GCTATCGAGA ACCCACTTGA CCTGATGGAA ATCCCAACAA ACACGCTTTC CCACATGCTG TCCTTCTGCC GTCTTGCTGC AGTCGGTCTT TCATCCGTGG CGATCGCAAT GGTCGTTAAC TTCATGGCTG TCGATTTATT CATCAGCCCT GCAATGGCCA ACCTGGATGT TGTTGGTGTA CTATTGATTA TTGTGGGTGT CATAATTCTT ATATTAGGTC ACGCACTCAA CGTCGCCCTT GGTATTCTTG GCGGTGCTCT GCACCCGATT CGTTTACACT ATGTCGAATT CTTTACCAAA TTCTACCAGG GCGGAGGAAT CATATACAAG CCCTTTGGAC TTAAAAGAAA ATTCTCTGAG GAATAA
|
Protein sequence | MFQPRPMTHL LIAASKEQMA SVVTELYRHQ IFHITDFVDQ GKEGYEGVKI GVPFENAGEL SNNLLKIRSI ESVFQTAPST QSTVQMRPAE TIRAAIDSEL PKIEEEVSSL TVKRSNAETR IRECEQRINE LKPFALVPLE LDLYRGYGSL TVFAGYIEHD LDISVPHEKT YVAGKDGNFI VVFAESKNAA DIERQLADAG FQSVSLPVES GTVQAAVEKY TVELGKETQI REDCTAKIVE LKAKYADLLA ACDEVLSCDV ERAEAPLRFA TTDEAFVIDG WIPADTVDKI TAALNQATGE RVYVTVDPSD YEATAVPVEY NNPSFAKPAE LFMDLYSRPK YKELDPTIIL AIMFPLLFGF IVGDLGYGLL YLALALVLRK TLLKMGETGY KAFIIILGAA ISTSVFGLLY SEFFGMSLPW HSIIFSRHLP IGAGMEHAMP QVIELLIVSA LLGVIYISVG RFFGMINHYR MDHPGSHRTK AVLAQLGWNF ILWGILVLIL SVADLNEYCQ FPFALPTFVA YPTIAGIIGA ILIVVGCVFV AIENPLDLME IPTNTLSHML SFCRLAAVGL SSVAIAMVVN FMAVDLFISP AMANLDVVGV LLIIVGVIIL ILGHALNVAL GILGGALHPI RLHYVEFFTK FYQGGGIIYK PFGLKRKFSE E
|
| |