Gene Information |
Locus tag | Mlab_0723 |
Symbol | |
ID | 4794614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 696648 |
End bp | 698231 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640099383 |
Product | hypothetical protein |
Protein accession | YP_001030162 |
Protein GI | 124485546 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
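
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. Both assumptions are consistent with the values listed but are not stated by the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Mlab_0723.
length = gene_length(696648, 698231)
print(length, protein_length(length))  # prints: 1584 527
```

Both results match the Gene Length (1584 bp) and Protein Length (527 aa) fields.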
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.703564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.46015 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC GTATTTTAGA GGGCTTATCC ATCAACGGAT TCGCCGCACT GTGTATTGCA GGCGTACTGA TCGGCGCCAT GGTGATTTGC GCCGGATGCG TCCAGTCAGG AAATGATGAC AGTACGCTTC GCGTCGTCAT GAACTTCGGT CCCGATTCAA GCGGCAGTCT CGACCCGGCC AATGGGTGGG AAGGCTGGTA TGTGCGGGAA GCGGGCCTCT TTGAGACGCT TTTCTATGAT GATCAGAACA TGGAACTCAA ACCCGAACTG GCAACCGGCT ACAAAGCATT GAGCGATACC GAATGGGAAA TCACGCTTCG CAAGGGCGTA GTCTTCCACG ACGGAACGCC GTTCAACGCC GACGCGGTGA TATATTCGTT CAACCGGGTC CTCGACCCCG CAAACAGCCG TTCATCCGAG TACTCATTCA TTAAAGAAGT GAGAAAGACT GACGACTATA CGATCGTAAT CGAAACGACC GAACCGTATG CTCCGCTGAT CGCATCGCTC GTTGATCCGG TCATGTCGAT CATCAGTCCG AACATTGTCG ATGTGGACAA ACAGCCGGTC GGTACCGGAC CCTATGCATT CGTTTCGTTT GAATCGGGTG CAAGCATGGA TCTCGTCAGA AACGACAAGT ATTGGGGCGG CACGCCGAAA GCTGCGAAAC TCTCCATGAC ATACAACGCC GACGCGACCG CACGGACGCT TATGCTGAAG TCCGGCGATG TCGATATCGC AAAGGATATT CTCCCAAGCG AGTATGCTTC CCTGAAATCA GACTCCTCGA CCGATGTCGA ATCCAAGGAG ACGCTTCGAG CATACTTCAT CTACATCAAC GGCGGAAAAG CTCCGTTCGA TGACGTGAAC GTTCGTCAGG CACTCAGCTA CGCACTGAAC CGTCAGGAAA TCGTGGATAC CGCGCTTGAA GGCGTGGCGG GTTCTCCTGC GGTAGGTATG TTTACGAATA CCATGCCGTG GAACGCAAAC GACAAGATCG CCAGCTACGA CAACGATCAG GCAAAAGCCC TTGAACTCCT GGCCAAAGCA GGAATCACCA AAGGATCCGA CGGAAAACTG TATTATAACG GCGAACCGTT CACCATCGAG ATAATGACCT ATACGAACCG TGCGGCCCTT CCGGCAAGCC TGGAAGTGAT CGCGGCCCAG TATGAAAAGC TCGGTATCAC GGTCACGACG AAACACGCTG AATGGAGCGC GATCAAATCA ACCGTGACAT CGGGAACCTA TGATATGGTA CTCTATACCT GGGTCACGGC ACCGACCGGA GACCCGGACT ACTTCATCAG CGGTCACTAT CTCTCGACCG GGGCATACGC TTCCGGCTGG ACCCATTACT CGAATCCGCA GATGGATGAA TGGATCCTTG CCGCACGGTC GACCTTTGAT CAGACCAAGC GAGCCGAGCT CTACGACAAG ATCCAGGAAC AGGCGCAGAT CGATTGTCCG ATCATCTACG TGTTCTATGC GATGGAAAAC GATGCGATGA GTACCTCCGT TAAGGGATTC ACCATCTACC CCAACGACTA CACGCTCGTC ACAAAGAACA TCGCGGTCGT ATAA
|
Protein sequence | MKKRILEGLS INGFAALCIA GVLIGAMVIC AGCVQSGNDD STLRVVMNFG PDSSGSLDPA NGWEGWYVRE AGLFETLFYD DQNMELKPEL ATGYKALSDT EWEITLRKGV VFHDGTPFNA DAVIYSFNRV LDPANSRSSE YSFIKEVRKT DDYTIVIETT EPYAPLIASL VDPVMSIISP NIVDVDKQPV GTGPYAFVSF ESGASMDLVR NDKYWGGTPK AAKLSMTYNA DATARTLMLK SGDVDIAKDI LPSEYASLKS DSSTDVESKE TLRAYFIYIN GGKAPFDDVN VRQALSYALN RQEIVDTALE GVAGSPAVGM FTNTMPWNAN DKIASYDNDQ AKALELLAKA GITKGSDGKL YYNGEPFTIE IMTYTNRAAL PASLEVIAAQ YEKLGITVTT KHAEWSAIKS TVTSGTYDMV LYTWVTAPTG DPDYFISGHY LSTGAYASGW THYSNPQMDE WILAARSTFD QTKRAELYDK IQEQAQIDCP IIYVFYAMEN DAMSTSVKGF TIYPNDYTLV TKNIAVV
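
The sequence fields above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be normalized before use, and how a GC percentage like the one in the GC content field could be computed, is given below. The helper names are hypothetical, and the record does not say how its own GC figure was rounded.

```python
def clean_sequence(field: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used only as a short demonstration.
fragment = "ATGAAAAAAC GTATTTTAGA GGGCTTATCC"
print(len(clean_sequence(fragment)))  # prints: 30
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare against the listed GC content.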