Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_1652 |
Symbol | |
ID | 4795905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 1680104 |
End bp | 1681525 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640100337 |
Product | hypothetical protein |
Protein accession | YP_001031080 |
Protein GI | 124486464 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase [COG0710] 3-dehydroquinate dehydratase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase [TIGR01093] 3-dehydroquinate dehydratase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00124545 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGCAA ATTGTGCCGT TATCGTGGCG AAAGATGAGA GAGAAGTTCG AACTCTTTCA GAAGAAGCGA TCTCGAAAGG AGCAGAGGCT CTTGAATTCC GGCTGGATTC CTTCCCGGTT ATTCCTGCAG ACCTCTCCTT TCTTTCATGC GGGGTTCCAT CAATAGCCAC ACTCAGATCT CCGTTGGACG AAGAGCGGAA GGAGATTTTC ATCCGTGCTC TTTCGTCGGG CGCGACATAC ATCGACATAG AATCAGATTC GGTTCTGAGA GGTCAGTTTC CAAAAGAACA GGTCATCTGT TCGTATCATG ACTTTGAAAA AACTCCTGCC GATGATGAAA TTCTCACGAT CTTCAAAGAC CTCCAATCTT CGGGAATCCC CAAAGCCGCA TTCATGGTGA GGGGGCCGGC GGATCTGCTT GCCGTCTGGA ATGCAGCATC AGTTCTGAGA GAAAGAGGAT ATCCGTTTAT CTTTATAGGC ATGGGGGCCG CCGGTGAGGT TACACGTATC CGTGCCGCCG AGCTTGGATC AATGCTCAAC TACTGTGCCT TAAGGCCGGA ACTTGCATCC GCTCCGGGTC AGATAACACT GGAAGAGGCT GAGAGACTCG GTACCGACCC GCAGATTACT GCCGTAACCG GATATCCTTT GACCCATACC TTCTCGCCGG AGATCCACAA CGCAGCGTTC AAAGCCGCGA AGATCCGGGG AAGATATGTG AAGATCCCCG CAGCTTCTCA GGAAGTGCCC CTGATTCCTG ATGTTATCAA AAAATATCGG ATCATCGGGA TGAACGTAAC CATTCCTCAT AAAGAGTCGG TCATCCCTCT CCTGAAACGG ATAGATCCTC TCGCGGAGTC TGCCGGCGCA GTGAACACTA TCCGAAATTC TCCCGCCGGC CTTGAGGGGT ACAATACCGA TATTCTGGGA ATAGCGGCGT CGCTCGCTTC GGTGGATATG GATCCAAAGG ATGCCGATGT GCTTATCATC GGTGCCGGAG GGGCAGCAAA GGCCGCCGCC GCATATCTGA AATCTGCCGG TGCTCGTCTT TCGATAACCA ACAGGACCCA TTCCCGGGCA GAGACACTTG CCGAAATATT TGGTGCCCGG GCAGTGAAGC TGGAGGACCT TGCTCCGGAG TATCAGCTGA TTCTGAATGC GACCCCGGCT GGAATGAGCG GGTTTGAACA CTGCTCACCG GTTCCAAATA CGCTATTCAC CAAAGACACG GTTGTTATGG ATATGATCTA TGATCCGGAA GTGACTCCCC TTCTTGCCGC TGCAAAAGCA GCCGGCGTTC GTGCCTGCAT CAACGGAAAG ACCATGCTTA TCGAACAGGC CGCCGCGTCA TTCACTCTCT GGACAGGAAT TACTCCGAAT CGTGATGTGA TGAGAAAAGC ATTCGAAGCG AGGGCGGCAT GA
|
Protein sequence | MTANCAVIVA KDEREVRTLS EEAISKGAEA LEFRLDSFPV IPADLSFLSC GVPSIATLRS PLDEERKEIF IRALSSGATY IDIESDSVLR GQFPKEQVIC SYHDFEKTPA DDEILTIFKD LQSSGIPKAA FMVRGPADLL AVWNAASVLR ERGYPFIFIG MGAAGEVTRI RAAELGSMLN YCALRPELAS APGQITLEEA ERLGTDPQIT AVTGYPLTHT FSPEIHNAAF KAAKIRGRYV KIPAASQEVP LIPDVIKKYR IIGMNVTIPH KESVIPLLKR IDPLAESAGA VNTIRNSPAG LEGYNTDILG IAASLASVDM DPKDADVLII GAGGAAKAAA AYLKSAGARL SITNRTHSRA ETLAEIFGAR AVKLEDLAPE YQLILNATPA GMSGFEHCSP VPNTLFTKDT VVMDMIYDPE VTPLLAAAKA AGVRACINGK TMLIEQAAAS FTLWTGITPN RDVMRKAFEA RAA
|
| |