Gene Information |
Locus tag | Mlab_0803 |
Symbol | |
ID | 4794651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 783354 |
End bp | 784622 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640099466 |
Product | hypothetical protein |
Protein accession | YP_001030241 |
Protein GI | 124485625 |
COG category | [S] Function unknown |
COG ID | [COG0585] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00094] tRNA pseudouridine synthase, TruD family |
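The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state this, but the reported gene length agrees with it) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 783354
end_bp = 784622

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per amino acid, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1269 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) fields.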
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.929073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000530348 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACCAT CCACCCACCC CCTCGAAATC GACCTCGGCA TGCGGTACTA CGCCACCAGC CAGCCCGGAA TCGGCGGACG TCTCCGAACC ACCCCCGAAG ACTTCGTCGT CGAAGAACTG CCCATCACCT TCACCAACAC CGGCCCCTAC ACCATCTGCA AACTCACCAA ACGCTCCTGG GAACACCAGC ACGCCATGCA CGAAATCACC AACCGGCTGC GCATCAGCCA GAAACGCATC GGATGGGCAG GCACCAAAGA CAAAAACGCC GTCACCACCC AGTACATCTC CCTCTACAAC GTCCCCGCCG AAGCCGCCGC CAACCTCAAC ATCAAAGACA TGACCATCGA ACCGGTCGCG ACCCACCAGT TCTCCCTCGG CCTTGGAAAC CTCCTTGGAA ACAGATTCAA GATCACCCTT CGGGACTGCG AACCGAACGA TCTCGCAAAA AACACCGCTG AAATCTCCGG CGAAATCGCT GCAGGAATCC CCAACTACTA CGGACTCCAG AGATTCGGCG CCCTCAAACC CGTCACCCAC AAAATGGGTT ACCACATCCT CAGAAAAGAA TTCAAAGAAG CCGTCGATCT CTACGTCGGA GGCTGCTTCC CCTACGAATC CGAACAGGTC CAAACCGCCC GCAAAAACTT CGCCGAGACC GGCGACGCAA AAACCGCCCT CTACGAACTG CCGCCCTGGC TCTCGTATGA GCGGATCATG CTCGACTCCC TCGCCAAAAA CCCCGGCGAC TACGGAGCGG CCCTTCAGGC AATGCCCCCA AAACTCCTCT CCATGTTCGT CTCGGCATAC CAGTCCTGGC TCTTCAACAT CGCCCTTTCC AAACGGTGCG AAGAAAACGC CCCCCTCAAC GAGCCGAGAG TCGGCGAACA CCTCGAATTC ACCAACGGAC GCGTCGACAC CGTCACCGAG AAAAACATCG CGACCGCCCG CCAGCACATG AAACGCGGCC GATGCTTCAT CGTCGGCTGG ATGCCCGGAA AGACCCTGCC CGTGGCCCCC GGTCCTCTCG AAGAAACGAT GTTTGCCCAG ATGGAAAAGG ACAACATCTC CATGCAGAGC TTTGCCGACG CAACAGAGTT TGTAAAAACC AACTTCGACG GCGCCCACCG AAGGATCTCC CTCGCAACCG AAGTAGAAAC CGCGGTGTTT GAAAACAACG TGCAGCTGAA CTTCGTTCTG CCGCCCGGCC ATTATGCGAC CACCGTCGCA CGGGAGTTCA TGCAGGCCGC TCCCGAAAAA ATGGTCTGA
|
Protein sequence | MKPSTHPLEI DLGMRYYATS QPGIGGRLRT TPEDFVVEEL PITFTNTGPY TICKLTKRSW EHQHAMHEIT NRLRISQKRI GWAGTKDKNA VTTQYISLYN VPAEAAANLN IKDMTIEPVA THQFSLGLGN LLGNRFKITL RDCEPNDLAK NTAEISGEIA AGIPNYYGLQ RFGALKPVTH KMGYHILRKE FKEAVDLYVG GCFPYESEQV QTARKNFAET GDAKTALYEL PPWLSYERIM LDSLAKNPGD YGAALQAMPP KLLSMFVSAY QSWLFNIALS KRCEENAPLN EPRVGEHLEF TNGRVDTVTE KNIATARQHM KRGRCFIVGW MPGKTLPVAP GPLEETMFAQ MEKDNISMQS FADATEFVKT NFDGAHRRIS LATEVETAVF ENNVQLNFVL PPGHYATTVA REFMQAAPEK MV
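The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand. The sketch below is one way to do that, not the method used to produce this record. It assumes the standard codon assignments, which translation table 11 shares with table 1 for amino acids (the two differ in permitted start codons). The helper `translate` and the table-building code are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of ten bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGAAACCAT CCACCCACCC CCTCGAAATC"
print(translate(first_blocks))  # MKPSTHPLEI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence string to `translate` in the same way should reproduce the full protein sequence.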