Gene Information |
Locus tag | Mpal_0738 |
Symbol | |
ID | 7270472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 736901 |
End bp | 738784 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643569381 |
Product | hypothetical protein |
Protein accession | YP_002465825 |
Protein GI | 219851393 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
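The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length of an unspliced CDS includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(736901, 738784)
print(length)                     # 1884
print(protein_length_aa(length))  # 627
```

Both values agree with the Gene Length and Protein Length rows of the record.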
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.140724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAGTCCC ATACTCTCTT CACTCACTCT CTTTTCCGTG TCTGCAGTCT CCTGATCTGC CTTGTCCTGC TGGTCAGCGC AGTTTCAGCC GCCACAACCG AGGTTCATGT GGTGAAGACC GCGGCCGACG GCAGCACCGT CCTCGCCGAA CGGACCGTCG ACTACCAATG GCTCGAATCA AACCTGCCCG TGCTCGGTGA CGGCGTCACC CACTACTACC TCCAGGGCCC CGTCTTCGAC GGCGATGTGT GGGATCCCAC CGAGTCGGTG AACGTCGAGG AGAAGGATAT GGGCGCCCTG AAAGGGACCG ATCTCCGCGA TATCTGCGAC CTCGTGGGGG GAATGCAATC CGACGATACG CTTGTCGTCA AGGCCTCAGA CGGCTTCTCG AAGCGATTCT CGTACAATAC CATCTATACT CCCCCAGCTC GGGAGGGTCC GCTCGTCCTG ACCTGGTACA GGGGCGACGA AGGGTACGTC CCGTCGTACT CGAACGGCAT GCGGATCGTA TTCTTCGCCG ACACCTCCGT GAACCCCTGG GGCTATCACG TCTTCGGCCT CAGCGACATG AAGGCTTCGA TGCCCTCGAA CGAATGGAAC TTCTTTGACA TCTACCCGAC CACGACCGGG CTCTCGGCCA TGTATGTGAA CGAACTCCGG ATCGTCTCGA CTCAGCAGGC GCCCACAACC ACTGAGACGA CCACCTCCGC GACACCGACA CTAACGCCCA CAACAACCCC AACCGACACC CAGACCGAGC TGCCAACGGT GGTCGTTACG ACTATAAACA CCACGCCGAC GTCGACAACC ACAGCAACCC CGACTGACAC CCAGACCGAG CAGCCGACAG TGGTCGTCAC GACCATAAAC ACCACACCAA CGTCGACAAA CACTGCAACC CCATCCGTCA CTCAGACCCA GACACCCACA GCGACCACCA CGACACTCAC GATCATCCCG ACGGTCTCTG TCACCGGAAC GACGGTGCCG ACGACTGTGG GGACGATCGA ACAGGTCTCC CCCACGACGG TGGCGACCAC CCCGTCGATA ACCACAGTCA GAACCATCGA TACCGGAGTG ACTACCAGAT CCACCCCGGT CGTAACAGCC AGCACCGCCC CTGCGGTATC CACAATACCA CAGACCACAA CAGCACTCGC CCCGGTCATA TCGTCCCCCT CAACCACGAG TCCAGTACCC TCGATCTCCC TATCAGGTAT GAATACGCTC TTACCCTCCC ACGAGCAGAC CGTCGTTCCA GCTCCAGACA TCACCAGCAC CGGTATCCAG TCCTCGACGA CATCCAGTGC AGGAACCATC GCACCGATCA TCACAGCCAC GAGCAGGTCT TCAGTTCAGA CAGCCAGTTC TTCTGGAGGA GCCCCCGTCT CAACCGGAGA TTCATCTTCA GACTCATCGA GCGACGACTA TACCGGCGTC GGTGGGACAA ACACAACGAC CGCAACGACG GCGATCACAA CTGTAACTAC ACCTCATACA CCGACTGGAA ATGCAACGCC CACTCCGACC CAGGTAAATA CCAGCACTCC AGTGACGACA ACGTCGACCC CACCAGTTCT CGACACAGGA AGTCAGGACG AATACTCGCC GCTGATCCTT CCCGGAGGTT CGAACTCAAA ATCCTCGTCT GGAGGACAGA CCTCTACCAA GAACTTCCTC TCCACGATAC AGTCGGCCAT CGACCGCCTC TCGTCGTCAG ACCTGAGTAT CTTCCTGCTG ATCGCGGTGG CACTGCTCTT CATCCTGCTG ATCTTTGCAG GGCTGATCAT CATAGTTCTC 
CTCCTGCTCC TGGCACTGGC CGGGATCCTG TACCTGCGGC AACGAAGGGA GAAGGAACAG AAGGATGAAC AGAACAGAGA ATGA
Protein sequence | MESHTLFTHS LFRVCSLLIC LVLLVSAVSA ATTEVHVVKT AADGSTVLAE RTVDYQWLES NLPVLGDGVT HYYLQGPVFD GDVWDPTESV NVEEKDMGAL KGTDLRDICD LVGGMQSDDT LVVKASDGFS KRFSYNTIYT PPAREGPLVL TWYRGDEGYV PSYSNGMRIV FFADTSVNPW GYHVFGLSDM KASMPSNEWN FFDIYPTTTG LSAMYVNELR IVSTQQAPTT TETTTSATPT LTPTTTPTDT QTELPTVVVT TINTTPTSTT TATPTDTQTE QPTVVVTTIN TTPTSTNTAT PSVTQTQTPT ATTTTLTIIP TVSVTGTTVP TTVGTIEQVS PTTVATTPSI TTVRTIDTGV TTRSTPVVTA STAPAVSTIP QTTTALAPVI SSPSTTSPVP SISLSGMNTL LPSHEQTVVP APDITSTGIQ SSTTSSAGTI APIITATSRS SVQTASSSGG APVSTGDSSS DSSSDDYTGV GGTNTTTATT AITTVTTPHT PTGNATPTPT QVNTSTPVTT TSTPPVLDTG SQDEYSPLIL PGGSNSKSSS GGQTSTKNFL STIQSAIDRL SSSDLSIFLL IAVALLFILL IFAGLIIIVL LLLLALAGIL YLRQRREKEQ KDEQNRE
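The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so the protein sequence can be reproduced by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the standard codon assignments, which translation table 11 shares for elongation codons, and translates the first 30 nucleotides of the record. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Translation table 11 uses the same assignments as the standard code;
# it differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGAGTCCC ATACTCTCTT CACTCACTCT"))  # MESHTLFTHS
```

The output matches the first block of the protein sequence. Pasting the full gene sequence into `translate` should yield the complete 627-residue protein under the same assumptions.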