Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1890 |
Symbol | |
ID | 7272707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2004300 |
End bp | 2006315 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643570505 |
Product | cell surface protein |
Protein accession | YP_002466918 |
Protein GI | 219852486 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.642058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCCCT TTTCCCCTCC CTCCCGATTG GACCGGTTCT CCAGGGTTGC CTCCCTGCTC TGCATCCTGA TGCTGGTCGC AGTAACGGTG CCCCCCGCCA CAGCGGTGGC TCTCTCAGTA TCATCTGAAA GATCCGGACC GGTCCTGCAG ACCAGTGACG ATCTGGTCTC TTCGGCAGAG GTGGAGCGAG CGATCGAAAC GGTGGAAAAC CAGTCTTCTT CAGTCGGTCA GAGCGGGGGA GTCCAGAAGA TCGCCACCGA TCTGCTGGAC TCTGCACAGA TGCAGCGTCA GGGAGAGTCC GGAGAGGAGA AGAATACTTC CTGGTCGATC CAGGTTTATA TCACCCTTCT GTCTGGAACA CCCACCTCAT CCATCGATTC CATGGTTGAC GAAGTGACTG ATCGGGATGA GGAGGATCAT CTGGTAGTGG CCCGGGTCAC TGGACCTGAG TTGACGTCGC TGGCTGCAGA TCCCCGGGTG AAGGGGATCA GCAGGGTCAG TGCACCCCTG GTGAATGCCG GCAGAACGGT CACTGCAGGC GATACGCTTC TCAAGGCGGA CCAGCTTCGG ACGCAGTACG GTGCTTCTGG GGCCGGAGTG AAGATCGGCG TGATCTCTGA CGGAGTCAAA GGGCTTGCAG AGGCCCAGGC CACCGGAGAC CTGCCCGCCG ATCTCCATGT CATCTCCAAC GCCGATGGTG GAAATGAGGG GACTGCCATG CTGGAGATCA TCCATGACGA GGCACCCCAG GCGACCCTGT ACTTTCACGA ACATGGATCC AATGTGGTCG CCTTCAACCA TGCGGTCGAT GAACTGGTCG CTGAGGGGTG CACGGTGATC TGTGATGACA TCTGCTGGCC AAAGGAGCCG TTCTTCGAGC AGGGGATCGT CGCTACCCAC ATCTCCTCGG TGATCGCTGA AAAGCAGATC ATCTATGTAT CAGCCGCGTC GAACTATGCC GACCAGCACT ATCTGGGTAC GTATTACAAT GATGGCGACA ACTTTCATGA CTTCAGCAGC GGCACCTCTA TTCACAAGAA CATGTATGTC GACCTTCCAT CGGGGAGTAG CGTGTCGGCG GTGCTGCAAT GGGATGACAA ATTCGGCAGT TCTGGCAACG ATTACGATCT CTATTTTGTC GACGCCCGGA CCGGCATGAT CCTCGATCAG AGTACCTCCC GGCAGGATGG CAATGACGAT CCTATCGAGA CGATCGAGTA CCAGAACGAC GGTTATTCGA CGATCGAAGG GGAGTTCGTG ATCCGTAACT ATCGGGGGGA GGCTCAGGTC AGGACCCTCG ATCTCTTCAT CTACCCGGAT GATGATGCCA CGATGTACTC CAACAATGTC AACCCTGCAG GCTCGGTCTA TGGTCACCAG GCGATTCCTG ATGTGGTGAC GGTCGGTGCG GTCAGGGCCG ATGGTGTTCG GAGTATCAGT ATCGAACCGT TCTCCTCCCG CGGGCCGGTG AACCTGATCT ATCCGTCACC TCTTACGATC ATTAAACCGG ACATCTGTGC CCCCGATCGG GTGGCCGTCT CTGGGGCTGG GGGGTTCGAT ACGATCTTTG TAGGAACCAG CGCCTCGGCT CCTCATGTTG CGGGCATCAT CGCACAGGTC TGGGGCGTGC TCCCGAACCG GAGTGCCAGT GAGATCAGAA CAGCCCTCCT CTCCTCGGCC GTCGACCTCG GTTCGTCTGG CAGAGATTCT GTCTATGGGT ATGGCCTTGC CGATGCTGTG CAGATGTATA TCGCAGCCGG AGGGGGGGCC GTGCTGGTCG GGCAGGTTCC TGCATCGGTA ACGCCGACCC CCATGGCGAC CATTGAGCCG TCTGTCACCC AATCCGCTTT ATCGACAGAT CCATTCAGAA GGAGCAGCCC GTTCGGGCAA ACGGGGATGA CCTCGTTCGG GACATCCACA GCCGGCACCC CCGGGACAAC CCCTGTGTCG ACGGCGTCCG ATCTCTCTGT ACCCTCAGCA CCGACCTTTG TCAGATGGTC CCCATGGGGG ATCTGA
|
Protein sequence | MYPFSPPSRL DRFSRVASLL CILMLVAVTV PPATAVALSV SSERSGPVLQ TSDDLVSSAE VERAIETVEN QSSSVGQSGG VQKIATDLLD SAQMQRQGES GEEKNTSWSI QVYITLLSGT PTSSIDSMVD EVTDRDEEDH LVVARVTGPE LTSLAADPRV KGISRVSAPL VNAGRTVTAG DTLLKADQLR TQYGASGAGV KIGVISDGVK GLAEAQATGD LPADLHVISN ADGGNEGTAM LEIIHDEAPQ ATLYFHEHGS NVVAFNHAVD ELVAEGCTVI CDDICWPKEP FFEQGIVATH ISSVIAEKQI IYVSAASNYA DQHYLGTYYN DGDNFHDFSS GTSIHKNMYV DLPSGSSVSA VLQWDDKFGS SGNDYDLYFV DARTGMILDQ STSRQDGNDD PIETIEYQND GYSTIEGEFV IRNYRGEAQV RTLDLFIYPD DDATMYSNNV NPAGSVYGHQ AIPDVVTVGA VRADGVRSIS IEPFSSRGPV NLIYPSPLTI IKPDICAPDR VAVSGAGGFD TIFVGTSASA PHVAGIIAQV WGVLPNRSAS EIRTALLSSA VDLGSSGRDS VYGYGLADAV QMYIAAGGGA VLVGQVPASV TPTPMATIEP SVTQSALSTD PFRRSSPFGQ TGMTSFGTST AGTPGTTPVS TASDLSVPSA PTFVRWSPWG I
|
| |