Gene Information |
Locus tag | Mpal_0001 |
Symbol | |
ID | 7271383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 442 |
End bp | 1704 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643568660 |
Product | orc1/cdc6 family replication initiation protein |
Protein accession | YP_002465120 |
Protein GI | 219850688 |
COG category | [L] Replication, recombination and repair; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1474] Cdc6-related protein, AAA superfamily ATPase |
TIGRFAM ID | [TIGR02928] orc1/cdc6 family replication initiation protein |
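The reported lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of this unspliced CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(442, 1704)
print(length, protein_length(length))  # 1263 420
```

Under these assumptions the result agrees with the Gene Length (1263 bp) and Protein Length (420 aa) fields above.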
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00890369 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGGGCTTT TTAAGAAGTA TCTGACTGAT CGGAATAAAA TTTTCAAGAA CAGGGAGGTT CTCAGGCATT CGTACCGTCC TCATATCCTG CCTCATCGTA TGCCCCAGAT CGATGAGATC GCCTCAATTT TGGCCCCCTC CCTTCGCAAC GAGACCCCCT CAAACATCCT GATCTATGGA AAAACTGGCA CCGGGAAGAC TGCAAGTGTC AGATATGTGG GTTCTGAGCT AGAAAAAGCG TCAAGTCAGA ATGTGGCTAC CTGTGCTGTG GTCCATATCA ACTGCGAAGT AATCGACACC CAGTACCGAG TGCTGGCTCA GATCGCCAAG GATATCGAGA GACCTGACGA AACCCCCTCA GACAAGCTCA GACCCCATAT CCCGATGACC GGCTGGCCGA CAGACCAGGT TTATATGGAA CTGAAGAATC AGCTCGAGGC ACGCGGCGGT GTACTGGTGA TCGTCCTCGA CGAGATCGAC AAGCTGGTCA AGAAGAGCGG GGACGACACC CTCTACAACC TGACCAGGAT CAACTCTGAT CTCCACAGTT CCCGGGTCTG TATCATTGGC ATCTCTAATG ACCTCAGCTT CAAGGATTTT CTGGACCCGC GGGTCCTCTC CTCCCTATCA GAGGAGGAGA TCGTCTTTCC CCCGTACAAT GCTCCGCAGC TCTGCGATAT TCTCCAGCAA CGTGCAGAGA AGGCGTTTGT CCTTGAGACA CTCGATGAAG GGGTGATTCC GCTCTGCGCG GCTCTGGCTG CCCAGGAACA TGGAGACGCA CGCCGTGCGC TCGATCTACT TCGGATCTCC GGCGAACTCG CCGAGCGCGA GGAGGCTGAG AAGGTCGCTG AGAGCCATGT CAAGAGTGCT CAGGCTAAGA TCGAGACCGA CTCGATGATC GAGTGCATCT CGACCTTGCC GACGCAGTCC AAGGTGGTTC TCTACTCGAT GCTCCTCCTC GAGCAGCTCG GGCATCTGAT CTTCACCTCC GGGGAGGTCT GCAGAGTCTA CCAGGAACTG GCCTGCCACC TGGAGATTGA TGTGCTGACC AATCGGCGGA TCACTGATCT GATCTCAGAG CTGAACATGC TCGGGGTGAT CAACACCCGG GTGGTCTCCC GCGGCAGGTA TGGACGGACC AAGGAGATGT GGTTCGATGC AAACTCGACC AAGATCCAGG AAGTGATCCA GAAGGACCCA CGGCTCACAG AACAGGGGCT CGGGCAGATG GATCGCACCT GGCTGAAACA GACATTTAGG TGA
|
Protein sequence | MGLFKKYLTD RNKIFKNREV LRHSYRPHIL PHRMPQIDEI ASILAPSLRN ETPSNILIYG KTGTGKTASV RYVGSELEKA SSQNVATCAV VHINCEVIDT QYRVLAQIAK DIERPDETPS DKLRPHIPMT GWPTDQVYME LKNQLEARGG VLVIVLDEID KLVKKSGDDT LYNLTRINSD LHSSRVCIIG ISNDLSFKDF LDPRVLSSLS EEEIVFPPYN APQLCDILQQ RAEKAFVLET LDEGVIPLCA ALAAQEHGDA RRALDLLRIS GELAEREEAE KVAESHVKSA QAKIETDSMI ECISTLPTQS KVVLYSMLLL EQLGHLIFTS GEVCRVYQEL ACHLEIDVLT NRRITDLISE LNMLGVINTR VVSRGRYGRT KEMWFDANST KIQEVIQKDP RLTEQGLGQM DRTWLKQTFR
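The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. As a minimal sketch, the helpers below (hypothetical names, not part of any database tool) strip the whitespace and compute a GC percentage that can be compared with the GC content field. The rounding behind the reported 55% is not stated in the record, so the comparison is only approximate.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example with the first two blocks of the gene sequence in this record.
first_blocks = "ATGGGGCTTT TTAAGAAGTA"
print(clean_sequence(first_blocks))
print(gc_percent(first_blocks))
```

Pasting the full Gene sequence value into `gc_percent` gives a figure to compare with the reported GC content, and `len(clean_sequence(...))` can be checked against the Gene Length and Protein Length fields.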