Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0781 |
Symbol | |
ID | 7270522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 796170 |
End bp | 797213 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643569429 |
Product | amidohydrolase |
Protein accession | YP_002465866 |
Protein GI | 219851434 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0322523 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0439555 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAAG AGACCTGTTG CGGACGAGCA CTGATCGGGG AGGACCTCGA GGAGCGGTGC GTGGAGATCT CAGTCCTGAA TGGGATCATC CACCGGATAG AGGAGGTTCG TGCCGCACCT GAGATCTGGA TCTGCCCGTC CTTCTTCAAT GCCCACACCC ATCTCGGCGA TACGATCGCG ATGGATTGTC CAGCAAAAGG CGATCTGACT GACCTGGTGA CCCCGCCAGA CGGGCTCAAG CACCGACTGC TCCGGGCGGC GAGCCACCAG TCGCTGACCT GTGGAATGCA CCAGAGCATC GAGCGGATGA TCAGTGCCGG GATACACGGG TTTGCCGATT TCCGTGAGGG CGGTACCGAG GGGGTGACCG CTCTGAAGGA GGCCGCGACA GGACTCCCGT GCCGACCGGT GATCCTCGGG CGGGACGGCG GGGAGGCTGT GGCCGACGGG GCAGGAATCT CAAGCGTCCG GGACTGCCCG GATGTCGAGG GAACGGTCAG CAGATCCCGT ATAGCCAAAA AACTTGTCGC ATTCCATGCA GGTGAACGGG ATCGGTTTGA TGTCGACAGA GCACTCAGTT ATAATCCGGA CCTGCTGATC CATATGACCC ATGCCACCGA TCGACAGTTG AGAAAGGCCG CGGACCAGGG GATACCGATC GCTGTCTGTC CGCGTTCAAA CTGGATGCTG GGGGTCACCG ACTCCCGGGA TCACCCGCCG CTGAGAAGGA TGATCGATGC CGGCTGCAGG GTACTGCTCG GGACCGACAA TGTGATGTTC GTTGAACCTG ACCTCTTCTC TGAGATGGCG TTCACATCGA TGATCTACCA GATAGATCCG CGCGTATTGT TGCATGCGGC GATCGACGGA GCATTGCTGA CCGGCAGTTC TCCTTACATC AAAGAAGGGA GTGCTGCAGA GTTCTTATTG ATAAATACTC ATAATACGAG TTTAATCCAT TCCCAGGATA TGGTGACGAG CATCGTGAGA AGGGTCGACC GGTCTGTTCT GTCCAATACT CTTATAAAAC AAAAAAAGGA ATAG
|
Protein sequence | MSEETCCGRA LIGEDLEERC VEISVLNGII HRIEEVRAAP EIWICPSFFN AHTHLGDTIA MDCPAKGDLT DLVTPPDGLK HRLLRAASHQ SLTCGMHQSI ERMISAGIHG FADFREGGTE GVTALKEAAT GLPCRPVILG RDGGEAVADG AGISSVRDCP DVEGTVSRSR IAKKLVAFHA GERDRFDVDR ALSYNPDLLI HMTHATDRQL RKAADQGIPI AVCPRSNWML GVTDSRDHPP LRRMIDAGCR VLLGTDNVMF VEPDLFSEMA FTSMIYQIDP RVLLHAAIDG ALLTGSSPYI KEGSAAEFLL INTHNTSLIH SQDMVTSIVR RVDRSVLSNT LIKQKKE
|
| |