Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC5_1501 |
Symbol | |
ID | 4927820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C5 |
Kingdom | Archaea |
Replicon accession | NC_009135 |
Strand | + |
Start bp | 1444322 |
End bp | 1445698 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640166997 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001098012 |
Protein GI | 134046527 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.604824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAC TCAAAATAAT AAGGGATCCA ATCCACAAAG ATATCAAGCT TAATGAAGAA GAGATTTCAA TAGTAGATAT GCCCGAAATT CAGAGATTGA GAAATATTAA GCAGACCGGA CTTACTTGCT TAGTATATCC CAGTGCAAAC CATACCCGTT TTGAACATTC TATCGGTACA ATGCACGTTG CAGGCGAAAT TGCCAAAAAT TTGGAAAATA TTGATAGAAA TCTAACCAAA ATCGTTGCTT TACTTCACGA TATTGGACAT CCTCCTTTTT CACATACTTT AGAGGTTGCA GGATACAATC ATGAAGAATT TACGAAAGAA AAAATAAAAA AAATGAGTTT TGAGAACTAT ACTTCCAAAG AAGTTTTAGA TGTATATTCC TCAAAAGGTC TAGAAGGCTC ATTGATCCAC GGGGATGTCG ATGCTGACAG AATGGACTAT TTAATAAGGG ACAGCCACCA TACTGGAGTA GCATATGGCT CAATTGATAT TCCTAGACTC ATTAGAAGTA TCGTGGTTCT TGAAGATACG AATAAACTTG GAATCATTGA AAAAGGAAGA ACCACTGTTG AGTCCCTCCT TACTGCACGA TACCAGATGT ATCCTACCGT TTACATGCAT CCTGCATCCA GAATTTCAGA AACCATGATT AAAAACGCGA CGATCGATGC AATAAAGGAT AGTATCTTCA AATTAAGTGA TTTATCGGTA ATGGATGATA TTGATCTCAT TTGTACCCTC AGAAGATCCG AAAGCTCAGC AACTGAAATG ATGAAGAGAT TGGATAATAG GGATTTATTT AAAAGCATTT CGATTCAAAG ATACAATGAA TTATCCCCTA AAGAGAGATG GAACTTAATA AATTTATCTG AAAATGAAAT AGGTTTAATT GAAAACGAAA TGACTGAATA TTTTGAATCT AGAATCTTTT TAGATATCCC AAAACCTCCT AAAATGGCAG AACACAGAAT TACTGTCATG ATGGGCGATA GAAAACATAG ACTCGATGAA ATATCTCCAC TTGCAGAAAG TTTAAAAGAA GCATACAAAA AATCATGGAG TATCATGGTT TATTCAGAAC CGGAATCCGC AAAAAAACTA TCTGAATTGA TAAAGGATAG AGAAAAATTC TTATTTGAGT TTATTACTGA TGGGCCCATT GATAACCCGA TTTTAAATGT TTTAAAAGAA CACGGAACCG TTCAAGGAGT TACAAAATTA GCAGGCCTCA TCAAGAAAAG TCCAAACGAT GTAGAATTCC ATTCCGAACT TCAAAAATTA ATATTCTGCG GTTTAGTGGA TAAAAAAGTT GAGCCTGTAC GCGGAACTTA TAGGTACGAT TACACTGCGG TATCAATTGA TAATTAA
|
Protein sequence | MSKLKIIRDP IHKDIKLNEE EISIVDMPEI QRLRNIKQTG LTCLVYPSAN HTRFEHSIGT MHVAGEIAKN LENIDRNLTK IVALLHDIGH PPFSHTLEVA GYNHEEFTKE KIKKMSFENY TSKEVLDVYS SKGLEGSLIH GDVDADRMDY LIRDSHHTGV AYGSIDIPRL IRSIVVLEDT NKLGIIEKGR TTVESLLTAR YQMYPTVYMH PASRISETMI KNATIDAIKD SIFKLSDLSV MDDIDLICTL RRSESSATEM MKRLDNRDLF KSISIQRYNE LSPKERWNLI NLSENEIGLI ENEMTEYFES RIFLDIPKPP KMAEHRITVM MGDRKHRLDE ISPLAESLKE AYKKSWSIMV YSEPESAKKL SELIKDREKF LFEFITDGPI DNPILNVLKE HGTVQGVTKL AGLIKKSPND VEFHSELQKL IFCGLVDKKV EPVRGTYRYD YTAVSIDN
|
| |