Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC6_0079 |
Symbol | |
ID | 5738541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C6 |
Kingdom | Archaea |
Replicon accession | NC_009975 |
Strand | - |
Start bp | 63129 |
End bp | 66092 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641282545 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_001548135 |
Protein GI | 159904473 |
COG category | [S] Function unknown |
COG ID | [COG5280] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACAAG ACGTGAAATT TAAAGTTCAG GCAACTGAAG CTTATGCGGA TGTATTTCGA GGACTACAAG CTGAAGCAGA AAGTGCATTT CAAGGTATGG AAAATGCAGC ACAAGAAGGC TCAGCAGGAG CAGCAGCTGC ACTTGGACTC GTTCAAACCA AAGCAGGAAC TGCAAGTAGT GAGATCCAGT CAATGGCGAA GAAAGCACAG ACCTCATTTA GGGATCTTGA ATCCTCAATG AGAAGTGCTG GTGTCGCAGC TGCAGCCATA TCTGCACCAC TTGTTGCAGG ACTTGGAGTT GCCATTGACA CAGGGATGGA GTACGAGCAA AGTTTAAAAA ACGTGCAGGC TCTTTTGAAA TCCTCTGAGG ACCAGTATAA GGAATTATAC GATTTTGGAA TTGAAGTTTC CAGGACATCA ATCTATGCAG CAAATGAAAT CGGGGAGGCA ATGTACTACT CCGCATCTGC AGGTCAGACC AATCAACAGA TAATGGCAAG TCTTAACGAT ACCCTAAGCC TTGCAGCTGC TACACAGAGC GATGTTGCAC AGACAAGCGA CTTAATGAAC TCTTCATTTT CAATATTCCA ACTCGAAGCC TCAGAATCTT CAAGGGTTGC AAGCACATAT GCTGAAGCTA TCGCAACATC ACAGGCAAAC CTCACAAAAT TGCAGTATTC AATGAGGCAG GTCGGAACTG TTGCACAGGG AATGGGCTAC GATATCGAGG ACACTACTGC AGCACTGATG ACTCTTTATC AAGCAGGTTA CCAGGGCGAA CAGGCAGGAA CCATATTGTC TAGTGCATTG ACGACAATTT TGGATCTTAC GCCTGAAGCA AAAGCCACGC TCCAGAAATA CAAAATTACA GAGAGCGACC TTCTTGAAGA CGGGGCTGTT AAATCATGGG AAGATGTTAT CACAGTATTT GAAAATGCAG ACATGAACCA GCAGGATCTT GCAACGATAT TCGGTGAAAC ATCCCTCAAA GGTATGATTT CAATGATAGG TCAGGGTTCT GAAGCTCTGA AAACTTATCA AGATAATCTT CATAATGCCT CTGGAGCAGC TGAAGAAATG AAAGAAATCC AGCTCGACTC GATTCCTGGA ATGTGGGCCC TGTTTAAATC AGACATGAAT GCAGTCGAAT TAAATGTTTT TGGGGATATG CGGCCCGAAA TTGAAGAAAT AATTGGATCA TTCCAAAGTG CAGTTCCATT TATCGACGAA TTTGCACGAG CTTCTGTAAG TGGGGTTGTG CTTGTAATCG ATACATTAAA ACCAGTTGCT TCTATTGTAG CAGATATATT TTCAAGTATT CCCGAAGAGG GACAGCAAAC AATTGCAGTA TTGACTGGAG TTGTTGGAGC TGGAGCAGCT GTTGCAGCAC CAATACTTTT AGTGGGAAGT ACAATTGCAG GAAACCTTGC ACTCGCTTCA GGTATGCTCT CAGGTATTGC AGGAAGTAGT GCACTGGCAG GAATCGTTGG AGCATTTAGT GGTGGAGGAA TTGCAGCAGG TGCAGCCACA GCAACCGCCA CAGTATCCGC AGGACTTGCA GGTATTGCTA CAGCTGCCGC TCCAGTAGTT GCAGTTGCAG GTACAATTGC AGCAGCTGCA TACTTTGGAA AACGTGCATG GGATGATAAC AGAGACTCAG TCGACCAGGT AATTCTCTCC TTCAAACAGG CAGGAGCTGC AGTTCGAGAT ACAGTGAACC CAATATTGGA AGATCACAAG GATATTATTG ACAAAACCGG CCAAGTTGCA GGAATTGCAT TAAATTTAGT GTCTGGAGCA ATATCACACG AGGTCGGCAG AGCTCTCGAA GCATACAACA CACTCACAGC AATGTACGAC TCAAACGCAT ATCACATGAA AGATGTAATT GATGGTATAG CCGGGGCCCT TGGCTGGTTA CTTGACTATT CACTCCAGGT AATAGAAGAT ATATTGGATG GAATTATCTG GATGTATAAC AACTGGGATG CAATTTGGAA CCAGATGCAC GATACCGTCG GGATTATCTG GAATCAAATC CTCGAAACAA CTGGAACAGG TGTAAATGAG ATAATCGAGG TAATGAACAC CCTTATCGAT GCATACAATG TAGCAGCTGG AGCATTAGGA AAAGAGACAA TTACCGAATT AAAAACCGTA AATGTTGAAG AATATCAATG GGAAATATCA AACCCTGATG CATTAAAAGA AACAGCTGCA GCACTTACTC CCCCAAATAC TGAAGTTATA GTTACCCCTC AGATCCAGCC AGTTGAAGAA ATCGATGCAC AAACCTTGTT CGGAGATACA AGTCTCGTAT TTACTCCAAA AGTGAATCCT GAGTTTGACT GGAGTGCTTA CAAAGATCCA GATTTAAATC CAACAGTCAC TCCAGAATTT AACTGGGTCA TAAATCCTCC AGAAGTAGAA ACAACAGTAA CGCCCGAATT TGATTGGAGT AATTATGTAA TTCCAGAAGT AAACCCGATA GTTAAACCAG ATGTTGATTG GAGTAATTAT ATGGCTCCAG AATTGAGTAC TAAATTAACT CCAGAAGTAG ACTGGGTTTT GGACGTACCT GAAGTTGAAA CAACAGTCAC TCCAGAATTT AACTGGGCAA ACTACGAGCT CCCCGGATTA AATAATCCAA CAGTCACTCC AGAATTTAAC TGGGCAAACT ATGTCCTTCC AGACAATGGC TCAGGCGTAC TTGCAGCAGT TCCTGTAAAT GCTCCGGATT ACCCCGTCCC TCAGCAGTAC ACTGTAGAAA CCCCTGCCAC AGTACAAAAA ACGGAAAATA CACAATATAT TATAAACGTA CCTCTCGAAG GAATCACGAT TTCGAATAAA GAAGATGCAG AATACCTCGC TGAGGTAATA GACCAAAAAG TTAGCGAGCG ATTGGAAAAA CATACCACTT CAACAGTTAA AAAAGACGTA ACATTTGACA AATACAGCAG GTAA
|
Protein sequence | MGQDVKFKVQ ATEAYADVFR GLQAEAESAF QGMENAAQEG SAGAAAALGL VQTKAGTASS EIQSMAKKAQ TSFRDLESSM RSAGVAAAAI SAPLVAGLGV AIDTGMEYEQ SLKNVQALLK SSEDQYKELY DFGIEVSRTS IYAANEIGEA MYYSASAGQT NQQIMASLND TLSLAAATQS DVAQTSDLMN SSFSIFQLEA SESSRVASTY AEAIATSQAN LTKLQYSMRQ VGTVAQGMGY DIEDTTAALM TLYQAGYQGE QAGTILSSAL TTILDLTPEA KATLQKYKIT ESDLLEDGAV KSWEDVITVF ENADMNQQDL ATIFGETSLK GMISMIGQGS EALKTYQDNL HNASGAAEEM KEIQLDSIPG MWALFKSDMN AVELNVFGDM RPEIEEIIGS FQSAVPFIDE FARASVSGVV LVIDTLKPVA SIVADIFSSI PEEGQQTIAV LTGVVGAGAA VAAPILLVGS TIAGNLALAS GMLSGIAGSS ALAGIVGAFS GGGIAAGAAT ATATVSAGLA GIATAAAPVV AVAGTIAAAA YFGKRAWDDN RDSVDQVILS FKQAGAAVRD TVNPILEDHK DIIDKTGQVA GIALNLVSGA ISHEVGRALE AYNTLTAMYD SNAYHMKDVI DGIAGALGWL LDYSLQVIED ILDGIIWMYN NWDAIWNQMH DTVGIIWNQI LETTGTGVNE IIEVMNTLID AYNVAAGALG KETITELKTV NVEEYQWEIS NPDALKETAA ALTPPNTEVI VTPQIQPVEE IDAQTLFGDT SLVFTPKVNP EFDWSAYKDP DLNPTVTPEF NWVINPPEVE TTVTPEFDWS NYVIPEVNPI VKPDVDWSNY MAPELSTKLT PEVDWVLDVP EVETTVTPEF NWANYELPGL NNPTVTPEFN WANYVLPDNG SGVLAAVPVN APDYPVPQQY TVETPATVQK TENTQYIINV PLEGITISNK EDAEYLAEVI DQKVSERLEK HTTSTVKKDV TFDKYSR
|
| |