Gene Information |
Locus tag | Mbar_A2795 |
Symbol | |
ID | 3625123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 3570555 |
End bp | 3573440 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637701646 |
Product | hypothetical protein |
Protein accession | YP_306276 |
Protein GI | 73670261 |
COG category | [S] Function unknown |
COG ID | [COG1572] Uncharacterized conserved protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.827572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000678695 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | TTGATATTGT TAACTCTGAT TCTGACCAGC GGGGTCTGCC TGGCTGACAA TTTCGTAGGC GGAATCCCTC TGACTTCTGT CCACAGCGGA ACAGTAAGTG GAGGTGTTTA TTGTGATAGC TACTACGGAA CAGCAGATCA GGCAATACAC ACCGATAAAA CTATAGACAA GACCTTTACA CTGCCTGATG ATGCTGAAGT TGAATGGGCA ATGTTGCTCA CAACCGTATA CTGCGGGCAC ATGCAAAACA ATTACCAGGG AACGGCAACA GTAAGTTTTA ACGATAAAAC TCTAGGAACA GAAACCCTCA ACGTTCCTTA TGAATACATT ACCAACGGGG GTAATGACGG AAAAGCATAT GTCCAGGTAA ACGACCATGT TGACAGGGTA ACAAGTGACT ACATGATGTA TTATGATGTC ACAAGCCTTG TAAAATCCGG GAAAAATAAA GCTACTGTAC ATACTGAGCC TACTGACAAG AATTTTGATG GTAGAATAAA ACTGATAACT CTTATTGTAG CTTACAATGA CGGTAGCGGA AAAAAGATCT GGTATCAGGT AAACCGCGGG CATGATCCAG ATACATATTA TGTTGATGAC AATAGAGAAA CTTACGTTGG GAGCACCGCA TTTAAGGCAG CTTTGCCATC TGATGCGTCC TTGACAGATG CAACACTTAC AGACGTACAC ATGGCAAGTA CAGACGGATC ATACACTTTC AACGGAAAAG CACTCACTTC AGGTACACCT CAGGGTACTT ATTGTGGGCT AGATTCTTGG GATGTTACGG ACGATTTTAA ATCTACTGGC ACAAACACTC TGACCTATGA TCGGACTGGC GCGTTCTATA AAAATGCACT TGCTATACTG ACGGCCGAAT ACACTACTTC ATCCTCAGAC AACGATAGTG GAGATAATTC TTCGGACAAC AATAGTGGAG ATAATTCTTC GGACAATAAT AGTGGAGATA ATTCTCCGGA CAACAATAGT GGAGACAACT CATCAGACAA TGGTAGTGGA GACAACTCGT CAGACAACAA TAGTGGAGAC AACTCATCCA ATGGTAGTGG AGACAACTCG TCGGACAACG ATAGCGGACA AACAGTATCA TCTGACCTGA GCATACAGGA TTTAAAAGTC TTGCACAATA ATGGGAATAA AGTATGGGAC AACCTGAACA ATACCGTAAA AGTAAATATA ACAAACAGTG GACCAGACGA TGCAGGTGAT TTTGCCGTCG AACTTTATGC TGAGACAGCT CTTGTTGAAA GCAAACCAGT TTCGGGTCTT GCAAATGGGG CAGCGGAAAC AGTTGAACTT AACTGGAAGC CAGAAGAAGC AAAGAATTAC ACTCTTAAAG CAGTTATAGT TCCTGGTTCT ACGATCAACG ACCCGACAGC AACAAACAAT AAACTGAGCA AGACTCAGGA AGTGCAGCAT AATGGATATG CAGGAGATAA GCCTCTTGAG ACTTATGCCC ACAACACTGT AAAAGGTGAC ATTATTTATG ATTATGGAGA CAGCAAGTAC AGCGGCAAAG TATCTTCTGG CAGCACATAC ACAGTAAATC ACAACCTGAA GCTTCCTGCA AATGCAACTA TAAAGCTTGC AAGACTCTAT AATTACTGGA CATGGAGTGC TACAGGCACT ACCGGAGTTG ATCCTTCCAT GAGCCTGAAG TTCCAGGGGA CTTCTCTGAA TCCGGAAGCA AAATACAGTG ACCAGAAGGG ATGGGGCTCT GTATATGACT ACCCAAGCGG TACCTGGGCT TATGATGTAA CGGATCTTGT AAAAGGAAGT 
GGAAATTATA CTACAGTAGT TACAAATATC AATAGTGAAA CTGGAAATTT TGTCTGCTTT GATGGAATCG GCTTACTTGT GGTGTATGAA GATGCCACAG GAAACGAAAC CGAGTACTGG ATAAACGAAG GCTGTGACAT GGTGAGTACA ATGAGTACTT CAGGCGGTCT GACTCCTGAA GACGCTACCG TAAAAATCCC ATTCAACGGT TCTATAAATC TCAGCAATGT AGACAGCGCC AAGCTCTGGA CTACAGTTCA GTCTGGAGGA CATGATGGCA TTATCTTGCA ATTCAATGAA ATGAATAAGT CTGGTGTTTA TGACTCAACT CCCTACTCAG ATCTGGATAT TGATGAAGCA AGGCCTGTTG ATAATTATCT CCTGACCAAG AATAACATGG CTCAGATAAT TGCTCCTTCT GTTACAGATA ATAGTGGGGA CTACCTTGCT CCTTCGAGTG CGATTCTTGC TGTCAGTTAT AAAGGCGGAA CCAGTGACAA TGGAACCAGT GATAACGGAA CTAGTGACAA TGGAACCAGT GACAATGGAA CCAGTGACAA TGGAACCAGT GACAATGGAA CCAGTGATAA CGGAACCAGT GATAACGGAA CCAGTGACAA TGGAACAAGT GACAATGGAA CAAGTGACAA TGGAACAAGT GACAATGGAA CCAGTGACAA TGGAACCAGT GACAATGGGA CCAGTGACAA CGGAACAAGT TCATCTGGAT CGGATTCAGC TACGGTATCA CTTACTGTGA ACATCACACC TGTAATTTGC TTGCAGGTCA CACCGAACTC AGTAGATTTC GGAACGTTAA AACCGGGAAC AACGAGTGAA TCCGTACCTC TGACCCTCAA AAACAATGGA CATGGTAGTA TAAAAGTAAC GGCTGAAGTA GAAGACCAGG ATGACGGCCC ATTCAATACA GGGCTTATGC TTGACCAGAA CAAATGTTCT GATTACAGTA AAACGATAGC TTCAAACACA TCTGAAACCT CGGAAGCCCA GCTCGAACTA CCAGAAAATT ATTCGTCTAC AGGACAGTTT AATGGAAGCC TTATCTTTTG GGCAGAAGCA GCCTAA
|
Protein sequence | MILLTLILTS GVCLADNFVG GIPLTSVHSG TVSGGVYCDS YYGTADQAIH TDKTIDKTFT LPDDAEVEWA MLLTTVYCGH MQNNYQGTAT VSFNDKTLGT ETLNVPYEYI TNGGNDGKAY VQVNDHVDRV TSDYMMYYDV TSLVKSGKNK ATVHTEPTDK NFDGRIKLIT LIVAYNDGSG KKIWYQVNRG HDPDTYYVDD NRETYVGSTA FKAALPSDAS LTDATLTDVH MASTDGSYTF NGKALTSGTP QGTYCGLDSW DVTDDFKSTG TNTLTYDRTG AFYKNALAIL TAEYTTSSSD NDSGDNSSDN NSGDNSSDNN SGDNSPDNNS GDNSSDNGSG DNSSDNNSGD NSSNGSGDNS SDNDSGQTVS SDLSIQDLKV LHNNGNKVWD NLNNTVKVNI TNSGPDDAGD FAVELYAETA LVESKPVSGL ANGAAETVEL NWKPEEAKNY TLKAVIVPGS TINDPTATNN KLSKTQEVQH NGYAGDKPLE TYAHNTVKGD IIYDYGDSKY SGKVSSGSTY TVNHNLKLPA NATIKLARLY NYWTWSATGT TGVDPSMSLK FQGTSLNPEA KYSDQKGWGS VYDYPSGTWA YDVTDLVKGS GNYTTVVTNI NSETGNFVCF DGIGLLVVYE DATGNETEYW INEGCDMVST MSTSGGLTPE DATVKIPFNG SINLSNVDSA KLWTTVQSGG HDGIILQFNE MNKSGVYDST PYSDLDIDEA RPVDNYLLTK NNMAQIIAPS VTDNSGDYLA PSSAILAVSY KGGTSDNGTS DNGTSDNGTS DNGTSDNGTS DNGTSDNGTS DNGTSDNGTS DNGTSDNGTS DNGTSDNGTS DNGTSDNGTS SSGSDSATVS LTVNITPVIC LQVTPNSVDF GTLKPGTTSE SVPLTLKNNG HGSIKVTAEV EDQDDGPFNT GLMLDQNKCS DYSKTIASNT SETSEAQLEL PENYSSTGQF NGSLIFWAEA A