Gene Information |
Locus tag | Mbar_A2844 |
Symbol | |
ID | 3627012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 3634896 |
End bp | 3637901 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637701694 |
Product | hypothetical protein |
Protein accession | YP_306324 |
Protein GI | 73670309 |
COG category | [R] General function prediction only; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0823] Periplasmic component of the Tol biopolymer transport system; [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
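The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3634896
end_bp = 3637901

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3006
print(protein_length_aa)  # 1001
```

Both results agree with the Gene Length (3006 bp) and Protein Length (1001 aa) fields.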
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0777438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0344466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGGGTAA ATAAATTAAC CGTTTTTTCA GCGGTTGCTA TATTTCTTAT TACTGTGGTT TATAGCCCTG CATCCGCCAG GGAGATTACA GTAGATAATA ATGGCTCAGG TGCAGATTTT CGATCGATTC AGGAAGCCGT AAACAATTCG TCTTCAGGAG ATACGGTTCT TGTCATGCCT GGAACTTATA ACGAAAATAT AATTGTTAAC ATTAACTCGC TGACTATTAA GTCGAAATCA AAAAACCCTG AGGTACTGGT AAAGTCTCTA GAGGAAAACA AAAGCGTTTT TCTTATAACA GCTAGTAACG TAACTCTGAG CGGCTTCAAT ATAACAGGAG CTAAAGGAAG TTACACTTAT TATCCATCTG GGATTTGCCT TAAGAATACT AGAAACTGCA AAATTACAGG CAACACTCTA TTTGAAAACT ATCTGGGAGT GTGCCTTGTT AATGCTGATT ATAATATAGT TTCGAAAAAC TTCCTATTCA ACAGTTCTAT TTCTCTAAAT GAAGGCAGTA ACATGAATAA TCTGAGAGAC AATGCACTTG AAGAAGGGCC CATTTCTCTG TCATACTCCA GGTATAACAC AATCTCAGAA AACTCTCTCT TCAATGGTTC TATTTCTATG GGTGAAAGCG GCATGAATAA CCTGACAAAC AATACCATTG AAAAGGGCTC CATTTTTCTG GCAGCATGGT GCTCACTTAA CCTGATATCT AAAAATAAGA TCTTAAACGG ACAGGGTATA AGTATTGCCT GTTGTGGGGG AGGTGATAAT ATATCTGATA ATGTGATTTC GAATTGCTCC ACAGGGGTTT CTACATATGA TCACGGCATA GATGTTATTA ACAATAGCAT TATAGACTGT TACCGTGGGA TATATATTGC TCAGTCGCCT TCCAGAATTC ATAATAATAC AATCCTGAAC TGCAGCACTG GAATTACTGT GATGGATTCT ACTACTGATA TATCAAATAA CATAATAATC TCCAGTGCAA AATGCGGGCT CTCTATTCCA GACCGAGAGT TTGATGAACG AGTATATAAC AACTACTTTA ACAACACTAT AAACGTAAGG CTGGGAAATC ATGACAAGTA TACCTGGAAC AGTTCACGTG TCTCAGGTAC TAATATTGTA GGCGGGCCAT ATTTGGGTGG CAATTATTGG GCAAACCCGA ATGGAACCGG CTTTTCTGAA GCCTGTACGG ATTCAGATGG AGATTGGATT TGCGATTCGC CTTACAATCT CAATGGAAGC GATTTCGATT TTCTTCCTCT TGCATCTATA TCCAGAACAC AGAATCCACC TGTTGCAAAC TTCAGTATCA ATATCACACA GGGACTTGCC CCTTTTTCAG TCCAGTTCAC AGATTCTTCA CAACATGCAC TTCTATGGAA CTGGGACTTT GACAATGATG GAAAATCAGA CTCTACGGAA AAAGATCCAG TTTATGAGTA TAAAGCTCCA GGAAATTATA CTGTTAATCT GACAGTCAGC AACGCAAAAG GTACGGCCTC GAAAGCTCAG AAGATTATTG CACAGGATGC AAAGAGTCTT CCTGTAGCCG ATTTCAGTGT TAATACTACA AAAGGCCAGG CCCCTCTTTC TGTTATTTTT ACTGATCTTT CGCAAAATGT AGCAAAAAGA GCATGGGACT TTAATAATGA TGGTATTACT GATTCTACAA ACAAAACTGC AGTTTATACC TATACTTTTC CGGGAACCTA TATTGTTAAC CTGACCGTAG GCAATTCAAA AGGATCTTTC TCAAAGTTAT TTCCGATAAC AGTATCTCCT 
GTACGGCGTG TAGATGGGCA ACTTATCCTT ACTGAATATC AGATCACTAC CAACGGAGTA AACCCGAGTC GACCTGCTAT TTATAAGGAT AGAATTGTAT GGTCGGATAA TCGCAATGGA AATCCTGATA TTTATACGTA CGACCTTTCC ACTTCCATGG AAACTCAAAT AACTACAATC AAATCATACG ACTATTCCCC TGAGATCTAT GGTGACAGAA TAGTGTGGAC TGATTATCGT AATGGAAACG GAGATATTTA CCTGTACAAC CTTTCTACTA AAAAGGAAAC TCAGATCACT ACTAATGAAT CCGCCTCAAA TCCTAAAATC TACGAAGATA AAATAATCTG GGTAGATTAC CGCAATGGTG ATATTCAAAA CTTCTCAAAT CCCGATATTT TCATGTACGA TCTCTCTACT CATAATGAGA CTCAAATTAG CAGCAGTGCC TCAGATGATT TTTCTCCTGA CATATATGGA GACAGGATAG TGTGGTGCGC TAAACGACAT GAGTCTGAAA ACTCCAGTAT CTACATGTAT GATCTTTCTA CTTCAAAGAA TACAAAAATA ACCACCAATG AATCACAACA TATGAATCCT GTAATCTATG GTGATAGGAT CATCTGGGAA GATTACCTTA ACGAAAAACG CAGTATACAC ATATATAATC TCTCCACTTC TACAGAAACC CAGATTGCCA CCAGTCAATC AGGCTATCAT TGGCCTGCTA TTTATGAAAA CAGAGTCTTG TGGGCAGATT ATCGTAATGG TCATATAGAT ATCTATATGT ATGATCTCTC CACTCAGAAG GAAACTAGGA TAACCACTAA TGGATTATCA TTTGCAGAGT CTGCTATTTA TGGAGACAAG GTTGTGTGGA CAGGCAACAT CAATGGAAAA AGCAATATAT ACATGTGCAT TATTTCGGAA GAGGGAAAGA TACCAGAACC ACCCATTCCA GACTTTTCTG CATTTCCTAC TTCTGGAGGA GCACCATTAA AAGTATTATT TACTGACAAT AGTACAGGTG GACCGACTTC CTGGCTCTGG GACTTTGGAG ATGGCATTAA TTCAAAGCAT GCTTTAAATG CAACTCATAC ATTTAATGAA CCAGGAAAGT ATAATGTAAG CCTTATAGTA AACAATGCAA ATGGTAGCGC CACTAAAACA ATACCTGAAT GTATTACAGT TTTCAAAAAG GAATGA
|
Protein sequence | MWVNKLTVFS AVAIFLITVV YSPASAREIT VDNNGSGADF RSIQEAVNNS SSGDTVLVMP GTYNENIIVN INSLTIKSKS KNPEVLVKSL EENKSVFLIT ASNVTLSGFN ITGAKGSYTY YPSGICLKNT RNCKITGNTL FENYLGVCLV NADYNIVSKN FLFNSSISLN EGSNMNNLRD NALEEGPISL SYSRYNTISE NSLFNGSISM GESGMNNLTN NTIEKGSIFL AAWCSLNLIS KNKILNGQGI SIACCGGGDN ISDNVISNCS TGVSTYDHGI DVINNSIIDC YRGIYIAQSP SRIHNNTILN CSTGITVMDS TTDISNNIII SSAKCGLSIP DREFDERVYN NYFNNTINVR LGNHDKYTWN SSRVSGTNIV GGPYLGGNYW ANPNGTGFSE ACTDSDGDWI CDSPYNLNGS DFDFLPLASI SRTQNPPVAN FSINITQGLA PFSVQFTDSS QHALLWNWDF DNDGKSDSTE KDPVYEYKAP GNYTVNLTVS NAKGTASKAQ KIIAQDAKSL PVADFSVNTT KGQAPLSVIF TDLSQNVAKR AWDFNNDGIT DSTNKTAVYT YTFPGTYIVN LTVGNSKGSF SKLFPITVSP VRRVDGQLIL TEYQITTNGV NPSRPAIYKD RIVWSDNRNG NPDIYTYDLS TSMETQITTI KSYDYSPEIY GDRIVWTDYR NGNGDIYLYN LSTKKETQIT TNESASNPKI YEDKIIWVDY RNGDIQNFSN PDIFMYDLST HNETQISSSA SDDFSPDIYG DRIVWCAKRH ESENSSIYMY DLSTSKNTKI TTNESQHMNP VIYGDRIIWE DYLNEKRSIH IYNLSTSTET QIATSQSGYH WPAIYENRVL WADYRNGHID IYMYDLSTQK ETRITTNGLS FAESAIYGDK VVWTGNINGK SNIYMCIISE EGKIPEPPIP DFSAFPTSGG APLKVLFTDN STGGPTSWLW DFGDGINSKH ALNATHTFNE PGKYNVSLIV NNANGSATKT IPECITVFKK E
|
| |
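The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. Under translation table 11, GTG is one of the permitted initiation codons and is read as methionine when it starts the coding sequence. The sketch below illustrates this on the first 18 bases only. The lookup table is deliberately partial (it covers just the codons in this prefix), and the function name is hypothetical.

```python
# Partial codon lookup: only the codons that occur in the 18-base prefix.
CODONS = {"GTG": "V", "TGG": "W", "GTA": "V", "AAT": "N", "AAA": "K", "TTA": "L"}

# Initiation codons listed for NCBI translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_prefix(dna):
    """Translate a short coding-strand prefix, treating the first codon as a start."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in TABLE_11_STARTS:
            residues.append("M")  # an initiating codon is read as methionine
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First 18 bases of the Gene sequence field, with the record's spacing kept.
prefix = "GTGTGGGTAA ATAAATTA"
translated = translate_prefix(prefix)
print(translated)  # MWVNKL
```

The output matches the first six residues of the Protein sequence field (MWVNKL). Translating the full gene would need a complete codon table, which this sketch does not include.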