Gene Mbar_A2845 details

Gene Information

Locus tag: Mbar_A2845
Symbol:
ID: 3627013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 3638226
End bp: 3641108
Gene Length: 2883 bp
Protein Length: 960 aa
Translation table: 11
GC content: 37%
IMG OID: 637701695
Product: cell surface protein
Protein accession: YP_306325
Protein GI: 73670310
COG category: [R] General function prediction only; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0823] Periplasmic component of the Tol biopolymer transport system; [COG3291] FOG: PKD repeat
TIGRFAM ID:
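
The coordinate and length fields in this record are mutually consistent, which the short Python sketch below checks. It assumes that the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3638226
end_bp = 3641108

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
STOP_CODON_NT = 3
protein_length = (gene_length - STOP_CODON_NT) // 3

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed gene length of 2883 bp and protein length of 960 aa.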


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.726454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0327621
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGGGTAA ATAAATTAAC CGTTTTTTTA GTGATTACTA TCTTTTTTAT TATTGTGACA 
TATAACCCTG CATCCGCCAG GGAGATTACA GTAGATAGCA ATGTTTCAGA TGCAGATTTC
CGCTCTATTC AAGAAGCAGT AAACAATTCT TCTCCTGGTG ATGTGGTTTT AGTTTCCCCA
GGAATTTATA GAGAAAGTGT GGATATAGAA ATACAGAATA TAAGCATCCT TTCGGAGTCT
GGAAATCCCA AAGACACTTT TGTTAGGGCT TTTAACGTAA GCACGAATAA TATTACTGTA
AGCGGATTTA GTATAGAAGA AATCTTAAAC TTACGAGGAC ACAGCGACTA TTATTACTAT
TACAACTATC CAGCTGAAAA CTGTACTATT AAAAATAATA TTTTGAAATC GGGTATTTAT
AGTAACGAAT GTTTCAATTC AACTATTGAG AAAAATATAT TTTTAAAATC AGGTATCAGT
GTATATGGTC CTGATAAAGG TTCTTTTACA ATTTCTGACA ATTTAATTGT TAATGGGGAC
ATTGATTCTC ATCTGGGACC AAATATTGAT TTGCTTAACA ATACCATTCT AAAGGGTAGT
ATAAGATTGG GTGAAGGTGG GAACAACAGA ATTATTGGAA ATTATATTTC AAATAGCTCA
TATTTTGGAA TTAGCTTCTG GGAATCTTAT TCCAATGAGA TCGAAAATAA TACGGTTGTG
AATTGTAGTA ACGGCATCTC CATAGAGTTT CTCTCATCGC AGAATATAAT CAACAATAAC
ACCCTGATTT GTAATGATAA AGGAATTTTA GTTGGAGGAC ATGTATCAGG GGAAAACATA
ATTTCAAATA ATACAGTCTC AAGCAGCAAT ATAGGAATAC TGTTGAGCGG TTCGAGCTCA
GTTGGAAATC CGGCAGGAGG CAATTCACTA TTAGGCAACA CTATCTCAAA CAACAATATC
GGGATATTGT TCGAAGGTTA TTCTTCTAGT AATCTGGTGA GAAATAACAG GGTGGAACTA
AACAAGCAAT GCGGAGTATA TATCAACAAC GTCGGATGTG GAGCGCGATA TGGTTCTACT
AACCAGTTCT ACAACAACAT CTTCAATAAT ACAATCAACT TCTTTAACGA CACGAGTAAT
TATACACATG ATTATACGAG TATCTATACA GGCAATTCCT ATACTATACA ACCAATAGGC
AATGGAACCG GCATAGTTCC TGTTGCCTTG AATACCATAA AAACCTCAGG CACTAATATT
GTAGGTGGAC CTTATATTGG TGGTAATTAC TGGGCAAAGC CTGATGGAAC CGGTTTTTCG
CAAATCTGCG CTGATTCGGA TGGAAATGGA ATTGGCGACC TGCCTTACAA TATAACCGAG
AATGAGACTG ATTATTTCCC TCTTGTATCT TCATCAAGAT CAAAAGAAAC AATAATTCCC
GTTGCAAATT TCAGTACTAA CATCACGCAT ACTCTTGTCC CACTTTCTGT CCAGTTTATA
GATCTTTCAC GAAATGCAGT TGCATGGAGT TGGGACTTTG ACAATAATGG AATGCCGGAT
TCTACAAGCC AGAATCCGGT TCATGTGTAT ACATCACCGG GAACTTACAT TATCAACTTA
ACAGCAAGCA ATGGAAAAGA GACATCCTCA AAAACTCACG AAATAATTGT GCAAGAAGCT
AAAGTTCCTC CTGAAGCAGA ATTCTATGCA AATGTTACAG GCGGACAGGT TCCTCTTTCA
GTCCAGTTTA CAGACCTTTC AAAAAATACA ACAGCAGTAG TATGGGATTT CAACAGTGAT
GGAATTCCTG ACTCTACAGA AAGAAATCCA GTTTATGTAT ATACCTATCC TGGAAACTAT
ACCATTAATC TGACAGTAAA CAATATAAAA GGCATGGATT CAAAATCATC TACAGTAACA
GTATCTCCTT CACAACGTCT GGAAGGTAAA CTCATCCTAA CGGAATATCA GATTACTACC
AGTGAATTGG ATGAGACACA ACCTGCAATC TACGAAGACA GAATTGTATG GCAGAGTAAT
TGTAATGGAA GTTATACTTT ACATCTATAC AATATATCCA CTTTTTCGGA AACTCCAATT
GTTAGCAGAA ATAATTCTGA ATTTTACCCC GCTATCTATA ATGACAGGAT TGTGTGGCTG
GAATCCGGGA ACATATACCT GTACAATCTC TCAACTTCTA CTAAAACTCT GATCTCAAAT
CAATTGGGAA TATATCCTGC TATTTATGGT GATAAGATTG CCTGGCAGGG TGAATGTAAT
GGAGATTGTA TATATATGTA TGATGTATCC AGCTCAAAGC AAGCTCGGAT AACAAACAAT
AAATCATCAT CTTATCTGCC TGCTGTCTAT GGGAACAGAA TCGTGTGGGA GAGTCGTCGC
ATTGCCAATG GATCTTCTAA TATCTTTACG TATGATCTCT CTACTGAAAA GGAAACTCAG
ATAAGTACAG ACGAATCCTA TCAGCAATCT CCTGCCATCT ACGGGAACAG GATTGTCTGG
GGAGATTACC GCAATGGAAA CAAGGATATC TATATGTACA ATTTATCTAC TTCCGAGAAA
ATCCAGATAA GCACCAGTGG ATTAGCATTC GATCCTTCTA TCTATGGAAA CAGAATAGTA
TGGCGGGATA GCCGCAACGG TAAGGAATAC ATAGAGAACT CAAATATCTA CATGTACGAC
CTTTCCACTA AAAAGGAAAC TCAAATTACC ACTAGTGGAC CAGCAAGCTC TCCAGCTATT
TATGGGAACA AGATCGTATG GGAAGATAGA CGCAATGGAG ATGCCGATAT CTATATGTGC
ATTATCTCGG AACAGAGGGA AGAATCACCT GATGCAGATT TTTCTGCATC CTCTATTTCT
TGA
 
Protein sequence
MWVNKLTVFL VITIFFIIVT YNPASAREIT VDSNVSDADF RSIQEAVNNS SPGDVVLVSP 
GIYRESVDIE IQNISILSES GNPKDTFVRA FNVSTNNITV SGFSIEEILN LRGHSDYYYY
YNYPAENCTI KNNILKSGIY SNECFNSTIE KNIFLKSGIS VYGPDKGSFT ISDNLIVNGD
IDSHLGPNID LLNNTILKGS IRLGEGGNNR IIGNYISNSS YFGISFWESY SNEIENNTVV
NCSNGISIEF LSSQNIINNN TLICNDKGIL VGGHVSGENI ISNNTVSSSN IGILLSGSSS
VGNPAGGNSL LGNTISNNNI GILFEGYSSS NLVRNNRVEL NKQCGVYINN VGCGARYGST
NQFYNNIFNN TINFFNDTSN YTHDYTSIYT GNSYTIQPIG NGTGIVPVAL NTIKTSGTNI
VGGPYIGGNY WAKPDGTGFS QICADSDGNG IGDLPYNITE NETDYFPLVS SSRSKETIIP
VANFSTNITH TLVPLSVQFI DLSRNAVAWS WDFDNNGMPD STSQNPVHVY TSPGTYIINL
TASNGKETSS KTHEIIVQEA KVPPEAEFYA NVTGGQVPLS VQFTDLSKNT TAVVWDFNSD
GIPDSTERNP VYVYTYPGNY TINLTVNNIK GMDSKSSTVT VSPSQRLEGK LILTEYQITT
SELDETQPAI YEDRIVWQSN CNGSYTLHLY NISTFSETPI VSRNNSEFYP AIYNDRIVWL
ESGNIYLYNL STSTKTLISN QLGIYPAIYG DKIAWQGECN GDCIYMYDVS SSKQARITNN
KSSSYLPAVY GNRIVWESRR IANGSSNIFT YDLSTEKETQ ISTDESYQQS PAIYGNRIVW
GDYRNGNKDI YMYNLSTSEK IQISTSGLAF DPSIYGNRIV WRDSRNGKEY IENSNIYMYD
LSTKKETQIT TSGPASSPAI YGNKIVWEDR RNGDADIYMC IISEQREESP DADFSASSIS
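
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. It is not the pipeline that produced this record. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code, and it makes the simplifying assumption that the first codon is the initiation codon and is read as M (which is how the leading GTG corresponds to the leading M above). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base blocks used in this record) is ignored.
    Assumption: the first codon is the start codon and is emitted as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGTGGGTAA ATAAATTAAC CGTTTTTTTA"))  # MWVNKLTVFL
```

The printed residues match the first block of the protein sequence. For real work, an established library is a safer choice than this hand-rolled table.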