Gene Mbar_A1479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1479 
Symbol 
ID3627536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1823881 
End bp1825890 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content39% 
IMG OID637700366 
Productcell surface protein 
Protein accessionYP_305015 
Protein GI73669000 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAC AGATGACCTT TTGTACTATA AGCTTAATTT CAATAGCCTT TATTTTGTGT 
TTGTTTTTGT CTACGGCATC AGCAGATACA GCGGAAAGTG TTTTGCCTAC AATCACTGAG
ACTAAAATCA GCACTAGTGG ACACGTAGAC AATCCTGCTA TCTACAATAA CAGAATAGTA
TGGATAGATA TTGATTATGC AGAGCTAGAG ACATACTACG ATGTCTACAT GTATGATCTC
TCCACTAAAA AGGAAACTCA AATAACCACC AGTGGGCATG CAGGCGAAAA TCCTGCAATT
TATGGTGACA GAATAGTATA CTCCGATCAT TACGAAGGCT CGGAAATCTA CATGTATGAT
CTTTCCACTA AAAAAAGAAC TCAAATAACC ACCAGTGGAA ACGCAGAAGA TCCAGCTATT
TACAATAATA GGATAGTGTG GAGAGGAAAT TATGATATCT ACATGTATGA TCTCTCTACT
AAGAAGGAAA CCCAGATAAC CTCAGAAGAA TCAATCCAGA CAAATCCTTC TATCTATGGT
GACAGGATAG TATGGAATGA TGATCGCAAT GGAAATTGGG ATATCTACAT GTATAATGTT
TCCACTAAAA AAGAAACTCA AATCACCAAT GGATCATCAT GGGCAATCGA TCCTGCAATT
TATGGTGACA GAATAGTATG GATGGATGAA CGAAGTGGAA ACTATGATAT ATATATGTAT
GATCTTTCTA CTAAAAAGGA AACTCAGATA ACATCCAGTC CAGATGCTCA GACGCATCCT
GCTATCTATG GTAACAGGAT AGTGTGGGAG GATGATGGTG GAGAGGATGA TGATTATACA
AATCATGGTA TCTATATGTA TGATATTTCC ACTAACCAGA AAATGAAAAT TAGCAACAAA
GGATCAGCAC GCAATCCTGC TATTTACAGT AACAATGTAG TATGGAAATA TGTTGCAGAT
ATTTATGGAA ACGGTGACAT TTACATGGGT ACTATTTCAA GCGGAGAACC AGTAGCATCA
ATTGCTGCAT TCTCTGCAAA ACCTACCTCA GGAAAAGCAC CGTTAACAGT TGCTTTTAAG
GACGAAAGCA CAGGAACACC GACAAAGTGG ATATGGAACT TTGGAGATGG TTCAAAGTCA
TTCCTCCAGA ATCCGGTTCA CAAGTATTCA AAAGCAGGAA CATATACTGT TAGCTTAACG
GTAAAGAATG CCGCAGGACG TAACACGATA ACAAAAACAG ATTGTATAAC CGTGGCAGCA
AAACCCGTTG CTGCATTCTC TGCATCTCCC ACTTCAGGAA AAGTACCACT GAAGGTTCAG
TTTACTGACA CAAGCACAGG AACACCGACA AAGTGGATAT GGAACTTTGG AGACGGATCA
AAGTCATTTC ACCAGAATCC GGTTCACAAG TATTCAAAGG CAGGAACGTA TACTGTTAGC
TTAACAGTAA AGAATGCGGC AGGACGTAAT TCGATAACAA AAACAAAATA TATAACCGTG
ACAGTAAAAC CCGTTGCTGC ATTCTCTGCA TCTCCAACCT CAGGAAAATA TCCATTAAAC
GTTAAATTTA CGGACAAAAG TACCGGATCA CCAACGAAAT GGAAATGGGA CTTTGGAGAC
GGAACAAAGT CATTCCTTCA GAATCCAACG CATAAGTATT CAAAAGCAGG AAAGTACACA
GTAACCCTTA AAGTAACCAA TGCAGCAGGC ATCAATACGG CAACAAAATC AAATTATATA
ACCGTGACAG GAACTGCGCA AGCTCCGACT GCAGATTTCT GGGGCTGGCC ATTATCAGGA
AAAGCTCCAC TAAAGGTAAC ATTCACAGAG ACTAGCAAAG GATCGCCAAC CTCATGGAAA
TGGGATTTCG GAGATGGTAA ATATTCAACA GAAAAGAGTC CAACACACAC ATATTCATCT
GCGGGAACTT ACACGGTTAA ACTCATAGCA ACAAATGAAG CAGGAAGTAG TACAAAATCA
AAATGGAAAT ATATAAAAGT GGCAAAGTGA
 
Protein sequence
MNKQMTFCTI SLISIAFILC LFLSTASADT AESVLPTITE TKISTSGHVD NPAIYNNRIV 
WIDIDYAELE TYYDVYMYDL STKKETQITT SGHAGENPAI YGDRIVYSDH YEGSEIYMYD
LSTKKRTQIT TSGNAEDPAI YNNRIVWRGN YDIYMYDLST KKETQITSEE SIQTNPSIYG
DRIVWNDDRN GNWDIYMYNV STKKETQITN GSSWAIDPAI YGDRIVWMDE RSGNYDIYMY
DLSTKKETQI TSSPDAQTHP AIYGNRIVWE DDGGEDDDYT NHGIYMYDIS TNQKMKISNK
GSARNPAIYS NNVVWKYVAD IYGNGDIYMG TISSGEPVAS IAAFSAKPTS GKAPLTVAFK
DESTGTPTKW IWNFGDGSKS FLQNPVHKYS KAGTYTVSLT VKNAAGRNTI TKTDCITVAA
KPVAAFSASP TSGKVPLKVQ FTDTSTGTPT KWIWNFGDGS KSFHQNPVHK YSKAGTYTVS
LTVKNAAGRN SITKTKYITV TVKPVAAFSA SPTSGKYPLN VKFTDKSTGS PTKWKWDFGD
GTKSFLQNPT HKYSKAGKYT VTLKVTNAAG INTATKSNYI TVTGTAQAPT ADFWGWPLSG
KAPLKVTFTE TSKGSPTSWK WDFGDGKYST EKSPTHTYSS AGTYTVKLIA TNEAGSSTKS
KWKYIKVAK