Gene Mbar_A1575 details


Gene Information

Locus tag: Mbar_A1575
Symbol:
ID: 3625351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 1938410
End bp: 1940803
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 46%
IMG OID: 637700457
Product: hypothetical protein
Protein accession: YP_305102
Protein GI: 73669087
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
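
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Both assumptions are consistent with the values listed but are not stated by the record itself. The constant and function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1938410
END_BP = 1940803
GENE_LENGTH_BP = 2394
PROTEIN_LENGTH_AA = 797


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1940803 - 1938410 + 1 gives 2394 bp, and 2394 / 3 - 1 gives 797 aa, matching the listed gene and protein lengths.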


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.842074
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAGG ATATGTCAAG CGCCGACGTT GCTGCAGTTG TTGCCGAGCT GTCAGCAGGT 
CCAAAATCCA TAATTGATGC GAAGATCGGG AAGATCTATC AGCCCGCAAA CGAGGAAATT
CGCATCAATC TTTACGTTTT TCACCAGGGC AGGGACAACC TGGTTATTGA GGCCGGGAAG
CGCATTCATC TTAGCAAATA CCTCAGGGCA AGCCCGACGC TTCCCCAGGC TTTTCCGATG
CTGCTTAGAA AATACCTGAT GGGAGGCAGG ATTGTATCTG TTGAGCAGCA CGACTTTGAC
AGGATCGTGA AAATTGGAAT TGAGAGAGCT GGAGTCCACA GTAATCTTAT TGTGGAGCTT
TTCGCCCCTG GGAACATACT TATTGTCGAT TCGGAAAACA GGATTATTCT GCCTATGAAT
CCTGTAACAA TGAAGGATAG GCGGCTTAGG AGTGGAGAAA TTTACGAACT GCCTGAGGCA
CAGATAAGTC CTCTCAAGGT AAAGGATTCT GACCTTATGA AGGAGTTTTC CAGGTCAACC
TCTGATATCG TCAGGACGGT TGCCACAAGG TTCAATCTGG GTGGAGTTTT GGCAGAAGAA
GTCTGTGCCA GGGCAGGAAT TGACAAATCA AAGCCTTCAA AGGAAGCAAC TGAAGAAGAC
GCCTCCATGA TCTGCAATGC AATGCACGAT CTCTTTTCTC CGCTTTTGAT GACAGGAGCC
GGAGAGAAAG GTCTCACAGA AGCCGAAATC GAGACTGAAC CTAAATCCGA TATCGAGTCT
AAGCCAAAAA CCGAGATTAA ACCCGAAACC GAGATTAAAC CCGAAACCAA AATTAAACCC
GAAGTTGGGG TTGAAGGCGA GGCTCCTAAT CTCAGGCCTC AGCATGTGAA AAAGGAAATC
AAAGGAAAGC TGGAAACTTT TGATGTACTT CCTTTTGATC TTACTCGCTA TTCCGGATTT
GAAAAAGAGT ATTTTGATTC TTTTAACACG GCACTTGATG AATTTTTTGG GAAAAAGGCA
CTTGAACAGA TCGAAGAGGT AAAAGCAGCT AAGAAAAAGG AGAAAACACT TGGCGTTTAT
GAACGGCGGC TTCTTCAACA GGAAGGGAGT CTTAAAAAGT TTGAAAAAGA AATCGAGAAA
AATAATACCC TTGCTGAGAC AGTCTATGCA AACTACCAGG GTATCGAAGA GCTCCTTTCC
GTGCTCAATG GAGCAAGGTC AACGGGATAT TCCTGGGACG AGATCCGCTC CATTCTGAAG
CAGGCTAAAA AGACCGTGCC TGCCGCGCAG AAAATCACAA ATATTGACCC AAGGACAGGG
ACTGTAACTG TGAATTTTGA CGGAAAGAGT ATTAGTCTTG ACATCCGCAA AACAGTGCCA
CAGAATGCTC AGGAATACTA TGAAAAGGTC AAGAAATTTA ACAAGAAAAA AGACGGAGCT
CTCAAAGCTA TCGAAGATAC CAGGAAGGCT ATGGAGAAAA AGGCCGTGGC AAAGGTTGCA
AAGGCAGGAA GAAAGCTCCG GGCGTCCAGG AAAAAACACT GGTATGACAG GTTCAGGTGG
TTCGTGTCCT CAGACGGCTT TTTTATTGTG GGAGGTAGGG ATGCAGACAC CAATGAGGAG
ATATTTAAGA AATACCTGGA AAAGAGAGAC CTTGTCTTCC ATACTCAAAC ACCAGGGGCT
CCTCTCACAG TTATCAAGAC CGGCGGAGAA GAGGTTCCTG AATCTACTTT GCAGGAAGCG
GCACAGTTTG CTGTTTCTTA TTCCAGTCTC TGGAAAGCAG GGCATTTCAG TGGGGACTGC
TACTGGGTTA AAGCCGAGCA GGTCAGCAAA ACTCCAGAAT CAGGAGAATA TGTGAAAAAA
GGAGCTTTTA TCATCCGTGG GGAACGCAAT TACTTTAAGG ATATTCCTCT CGGCGTTGCA
GTTGGGCTTG AACTCAAAGG CGAGACAAGG GTTATAGGTG GGCCTGTTTC TGCTGTCCGG
AAACATGGGG ATTATATCCT CGAAGTCGTC CCCGGGGCTT TTAACCAGAA TGATATCTCT
AAAAAGATCT ACAGGATTTA TGCCGACGAA CTCAACGATC CCCGCTTCGT AAAGCAGATT
GCTTCTCCTG ACCAGATCGC TATGATGGTC CCACCCGGAG AATCGGACTT AAAGAGTCAG
AAGCCGAAAA GGAAGGGAGA AAAGATCAAG GGTGAGGGCG AGGAACATGA GTCTCAGGAA
GTTGAGACAG AACTTGAAGA TAACGGGGAC GAAATTGAAA AGGAATTCGA AAAAAACTTC
GGAAAGGGAA CTAAAGAGGA ATTCGAGGAA AAACTCGCTG GAAAAATCCC AGAAAATAAA
ACCGGGGATG GAAAGGAAGA GAAAATGGAA CTTCATGGAG GTAAAAAGGC ATGA
 
Protein sequence
MKQDMSSADV AAVVAELSAG PKSIIDAKIG KIYQPANEEI RINLYVFHQG RDNLVIEAGK 
RIHLSKYLRA SPTLPQAFPM LLRKYLMGGR IVSVEQHDFD RIVKIGIERA GVHSNLIVEL
FAPGNILIVD SENRIILPMN PVTMKDRRLR SGEIYELPEA QISPLKVKDS DLMKEFSRST
SDIVRTVATR FNLGGVLAEE VCARAGIDKS KPSKEATEED ASMICNAMHD LFSPLLMTGA
GEKGLTEAEI ETEPKSDIES KPKTEIKPET EIKPETKIKP EVGVEGEAPN LRPQHVKKEI
KGKLETFDVL PFDLTRYSGF EKEYFDSFNT ALDEFFGKKA LEQIEEVKAA KKKEKTLGVY
ERRLLQQEGS LKKFEKEIEK NNTLAETVYA NYQGIEELLS VLNGARSTGY SWDEIRSILK
QAKKTVPAAQ KITNIDPRTG TVTVNFDGKS ISLDIRKTVP QNAQEYYEKV KKFNKKKDGA
LKAIEDTRKA MEKKAVAKVA KAGRKLRASR KKHWYDRFRW FVSSDGFFIV GGRDADTNEE
IFKKYLEKRD LVFHTQTPGA PLTVIKTGGE EVPESTLQEA AQFAVSYSSL WKAGHFSGDC
YWVKAEQVSK TPESGEYVKK GAFIIRGERN YFKDIPLGVA VGLELKGETR VIGGPVSAVR
KHGDYILEVV PGAFNQNDIS KKIYRIYADE LNDPRFVKQI ASPDQIAMMV PPGESDLKSQ
KPKRKGEKIK GEGEEHESQE VETELEDNGD EIEKEFEKNF GKGTKEEFEE KLAGKIPENK
TGDGKEEKME LHGGKKA
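
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own method: it builds a codon table from the standard amino-acid assignments, which translation table 11 shares for all internal codons (table 11 differs in which codons may serve as starts; this gene begins with ATG, so that difference does not matter here). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence and first 10 residues of the protein.
    gene_start = "ATGAAGCAGG ATATGTCAAG CGCCGACGTT"
    protein_start = "MKQDMSSADV"
    print(translate(gene_start) == clean(protein_start))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the full protein sequence would extend the same check to the whole record.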