Gene Mbar_A1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1471 
Symbol 
ID3627752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1806344 
End bp1808401 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content41% 
IMG OID637700359 
Productcell surface protein 
Protein accessionYP_305008 
Protein GI73668993 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.133477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATTA ACAAAAAATC GTATTCAGTG GTCTTAGCAT CAGCAGTTCT GATTTTATTT 
TTAATTCTGG TTTCATCTGC AGCATCGGCG GTTACTGAGC AAACTGCTTC GCTCACTATC
AATGAAACTC CGATAACCAC CAGTGGATCA GCAACGTCTC CTTCTATATA TGGTGACAGG
ATAGTATGGA AGGACTGGCG TAATGGGAAC CGGGATGATG GACCTTTCAA TATTTACATG
TACAATATCT CCACTCAGAA GGAAACTCAA ATTACTACCG GTGGATCAGC ATGTTGTCCT
TCTATATATG GTGACAGGAT AGTATGGAAG GAATGGCGTA ATGGGAACCA GAATGATACC
GAGGAAGAAT ATGATGTATA CATGTACGAT CTCTCCACTC ACAAGGAGAC TCGGATAAAC
AATAGACCAG TCCCTTATGA CTACTTAGGG TATAGCTTCG GACCGTATAT ATCCGGTAAC
ACGATAATGT GGGGTGGCCC ACATGGTTAT TACATTTATG ATATCTCCAC TCAGGAAACT
CATATCGATA ATATGTCAGC ACCCTATCCT GTTTTTTTCG GTAACATAGT AGTGTGGGTA
CCTGATATTG ATGGACGCCC GAGTAATCTC ATAATGTACG ACCTCTCCAC TCACAAGGAA
ACTCAGATTA CCAACAACGA TTCAGCATTC AATCCTGATA TCTACGGAGA TATAATAGTG
TGGGCAGATT GGCGCCAGAG AGATGATGGA TATCAATACT CTGAGATCTA CATGTATGAT
CTTTCCACTA AGAAGGAAAC TCAAATTACC AACAGCGGAT CAACACATGG ATCAGCACAT
TATTATAGTC CTAGAATCTC CGAGGACAGG ATAATTTGGC TGGATACCCG AGATAGTGGA
AGTATTTACA TGTATAATAT CTCAACTCAA GAGGAAACTC AGACTAACAA TACATCAATT
AGGCAAAGGC AAGGATTTAC TTTCTATGGT GATAAGATAG TGTGGTCGGA TTGGAAAAAT
GAAAAACCCA ATGTCTACAT GGGTACTCTC ACCAGTTCAA ACCTGCCAAT TGCTTCCTTC
TCTGCATCTC CAACCTCAGG AAAAGCCCCA ATGAAAGTGC AGTTTACTGA CAAAAGTACC
GGAACACCTA CTTCCTGGTT CTGGAATTTT GGAGACGGAT CAAAGTCATT CCTTCAGAAT
CCGGTTCATA AGTATTCAAA GGCAGGGATT TATAATGTTA GCTTAACGGT AAAGAATGCG
GTAGGCCGTA ACACGGTAAC AAAAACCGGA TATATAAAAG TGGTAGCAAA ACCAGTTGCT
GCATTCTCTG CATATCCTAC CTCAGGAAAA ACACCATTAA ACGTTAAATT TACTGACACA
AGCACAGGAA TACCTGCTTC CTGGTTCTGG AATTTCGGAG ACGGATCAAA GTCATTCCTT
CAGAATCCGG TTCATAAGTA TTCAAAGGCA GGAATATATA CTGTTAGCTT AACAGTAAAG
AATGCAGCAG GACGTAGCGC GGTAACAAAA ACAGAATATA TAAAAGTGGT AGCAAAACCA
GTTTCTGCAT TCTCTGCATA TCCAACATCC GGAAAATATC CATTAAACGT TAAATTTACT
GACAAAAGTA CAGGAACACC AACGAAATGG AAATGGGATT TTGGAGATGG ATCAAAGTCA
TTCCTTCAGA ATCCGACTCA TAAATATTCC AAAGCAGGAA AATACACAGT AACCCTCACA
GTAACCAATG CGGTAGGCAT CAACACAGCA ACAAAATCAA AGTATATAAC CGTGACAGGA
ACTTCGCAAG CTCCGACTGC AGATTTCTGG GGCTGGCCAT TATCAGGAAA AGCTCCGCTA
AAGGTAACAT TCACAGAGAC GAGCAAAGGA TCACCAACCT CATGGAAATG GGATTTCGGA
GATGGAAAAT ATTCAACAGA AAAGAGTCCA ACACACACAT ATTCAGCAGC AGGAACTTAC
ACGGTTAAAC TCATAGCAAC AAATGAAGCA GGAAGTAGTA CAAAATCAAA ATGGAAATAT
ATAAAAGTGG CAAAGTGA
 
Protein sequence
MKINKKSYSV VLASAVLILF LILVSSAASA VTEQTASLTI NETPITTSGS ATSPSIYGDR 
IVWKDWRNGN RDDGPFNIYM YNISTQKETQ ITTGGSACCP SIYGDRIVWK EWRNGNQNDT
EEEYDVYMYD LSTHKETRIN NRPVPYDYLG YSFGPYISGN TIMWGGPHGY YIYDISTQET
HIDNMSAPYP VFFGNIVVWV PDIDGRPSNL IMYDLSTHKE TQITNNDSAF NPDIYGDIIV
WADWRQRDDG YQYSEIYMYD LSTKKETQIT NSGSTHGSAH YYSPRISEDR IIWLDTRDSG
SIYMYNISTQ EETQTNNTSI RQRQGFTFYG DKIVWSDWKN EKPNVYMGTL TSSNLPIASF
SASPTSGKAP MKVQFTDKST GTPTSWFWNF GDGSKSFLQN PVHKYSKAGI YNVSLTVKNA
VGRNTVTKTG YIKVVAKPVA AFSAYPTSGK TPLNVKFTDT STGIPASWFW NFGDGSKSFL
QNPVHKYSKA GIYTVSLTVK NAAGRSAVTK TEYIKVVAKP VSAFSAYPTS GKYPLNVKFT
DKSTGTPTKW KWDFGDGSKS FLQNPTHKYS KAGKYTVTLT VTNAVGINTA TKSKYITVTG
TSQAPTADFW GWPLSGKAPL KVTFTETSKG SPTSWKWDFG DGKYSTEKSP THTYSAAGTY
TVKLIATNEA GSSTKSKWKY IKVAK