Gene Mbar_A0989 details


Gene Information

Locus tag: Mbar_A0989
Symbol:
ID: 3626872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 1208339
End bp: 1210408
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 40%
IMG OID: 637699880
Product: methyl-accepting chemotaxis protein
Protein accession: YP_304539
Protein GI: 73668524
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
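
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the Start bp and End bp values are 1-based and inclusive and that the Gene Length includes the stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(1208339, 1210408)
print(length)                  # 2070
print(protein_length(length))  # 689
```

Both results agree with the Gene Length (2070 bp) and Protein Length (689 aa) reported in the record.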


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATAAAA ACACAAAAAT AGGGCATCTA GTAATAGGCT TTGCTCTTCT TCTTATTTTG 
ATTTTCTTTG TTGGGTACAC AGGTTATCAG GGTATGAACG ATGTTGAAAA GAAAAGTCGA
GCCATTCAAA ACATGACTTT CATCACGAAT AATATGCAGG GAGCCCTGGA AGCTCAGGAA
AGTTATGTTA TTTATGGTGA TCCTGTTTAT AAAGAAGACA CTTATACACA TCTTGATCTT
GTACCCACAC AAGCAGCTAT ATCCAAGGAA ATATATCTTG GCTACCTCGA TCCTGTAAAC
CAGGACCGTA TGGATTCTAT TCTTAAAACT TCTGGCGAAT TCAGGGAAAG TTTTGATAGC
TATGTTAAAG CAGACGATGA AGAAATAGCT CTGAGAACCA ATATCACTTC TAAAGGCGAT
CTTATCTTGA AGAAAGCAGA TGAGCTTTAT CAGGATCAGA TGTTTCAATA TCAACAATAT
TTAGAAAACG GCTCTTCAGG TGAAGTTCTT CAGCAAAAAT TATCCAATGC TCAGGAAGCT
CAGAAAATCA GTATACTTGC AATGGAAGCT CGAAATCAGT ACCAGAATTA TATAATTACT
CCCGAAGATC AGTATGCGGA AAACTTCGAT CGGATTATGG AGAATATAAC TGAAGTTACC
GAGAGTTTAA ATAAGCAGAT GGTAAAACCT GAAAATCTTG AGCTTGGAAA CACAATAATC
GCCAGCGTCA CGGAAATCCG AAGTGATTTT GATAGCTTGA CAGTTCTTAA AGAGAAAAAG
GCGGCTGATG TGAAAAATAT GGCAACTATA GCTGCACAGA TTAATAAAAC TGCTGAAGCC
GCCAGTGCCG ACCAGAAAGA GAAGCTCGAC ACCCTAATCG TAAATTCTAT AAGTAAGATC
TTTCTCGTTA CTCTTCTTTC GATACTTATC GGTGCTTTAC TTGTTTTTGT GATTCTGAAC
CTCTATAGAA AACCCATATA TGAACTGCTC GAAGCGGCCG AGAAAATTTC TAATGGAGAC
CTTAGTGTTG AGATTGAAGG AAACTCGAGA AGTGAAATCT CTCAGCTCTC ACAAGCTTTT
AAATCGATGG TTGAAAACCT CCGTGGCCTG ATTCAAGGAA TTCAGGAAAG TTCGGTTCAC
CTTTCTACAC TTTCAGAAGA AATGTCTGCT TCTTCTGAGG AAGTGGCTTC AGCGTCACGG
AAAATTTCTG ACACTGCAAC TGAAATCTCA AATGGGACCG AAGTGCAGAG TACAAAAATA
GTGGATATAA CTCATGCAAT GCAGGACATG ACACACAATA TCCAGGAAAT TGCGGATAAT
ACCCAGAAAG TTTCTAAGAA CACTAATCTT GTCAATAACA CTATCAATAG TATTGGAAAT
GCCTCAAGAG AACTCTTGGT GAAAATGAAC CATATTCGCT TCTCTGTTGA TGAAACTAAG
GAAGTGATAA CGGAACTTGA TTCCAAGTCT CAACAAATCA ATGAAATTGT AACTCTTATT
ACCAGGATTG CCGATCAGAC AAATATGCTT GCGCTGAATG CTGCAATTGA GGCTGCCCGG
GCTGGTGAGC ACGGCAGGGG TTTTTCTGTT GTAGCCGATG AGGTCCGAAA ACTTGCCGAC
GAATCCGGCC GTGCTGCTAA TAACATTTCT AGTCTAATTG ATGAAATAAG AGGCAGTATC
AGTGAAACCG TGGAAAGTAT AGAGGCCAGT AAAAAAGATG TGCAGGCTGG TTCCCTGTCT
GTTAATAACG CAGTTGAAAT GGTTTCAGGG ATTGTAACTA CAATCAATGA AATTACAAAT
ATGATAGAAG ATGTTGCAGC TGCTACAGAA GAGCAGTCCG CATCCATTGA AGAAATCACT
TCCACTCTGG AGGATATTTC CTCAATTTCG GAACAGTCCA CCGCAGGAAC CCAGGAAACC
GCTGCAGCTC TCGAAGAGCA GAGTGCTTCA ATGTCAGAAC TTGCCAATAT GGCTAGTGAT
CTTTCTTTGC TTGGGGAAAG AATGAAAAAA GCTACGGAAA AATTCAAACT TAGTAACTTA
AAAGAAGGTT CAGAAAATAA GTCAAGTTAA
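
The gene sequence above is laid out in space-separated 10-base blocks. The following is a minimal sketch of how that layout could be stripped and a GC percentage computed. The function names are hypothetical and not part of the source database, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted DNA sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTATAAAA ACACAAAAAT"
print(clean_sequence(first_blocks))  # ATGTATAAAAACACAAAAAT
print(gc_percent(first_blocks))      # 15.0
```

Applied to the whole gene sequence, the same function gives a value that can be compared with the GC content field of the record.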
 
Protein sequence
MYKNTKIGHL VIGFALLLIL IFFVGYTGYQ GMNDVEKKSR AIQNMTFITN NMQGALEAQE 
SYVIYGDPVY KEDTYTHLDL VPTQAAISKE IYLGYLDPVN QDRMDSILKT SGEFRESFDS
YVKADDEEIA LRTNITSKGD LILKKADELY QDQMFQYQQY LENGSSGEVL QQKLSNAQEA
QKISILAMEA RNQYQNYIIT PEDQYAENFD RIMENITEVT ESLNKQMVKP ENLELGNTII
ASVTEIRSDF DSLTVLKEKK AADVKNMATI AAQINKTAEA ASADQKEKLD TLIVNSISKI
FLVTLLSILI GALLVFVILN LYRKPIYELL EAAEKISNGD LSVEIEGNSR SEISQLSQAF
KSMVENLRGL IQGIQESSVH LSTLSEEMSA SSEEVASASR KISDTATEIS NGTEVQSTKI
VDITHAMQDM THNIQEIADN TQKVSKNTNL VNNTINSIGN ASRELLVKMN HIRFSVDETK
EVITELDSKS QQINEIVTLI TRIADQTNML ALNAAIEAAR AGEHGRGFSV VADEVRKLAD
ESGRAANNIS SLIDEIRGSI SETVESIEAS KKDVQAGSLS VNNAVEMVSG IVTTINEITN
MIEDVAAATE EQSASIEEIT STLEDISSIS EQSTAGTQET AAALEEQSAS MSELANMASD
LSLLGERMKK ATEKFKLSNL KEGSENKSS