Gene Mbar_A3566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3566 
Symbol 
ID3626495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4578374 
End bp4580062 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content43% 
IMG OID637702398 
Productdipeptide/oligopeptide-binding protein 
Protein accessionYP_307015 
Protein GI73671000 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.672958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0864129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAAAA AAAAGTTATT ATTTTCTATT TTCTTGACAG CCCTGATTTT AATAACGGCG 
GGCTGCGCTA ATAAGGACAG CCCTCAGGCC GGGACATCGG CTGAAAACGC TTCTGATCAG
ACCGGAACAT TGGTTGAGAA CCCTTCCGAC GGATCAAAGT ATGTGAATGT TGTTAATCTA
AGCGGCGGGG ATTATGGCTA TCCACAGCCA TTTTCGATAT ATCCGAGAGG TCCTGGGTCA
TCAAAAGTTG GAATGATCTT TGACAGTCTG TTCGAAAGGG ATGAAAAAGG TATAATTCCC
TGGCTGGCTG AAAGCTGGGA TGCCAATTCA AATGGAACAG AATATACAGT TTATCTCCGT
GACGGTGTCA ACTGGAGCGA TGGAGTGCCT TTTACGGCAA ATGATGTTAA ATTTACTTTT
GATTATGAGC AGAAAAATGT ACCCATATCA GGTGGAATTG AGTCCGGTAT TATAGATAAT
GTTCAGGTCG TGAATTCCAG TACCGTCAAG TTCGTACTCA CGCAGCCTGC TTCTCCATTT
ATTTATAAGG TCACGAGTTT CAAAATCATA CCTGAGCATA TCTATAAAAA TGTCTCCGAT
CCTACCAGTT TCCTTGACCC AGAAGCAGTC ATCGGTACTG GCCCGTTCAT TCTTGATGAG
TACAACAAAG AGCATGGAAC ATATCGGTTT GTAGTAAATG AGAATTTCTG GGGACCGGAA
CCTGCCGTTA AAGCCGTTGA ATTTATTCCG GTCAGCGACT CATTAATAGC TTTTGAACAA
GGACAAATAG ATTTCACAAG TATATCGCCT GATACTCTTG ACCGGTTCAA ATCAGATTCT
GATATAAGAA TAGTCCAGCA GCCGGCTTTC TGGGGTTACC AGTTTTATTT CAATATGAAA
AACTGTCCTG AGCTGAATGA CAGTAGAATA AGGCAGGCCT TTGCTTACGC CATTGATCGC
GATGAACTGG TGGAAAAGAT CGCAAGAGGT GCAGGGAAAG CCGGTAAAAT GGGCATACTC
CCTGAAGACC ATATCTGGTA TAACTCTGAC CAGCCGAAAT ATGACTACAA TCCGGATAAA
GCCCGAGCAT TGCTTGAAGA AGCCGGATGG ACTGACACAG ATGGGGATGG GATACGTGAT
AAAAACGGGG AAAAACTGTC ATATGTATTA TCTCTTGGAT CATCTGCTGC TGGCAATAGC
GAAGTCCGTA TCGGCGAACT TATAAAAGAG AGACTAAATG AAGTAGGAAT TGACGTTCAG
GTAAAAGCCC TTGAGAGCAA ATCCCGTGAT GCCAATCTAA AGAGCGGAGA CTTTGAACTT
GCGATCAGCG GCTTTGGCGG CTGGGGACAG GATGCAGATT ATCTCCGTAC AAGATACTGT
GACACAGGTG CACAGTCAGG AAGTGTATCA TCTGGAGCAG CAGTATTTGG TTACCACAAC
GATACCCTGA ATGATCTTGG TGCTCAGGAA TTACAGGAAT TGAACGATGA TAAACGGAAA
GAAATAGTAT ACAATATGCA GACCGTGCTT GCTAATGATG TACCCGCAAT ACCGCTCTAT
TATACTACAT CATATGATGT ATGGCGCATT TCAAAATATG ACGGCTGGAT GAATAGGTAC
GATCACCATG CAAGAACACA CAATATTCTT TCGTATTTAG AGAGGGATGG AATTGCAGCA
AAAAGATAA
 
Protein sequence
MEKKKLLFSI FLTALILITA GCANKDSPQA GTSAENASDQ TGTLVENPSD GSKYVNVVNL 
SGGDYGYPQP FSIYPRGPGS SKVGMIFDSL FERDEKGIIP WLAESWDANS NGTEYTVYLR
DGVNWSDGVP FTANDVKFTF DYEQKNVPIS GGIESGIIDN VQVVNSSTVK FVLTQPASPF
IYKVTSFKII PEHIYKNVSD PTSFLDPEAV IGTGPFILDE YNKEHGTYRF VVNENFWGPE
PAVKAVEFIP VSDSLIAFEQ GQIDFTSISP DTLDRFKSDS DIRIVQQPAF WGYQFYFNMK
NCPELNDSRI RQAFAYAIDR DELVEKIARG AGKAGKMGIL PEDHIWYNSD QPKYDYNPDK
ARALLEEAGW TDTDGDGIRD KNGEKLSYVL SLGSSAAGNS EVRIGELIKE RLNEVGIDVQ
VKALESKSRD ANLKSGDFEL AISGFGGWGQ DADYLRTRYC DTGAQSGSVS SGAAVFGYHN
DTLNDLGAQE LQELNDDKRK EIVYNMQTVL ANDVPAIPLY YTTSYDVWRI SKYDGWMNRY
DHHARTHNIL SYLERDGIAA KR