Gene Mbar_A1284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1284 
Symbol 
ID3627771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1590414 
End bp1592051 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content44% 
IMG OID637700174 
Productoligopeptide ABC transporter, solute-binding protein 
Protein accessionYP_304827 
Protein GI73668812 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000669385 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA AAAACTTCCC CTTGATTATA TTGTTAATTA TCCTGCTGAC CTGCGCTATC 
CTGACGACTT CCGGGTGCAG CAGTACAGAA GAAAGTGGAG CAAAAGACGC AGTCTCCGGG
ATGGAGGTTA ACTCAGGATC AGATGATGCT TCCCAGATAG AAGGAGGCTC TGAAAATAAT
ACACTTGTAC TTGGTGAAAT GTGGAATATC GAATCGATTG ACCCAATTAA TGGGGATGGC
ACCCTTGTCT GCGAGAAAGC TGCAATAACC GAAACTCTTG TTGGTGCCAA TGACGATTTC
TCGCTCAAAC CTGGGCTTGC AACTTCCTGG GAACAGCTTG ATGAAAATAC CTGGGAATTT
AAGCTGAGAA ATAATGTTAC CTTCCATGAC GGAAGCAAAA TGACTGCAGA AGACGTAAAT
TTCACTCTGG AAAAAGTCAT CTCGGAAAAC GCGAAAGTTG CTTCCATGCT AAAAATAGAT
TCGATAGAAA TAGTTGACAA CTATACTCTT AAAATCAAAA CCAAAGAAAT AAACCCTATC
CTTCCTGGAG TTCTCCATTA TCCTGATACT GCTATAATAA GTCCCTCTTC TTATAATGAA
AATGGAGAGT TTGTAAAACC TGTAGGAACG GGTCCATATA AATTAGAATC ATTTGACGAA
CAGACCAGAG TTCTGACAGT TGTAAAGAAC GATAACTGGT GGGGAGGAGA AGTAGGGCTT
GATAAAATGA TCCTTAAGGG AATACCAGAC CCCAACACGA GAGCAATGGC GATCGAAAAT
GGAGAAGTTG ACTTTACCGT CGATGTACCC TATAGTGAAA CTGACAGGAT TGACGCTATA
GACGGCATTA ACGTAGAGAA ATACAAAACC CCAAGAGTCT ACAAACTTGA CCTGAACCTG
AAACATGAAC CCCTTGAAGA TGTTAGAGTA AGGCAGGCTA TGTCCTATGC CATTGACAGG
TCTGACATCG CAGAAAATGT ACTGTACAAT GTAGGAGAAG CTGCTGCAGG TCCTTTCCTG
CCCACAATGG TCTGGGCAAA TAAGAGCCTG AAACCCTACA GCCAGGACCT TGAGAAAGCT
GATGAGCTCC TTACAGCTGC TGGCTGGGTG GATACTGACG GAGACGGCAT CAGGGATAAG
GATGGACAAC CTCTCAAGTT CAACCTGATG ACCTATTCGG CAAGGCCTGG ACTTCCTCCA
ATGGCTGAAG CTATGGCTGC CCAGTTAAGG GAGGCAGGCA TAGGCATAGA GACAGAAGTT
CTGGAAATGG GGTCAATCGA TGACAGAAGG GAAAGCGGAG ACTGGGACCT CTACCTTGCA
GCTTACAATA TTGCGATGGT TCCGGACCCA GAATATATTC TCACAAACTG GTACATGACA
AACGGGACTG ACAATAACGC AGGATATTCC AATCCTAAAG TAGACTCCCT AATAACAGAA
GCCAGAAAAA TCACGAACAT GAGTGAACGC TATAAAAAGT TCAATGAGGT AGAAGCTATC
GCTTATGATG AACAGCCCAT GATCATAGTG GCTTACTACG GCTGTGCAAT CGTAAAGAAA
GACTATGTAA AAGGATACGT CTTCGATCCG ACAGCTCATG ACTACCGTAT AAACGCAGAT
ATGCATATCG AGAAGTAA
 
Protein sequence
MKNKNFPLII LLIILLTCAI LTTSGCSSTE ESGAKDAVSG MEVNSGSDDA SQIEGGSENN 
TLVLGEMWNI ESIDPINGDG TLVCEKAAIT ETLVGANDDF SLKPGLATSW EQLDENTWEF
KLRNNVTFHD GSKMTAEDVN FTLEKVISEN AKVASMLKID SIEIVDNYTL KIKTKEINPI
LPGVLHYPDT AIISPSSYNE NGEFVKPVGT GPYKLESFDE QTRVLTVVKN DNWWGGEVGL
DKMILKGIPD PNTRAMAIEN GEVDFTVDVP YSETDRIDAI DGINVEKYKT PRVYKLDLNL
KHEPLEDVRV RQAMSYAIDR SDIAENVLYN VGEAAAGPFL PTMVWANKSL KPYSQDLEKA
DELLTAAGWV DTDGDGIRDK DGQPLKFNLM TYSARPGLPP MAEAMAAQLR EAGIGIETEV
LEMGSIDDRR ESGDWDLYLA AYNIAMVPDP EYILTNWYMT NGTDNNAGYS NPKVDSLITE
ARKITNMSER YKKFNEVEAI AYDEQPMIIV AYYGCAIVKK DYVKGYVFDP TAHDYRINAD
MHIEK