Gene Mbar_A2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2043 
Symbol 
ID3625558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2580293 
End bp2581405 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content35% 
IMG OID637700921 
Producttransposase 
Protein accessionYP_305557 
Protein GI73669542 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.087211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.244149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAG CGTTCAAATT TAGACTCTAT CCTACAACTA CGCAGTCTGT TCAATTGAAT 
CAGCATATAG GTAGCTGTAG ATTTGTCTAC AATTGGGCGT TGGATCAAAA AATTAAAACT
TACGAACAAA CAGGAAAATC TATTTCCAGA TTTGACTTAA ACAAAAAGAT TCCTGTCTTG
AAAGCTTCTA ATGAATGGTT GGGAAAAGTA AACTCACAGT CATTACAAGG AATGACTAAG
CAGGTAGAAT CTGCTTTCAC CAGATTTTTC AGAGAAAAGA ATGGATTTCC TAAGTTTAAA
TCAAAGAAAA ATCCAATTCA ATCTTTTCCT ATACCTCAAC ACTATTCTGT GAATTTTGAA
AACAACACAG TTAAGCTTCC AAAAATCGAA CCGATTAAAG CAGTTCTTCA CCGAAAGTTT
GAAGGCGAGC TTAAAACAGC TACAGTATCA AGGTTTTGTA AAGGGAATTA CTACATTAGT
ATCCTTGTTG AAGACGGAAA AGAACTTTCA GCAAAGCAAC CTTTCACAGA ATCAACTACC
GTAGGAATAG ATGTAGGTAT CAAAGATTTT GCTATACTCT CTACAGGTGA AAAAATATCG
AATCCTATGT ATCTTAAAAA CTCCCTCAAG AGACTTAAAG TACTTCAGAA AAGAGTTTCA
AGGAAACAAA AAGGTTCAAA GAACAGAGCA AAAGCCAAAA AGAGGGTTGC TGTACTCCAT
GAGAAAATAA GCAATCAGAG ACATGATTTC CAGAACAAAC TCTCTTTTAA ACTTATAAGC
GAAAACCAAG CTATAGCTCT GGAAACTCTG AACGTTAAAG GAATGGTTAA GAATCACCAC
TTATCACAGT CTATAAGTGA TTCTGCATGG AGTAGCTTTG TTACAAAATT AGAATACAAA
GCGGCATGGT TCGGAAAAAC CACCCTGAGA ATTGGACAGT TTGAGCCATC TTCTAAGCTT
TGTAATGTCT GCGGATATCA TAATTCAGAT TTGACATTAA AAGATAGAGA ATGGACTTGT
CCAGACTGTA AAACAAAACA TGATAGAGAT ATAAATGCCG CTATCAATAT CAAGAAATTC
GTTCTCATAG ATCAAAATCT AATTGGGTTG TAA
 
Protein sequence
MMKAFKFRLY PTTTQSVQLN QHIGSCRFVY NWALDQKIKT YEQTGKSISR FDLNKKIPVL 
KASNEWLGKV NSQSLQGMTK QVESAFTRFF REKNGFPKFK SKKNPIQSFP IPQHYSVNFE
NNTVKLPKIE PIKAVLHRKF EGELKTATVS RFCKGNYYIS ILVEDGKELS AKQPFTESTT
VGIDVGIKDF AILSTGEKIS NPMYLKNSLK RLKVLQKRVS RKQKGSKNRA KAKKRVAVLH
EKISNQRHDF QNKLSFKLIS ENQAIALETL NVKGMVKNHH LSQSISDSAW SSFVTKLEYK
AAWFGKTTLR IGQFEPSSKL CNVCGYHNSD LTLKDREWTC PDCKTKHDRD INAAINIKKF
VLIDQNLIGL