Gene Mbar_A3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3049 
Symbol 
ID3624846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3927537 
End bp3928649 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content36% 
IMG OID637701890 
Producttransposase 
Protein accessionYP_306520 
Protein GI73670505 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00782519 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.671104 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAG CGTTCAAATT TAGACTCTAT CCTACAACTA CGCAGGCTGT TCAATTAAAT 
CAGCATATAG GTAGCTGTAG ATTTGTCTAC AATTGGGCAC TTGATCAGAA AATTAAAACT
TATGAACAGA CAGGAAAATC AATTTCCAGA TTTGACTTAA ACAAAAAGCT TCCTGTCTTG
AAAGCTTCTA ATGAATGGTT AGGAGAAGTC AATTCTCAAT CATTGCAGAG AATGACTAAG
CAGGTTGAGT CTGCCTTCAC TCGATTTTTC CGAGAGAAGA ACGGCTTTCC TAAGTTCAAA
TCTAAGAAAA ACCCAATTCA ATCTTTTCCT GTACCTCAAC ACTACTCCGT AGACTTTGAA
AAAAACACTA TCAAGCTCCC TAAAATAGAA CCAATTAAAG CAGTTTTTCA CAGGAAGTTT
GAGGGCGAGC TTAAAACAGC TACTGTTTCA AGGACATGTC AAGGACATTA CTACATTAGT
ATCCTTGTTG AAGATGGAAA AGAACTTCCT ACAAAACAGA AGTATTCAGA ATCTACTACA
GTGGGTATAG ATGTCGGGAT TAAGGATTTT GCTATACTTT CCACAGGAGA AACGATTGAG
AATCCTAACT ACCTGAAAAA CTCTTTGAAC AGGTTAAAGG TTCTTCAAAA AAGAGCATCA
AGGAAACTGA AAGGTTCTAA GAACAGGGTA AAAGCCAAAC ATAGGCTTGC TGTACTACAT
GACAAAATAA CTAATCAGAG GAACGACTTC CAGAACAAAC TCTCTTTTAA ACTCATAAGC
GAAAACCAAG CAATAGCTCT GGAAACTCTG AATGTTAAAG GAATGGTCAA GAATCATCAT
TTGGCACAGG CTATAAGTGA TTCCGCATGG AGCAGTTTTG TAACAAAACT AGAGTATAAA
GCTGAATGGT ACGGAAAAAC CATCCTGAGA ATTGGGCAAT TTGAACCATC TTCTAAAGTA
TGTCATGTTT GTGGATATCA TAATTCATAT TTGACATTAA AAGATAGAGA ATGGACTTGC
CCAGACTGTA AAACAAAACA TGATAGAGAT ATAAATGCCG CTATCAATAT CAAGAAATTT
GCTCTCATAG ATCAAAATCT AATTGGATTA TAA
 
Protein sequence
MMKAFKFRLY PTTTQAVQLN QHIGSCRFVY NWALDQKIKT YEQTGKSISR FDLNKKLPVL 
KASNEWLGEV NSQSLQRMTK QVESAFTRFF REKNGFPKFK SKKNPIQSFP VPQHYSVDFE
KNTIKLPKIE PIKAVFHRKF EGELKTATVS RTCQGHYYIS ILVEDGKELP TKQKYSESTT
VGIDVGIKDF AILSTGETIE NPNYLKNSLN RLKVLQKRAS RKLKGSKNRV KAKHRLAVLH
DKITNQRNDF QNKLSFKLIS ENQAIALETL NVKGMVKNHH LAQAISDSAW SSFVTKLEYK
AEWYGKTILR IGQFEPSSKV CHVCGYHNSY LTLKDREWTC PDCKTKHDRD INAAINIKKF
ALIDQNLIGL