Gene Mbar_A3708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3708 
Symbol 
ID3624939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4777940 
End bp4781059 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content35% 
IMG OID637702541 
Producthypothetical protein 
Protein accessionYP_307151 
Protein GI73671136 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAATTT CAGAATTAGT TTTAAGTTAT CTTTCAAGCC TATTATATGA TTCTTCCAAG 
GAAATCCCTC GAAAAATTCT TGACACACAT TGGGAAATAT ATAGTAAAGC AATCGAAGAA
CTTTCAAATA CAAGCCTTAA ACTTAATAAA ATAAATATTG ATATTTTTTT GCATCAGCAA
AAGGTAGAAA TGGCAATTGA AGAGTATCTA AAAAGTCCAA ATAAAGCTGA TTGTTTAAGT
GTCTTAATTA ATGAGTTTTT TGAGTTGTTC AGCGAGGAAG ATTTTTCCTT AAACGATGCA
AATTTAATCT TAAATACTTT TTTTGAAATT ATAGATTCAG AAATTGAAAA AAATCCTGAA
CTTAGAGACT ACCTTAAGCT TTACTTAGCG AAAAAAGCAC ATAAAACAAT TGAAGAAATA
AATCAAGAAC TTAAAGAAAC ACATCAAGAG ATCCAAAAAT TATCCTTCAA AATTGATGGA
CTTCTAATCA ATAATCTTGA TAACAAGTCT GCTCCTCTCA CATCTTCTGA GCTCCAAGAT
AAAAAGATAA AATCAAATGT TCCATATCCA TTCAATCCAT TTTTTATCGG TCGAGATGAA
AAGCTAGAAC AAATTCACGA GACACTTCTT TCAAACAAAA GAGCGGTTTT GTCGCGACCA
GTAGCAATAT GTGGATTAGG TGGTATTGGA AAAACACAAA CTGTAGTACA ATATACGTAT
CTTTATAGTC ACGAATATAA ATTTGTGTTC TGGGTAACAG CTGATTCAGA AGGCTCAATT
ATTTCAGGCT ATGTGAATGT AGCGAAATTA TTAGATTTAC CTTTGAAAAA TGACTCTGAC
CAGAAACTTA TTGTTTCTGC TGTACTAAAC TGGTTTAAAA ATAATGAGAA CTGGTTACTG
GTTTTTGACA ATGCAGATGA CCCTTCAGTC TTAAGAAACT TAATGCCTTT AAATTCAAAA
GGTCACATAT TATTTACATC AAGAGCTCCT CTTTTTGAGG AGCTAGGAGC TACAAGCCAG
ATTGAAATGG ATAAGATGCT TCCTGATGAG GCACGGAAAT TTTTCATCAA ACGTACAGGG
CGTAAAAACT TAAAGCCTTC AGAACTTAAG GCACTTGATG AACTTACATC TGAGCTTGAT
TACTTGCCAT TAGCAATGGA GCAAGCTGGA GCCTATATTA GAAAAATAGA ATGTAGTTTT
GAAGATTATC TCTCAAGCTA TAAAATAAGT GGATTGAAGT TACTGGAAAG GTCTCGAATT
TCAACAGATA AATATCCAAA ATCAGTAGCT ACTACTTGGA TCTTGAACTT TGAAAATATA
AAAAAAGATT CAAAAGTTTC AGCGGAAATA TTATTTGTAA GTGCTTTTCT AAATCCATCT
AAAATCCCAG TTGATATTTT TATCAAAGGC GCAAAAGAAC TGGGGCCTTT GATTTCTTCA
GCTCTTGAAA ATATTGAAAG CCTTCCAGTT ATTTTATATG AATCTTTTGA ACCTATTAGG
CAGTATTCAT TGATTACTCG CGATGTAAAT AATTATACAT ATGATATTCA CCGCCTTGTA
CAAGCTGTTC TTAGAGACGG GATGGATGAA ACTACACAGC GAATCTGGGT TGAGCGTACT
GTTAAAGCTT TAAACTGTGC ATTTCCTGAA ATAGAGTATA ATAATTGGGA TCTTTGCGAC
AAGCTTCTTC CGCATATTCA GACATGTGAA AAATATATCA AAAAATGGAA TATGGAGACC
AAAGAATTTG CAAAGTTGCT AAATTCTACT GGCAACTATC TGTATGAACG TGCACGTTTC
AAAGAATGTG AATTATATTT TAATAGCTCA TTTGATATCA GAAAGAAAAT TTTGGATTCA
ACTCATCCAG ATATTGCTGA AAGTATGACT GATTTAGCCG CACTCTACGT GTTTCAAGGC
AGGTATTCTG AGGCTGAACC ACTTATTAAG CGCGCCCTAG AAATACGTGA GATAGTTTTA
GGTCCAGAAC ATCCTGACAC AGCAGCCTCT CTAAATATTC TAGCAGGAAC TTACAATTCT
CAAGGTCGGT ATTCTGAGGC TGAACCATTT TTTAAACGTG CCTTGGAAAT ACGCGAGAAA
GCTTTGGGCT CAGAACACCC TGACACAGCA ATTTCTCTTG ATAATTTAGC AGGAATTTAT
AGATCTCAAG GTAGATATCC CGAAGCTGAG AAATTGTTAA AGCGTGCCTT GGAAATTAAT
GAGAAAATTT TTGGTTCTGA ACACCCTAAC ACAGCACTTT CTCTTGATAA CCTATCAGTG
CTCTATCAGA GTCTAGGCAA ATATTCTGAG GCTGAATCAT TTTCAAAGCA TGCATTAGAT
ATCTACGAGA TATGTTCGGG TCCAGAGCAT CCTGACACAT CAATTTCACT TTGTAATTTA
GCAATGTGTT ATACAAGTCA AGGCAAATAT CCCGAAGCTG AGCTACTTTT AAAGCGTGCT
CAGGAAATTG ATGAGATTGT TTTGGGCTCA GAACACCCTG GTACAGCAAC CACTCTTAAT
AACTTAGCAA CACTCTATCA AAGTCAAGGT AAATATTCTG AGGCTGAACC ACTTTTTAAG
CGTGTCTTGA AAATACGTGA GAAAGTTTTG GGCTCGGAAC ACCCTGACAC AGCACTTTCT
CTCAATAATT TAGCAGGAAC CTACAAATTT CAAGGTAGAT ATTCTGAAGC TGAGCCACTT
TTAAAGCGCG CTCAGGAAAT TGATGAGAAT GTTTTGGGCT CAGAACACCC TAGTACAGCA
ACCACTCTTA ATAATTTAGC AACACTCTAT CAAAGTCAAG GCAAATATTC TGAGGCTGAA
TCACTTTTAA AGCGTGCTCT GGAAATTCAA GAGAAAATTT TTGGTTCCGA AAATATATCA
GTAGTAAGTT CTCTCAATAA TTTAGCAACA ACTTATGCGA CCCAAGGAAA AAATCTCAAA
GCTAAAGAAT TATTTCTCCG ATCAATAGAT ATAATGGAAA AAATCAAAAG AGAAGATCAT
CCAGATTTTT TAGCATTACT AGAAAACTAT GCTTGTCTAC TAATTAAAAT GAAAAAAGGT
CGTGAGGCAT CAAAAATATT AAAGCGGATT ACTTATATAA ATGCGAAAAA CAAAAACTGA
 
Protein sequence
MIISELVLSY LSSLLYDSSK EIPRKILDTH WEIYSKAIEE LSNTSLKLNK INIDIFLHQQ 
KVEMAIEEYL KSPNKADCLS VLINEFFELF SEEDFSLNDA NLILNTFFEI IDSEIEKNPE
LRDYLKLYLA KKAHKTIEEI NQELKETHQE IQKLSFKIDG LLINNLDNKS APLTSSELQD
KKIKSNVPYP FNPFFIGRDE KLEQIHETLL SNKRAVLSRP VAICGLGGIG KTQTVVQYTY
LYSHEYKFVF WVTADSEGSI ISGYVNVAKL LDLPLKNDSD QKLIVSAVLN WFKNNENWLL
VFDNADDPSV LRNLMPLNSK GHILFTSRAP LFEELGATSQ IEMDKMLPDE ARKFFIKRTG
RKNLKPSELK ALDELTSELD YLPLAMEQAG AYIRKIECSF EDYLSSYKIS GLKLLERSRI
STDKYPKSVA TTWILNFENI KKDSKVSAEI LFVSAFLNPS KIPVDIFIKG AKELGPLISS
ALENIESLPV ILYESFEPIR QYSLITRDVN NYTYDIHRLV QAVLRDGMDE TTQRIWVERT
VKALNCAFPE IEYNNWDLCD KLLPHIQTCE KYIKKWNMET KEFAKLLNST GNYLYERARF
KECELYFNSS FDIRKKILDS THPDIAESMT DLAALYVFQG RYSEAEPLIK RALEIREIVL
GPEHPDTAAS LNILAGTYNS QGRYSEAEPF FKRALEIREK ALGSEHPDTA ISLDNLAGIY
RSQGRYPEAE KLLKRALEIN EKIFGSEHPN TALSLDNLSV LYQSLGKYSE AESFSKHALD
IYEICSGPEH PDTSISLCNL AMCYTSQGKY PEAELLLKRA QEIDEIVLGS EHPGTATTLN
NLATLYQSQG KYSEAEPLFK RVLKIREKVL GSEHPDTALS LNNLAGTYKF QGRYSEAEPL
LKRAQEIDEN VLGSEHPSTA TTLNNLATLY QSQGKYSEAE SLLKRALEIQ EKIFGSENIS
VVSSLNNLAT TYATQGKNLK AKELFLRSID IMEKIKREDH PDFLALLENY ACLLIKMKKG
REASKILKRI TYINAKNKN