Gene Mbar_A2023 details


Gene Information

Locus tag: Mbar_A2023
Symbol:
ID: 3627840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 2556540
End bp: 2557757
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 42%
IMG OID: 637700901
Product: glucose-1-phosphate thymidylyltransferase
Protein accession: YP_305537
Protein GI: 73669522
COG category: [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
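
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 2556540
end_bp = 2557757
reported_gene_length = 1218      # bp
reported_protein_length = 405    # aa


def gene_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints "1218 405"
```

Both computed values agree with the reported Gene Length and Protein Length fields.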


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.562906
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCTA TTATCCTTGC AGCCGGAGAA GGGCTGCGCT GCAGGCCTCT TACTCTAACT 
CGTTCAAAAG TAATGCTTCC CATAGCCAAC AGGCCTATTT TAGAGCATGT AATAGGTTCG
CTTGAAAAAA ATGGAATAAC TGACATTATA TTGATTGTTG GATATAAAAA AGAACGCATA
ATGGACTATT TCGAGGACGG GCTAAATTTT GGAGTGAAAA TAAAATATGT CGAACAGAAA
GCTCAGCTAG GTACTGCACA TGCAATTGAG CAGGCAAAAA AATGGATTGA ACCCGAGGAC
TCGGAATTTC TCGTGCTCAA TGGAGACAAC CTGGTGGAAC CGAAAACTAT AGCGGACCTT
CTGAATAATT ATGAAGGAGA TGCAAGCCTT CTAACTGTTC AGATGGAAGA GACTGCTGGC
TATGGAGTAG TTCTGAAAGA AAAAAAGAGA GTCACGAGAA TTCTGGAAAA AAGACCTGGA
GACCTGAGCC GTATTGTAAA TACCGGAATT TATATTTTTA CGCCGCAGGT CTTTGAAACC
ATCGAAAAAA CCCCGATATC CGAAAACGGT GAATATGCAA TAACCGATAC CCTCCAGCTT
ATGATTGACG AAGGAAAAAT CGTTACTTCT GTTTCTACCA AATCAAAATG GATTGACGCT
GTTCATTCCT GGGATTTATT AAAAGCTAAT GCCATAGTCT TAAATTCAGC CAGGAACCTG
AAGCTTGAAG GAGAAGTTGA AGAGGGAGTT TTCCTCAGTG GAAAGGTGGC AGTAGGAAAG
AATACCAGGA TTCGTTCAGG AACTTACATT GTAGGCCCTG TGGTCATAGG GGAAAATTGT
GATATCGGTC CCAATGTAGT TATTCTGCCC TCCACAACAA TAGGAGACAA TGTATCGATC
AGGTCATTTA CCGAAATACA GAACAGTATA ATAATGAATG ACTGCAGGAT ATACTCTCAT
GGGCGTATCT CGAATTCCAT AATTGGAAGC AACAATACAA TTGGCTCAGG TTTCTTTGTT
GAAGAAAAAG AAGGTCTATC GATCATTATG AATGGAACGA TTCATCGAGC TCCCAGACTC
GGCACCATCT TCGGAGACGA CAACCGTATT GGGAATAGCG TACTTGTAAA GGCCGGAGTA
ACAATAGCTG TTGACTGCCA GGTTGAATCT GGGAATACCA TATATAGGGA TCTGTCCCGT
CATTCGGTTG TTCTTTAA
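
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any statistic such as GC content is computed. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length. The helper names `clean_sequence` and `gc_content` are hypothetical. Only the first row of the sequence is used here, so the result is not the whole-gene figure; paste all rows into the string to reproduce that.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAAAGCTA TTATCCTTGC AGCCGGAGAA GGGCTGCGCT GCAGGCCTCT TACTCTAACT"
seq = clean_sequence(first_row)

print(len(seq))                      # 60 bases in one full row
print(round(gc_content(seq), 1))     # GC percentage of this row only
```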
 
Protein sequence
MKAIILAAGE GLRCRPLTLT RSKVMLPIAN RPILEHVIGS LEKNGITDII LIVGYKKERI 
MDYFEDGLNF GVKIKYVEQK AQLGTAHAIE QAKKWIEPED SEFLVLNGDN LVEPKTIADL
LNNYEGDASL LTVQMEETAG YGVVLKEKKR VTRILEKRPG DLSRIVNTGI YIFTPQVFET
IEKTPISENG EYAITDTLQL MIDEGKIVTS VSTKSKWIDA VHSWDLLKAN AIVLNSARNL
KLEGEVEEGV FLSGKVAVGK NTRIRSGTYI VGPVVIGENC DIGPNVVILP STTIGDNVSI
RSFTEIQNSI IMNDCRIYSH GRISNSIIGS NNTIGSGFFV EEKEGLSIIM NGTIHRAPRL
GTIFGDDNRI GNSVLVKAGV TIAVDCQVES GNTIYRDLSR HSVVL
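
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a small standalone translator, not the tool that produced this record. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The function name `translate` is illustrative. It is applied to the first 30 and last 18 bases of the gene sequence above.

```python
from itertools import product

# Amino acid assignments for codons ordered TTT, TTC, TTA, TTG, TCT, ...
# (bases in T, C, A, G order). Table 11 uses the same assignments as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGAAAGCTA TTATCCTTGC AGCCGGAGAA"   # first 30 bases of the gene
gene_end = "CATTCGGTTG TTCTTTAA"                  # last 18 bases of the gene

print(translate(gene_start))  # MKAIILAAGE
print(translate(gene_end))    # HSVVL
```

The two outputs match the first ten and last five residues of the protein sequence above, and the final codon TAA is a stop codon.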