Gene Mbar_A2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2020 
Symbol 
ID3627837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2552095 
End bp2553288 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content44% 
IMG OID637700898 
Productglucose-1-phosphate thymidylyltransferase 
Protein accessionYP_305534 
Protein GI73669519 
COG category[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.611525 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGG TTGTCCTTGT AGCAGGCAAA GGCACAAGGA TGGAACCTCT AACTTCCGGC 
TGTCCTAAAG TTATGCTCCA GGTTGCAAAT AAACCCATCC TCGAACATAT ACTTAATTCA
GCCATCGAAG CAGGCATCGA AGGCTTTGTC TTCATTACCG GTTATCTCGA AAAGCAGATA
AAAGAGTACT TCGGGGACGG AAACAAATGG GGAGTAAGCA TAGAGTATGT ACAGCAAAAG
GAGCAGCTAG GGACTGCAAA TGCGATAGGC TGTGCAAAAG GCTATGTTGA TGGGACTTTC
CTTGTACTTA ACGGGGATAT GCTCATAGAG CAGGAGGACT TAAAAGCCCT GGTCTCAAGG
ACAGAAGAAG CTGTCATCTG TGTAAAAGAG GTGGAAAACC CGGCGGATTT CGGAGTGCTT
GAAACCGAGA ATAATAGAGT TGTCAGGATA ATAGAAAAAC CTAAAAATCC CCCTACTAAC
CTTGCAAATG CCGGGATATA TCTTTTCAGG GAATCTATTT TTGACTTTAT TGACAGAACC
AAGGCCTCAG TGAGAAATGA GTTCGAGATT ACGGACTCCA TCCAGATGCT GATTGATAGT
GGAACAGCCG TGGGTTACAG CCCTCTCGAA GGTAGATGGA TAGATATAGG GTATCCCTGG
GACCTCTTGA AAGCAAACGA ATACCTTCTG AAAGGCCTTA AGAGCAGCTG TGAAGGTACT
GTAGAGCCGA ATGCTACCAT AAAAGGAGAG GTTGTAATCG GAAAAGGCAC TATTATCAGG
AACGGTTCTT ATATCGAAGG CCCAGTAGTG ATAGGAGAGA ATTGCGATAT CGGGCCTAAT
TGTTTTATCC GCCCTTCCAC TGCAATCGGG AACCATATTA GGGTAGGAAA TGCTGTGGAG
ATAAAAAATA CGATTGTCAT GGAAGATACT CATGTGGGAC ATCTGAGTTA TGTTGGGGAC
AGCATTATTG GGCACCACTG CAACTTTGGA GCGGGTACGA AAGTTGCAAA CCTCCGCCAT
GATGGGAAAA ACATAAAAGT AATGATAAAA AGCAGGATTC TTGACACGGG CAGGAGAAAA
CTCGGAGTGA TTATGGGAGA TGACGTGCAT ACCGGTATCA ACACAAGCAT AAATATCGGT
ACGATAATGG AAAAAGGAAG ATATACATAT CCTGGAGAGA TTGTCAAACA TTAA
 
Protein sequence
MKAVVLVAGK GTRMEPLTSG CPKVMLQVAN KPILEHILNS AIEAGIEGFV FITGYLEKQI 
KEYFGDGNKW GVSIEYVQQK EQLGTANAIG CAKGYVDGTF LVLNGDMLIE QEDLKALVSR
TEEAVICVKE VENPADFGVL ETENNRVVRI IEKPKNPPTN LANAGIYLFR ESIFDFIDRT
KASVRNEFEI TDSIQMLIDS GTAVGYSPLE GRWIDIGYPW DLLKANEYLL KGLKSSCEGT
VEPNATIKGE VVIGKGTIIR NGSYIEGPVV IGENCDIGPN CFIRPSTAIG NHIRVGNAVE
IKNTIVMEDT HVGHLSYVGD SIIGHHCNFG AGTKVANLRH DGKNIKVMIK SRILDTGRRK
LGVIMGDDVH TGINTSINIG TIMEKGRYTY PGEIVKH