Gene Tmz1t_2044 details

Gene Information

Locus tag: Tmz1t_2044
Symbol:
ID: 7083804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2309297
End bp: 2310565
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 68%
IMG OID: 643699071
Product: glucose-1-phosphate adenylyltransferase
Protein accession: YP_002355688
Protein GI: 217970454
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0448] ADP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR02091] glucose-1-phosphate adenylyltransferase
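
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2309297, 2310565)
print(length)                     # 1269
print(protein_length_aa(length))  # 422
```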


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.11896
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGCCG AAGCCGCCAA CCGACGACGG ATCCTCACGC GCCGCACCCT CGCCCTCGTG 
CTCGCCGGCG GGCGCGGCTC GCGCCTGCGC GATCTCACCA ACGTGCGCGC CAAGCCGGCG
GTGCACTTCG GCGGCAAGTT CCGCATCATC GACTTCGCGC TGTCGAACTG CATGAACTCG
GGTCTGCGCC GCATCGGCGT GATCACACAG TACAAGTCGC ACTCGCTGCT GCGCCACCTG
CAGCGCGGCT GGAGCTTCCT GCGCAACGAG ATGGGCGAAT TCGTCGACCT GCTGCCGGCG
CAGCAGCGCA TCGACGAGGA ACAGTGGTAC CAGGGCACCG CCGACGCGGT GTTCCAGAAC
CTCGACATCA TCCGCAACTC CACGCCGCCC GACTACATCG TCGTGCTCGC CGGCGACCAT
GTGTACAAGA TGGACTACTC GATCATGCTC GAGGACCACG CCGCGAGCGG GCGCGGTGTC
ACCGTGGGCT GCATCGAGGT ACCGCGCGAG GAGGCCAAGG CCTTCGGCGT GATGGCGATC
GATGCACGGC GCCACATCAC CGCCTTCGTC GAGAAGCCCG CCGACCCGCC AGCGCTGCCG
GGCAATCCCG GGCTGTCGCT CGCCAGCATG GGCATCTACA TATTCTCGGC CAACTACCTC
TACCGCCTGC TCGAGGACGA CGCGAAGAAT CCGGACTCCA GCCACGACTT CGGCAAGGAC
CTGATTCCGC GCGCGGTGGC GGAAAACCAG GCGCTCGCCC ACCCCTTCAC GCTGTCGGCG
ATCGCCACCC CGCCCTTCTC CGGCCCCTAC TGGCGCGACG TCGGCACGGT GGACGCCTAC
TGGGCGGCCA ACCTCGACCT CGCCTCGACC ACGCCGGCGC TCAACATGTA CGACAAGGAC
TGGCCGATCT GGACCTACCA GGAGCAACTA CCGCCGGCCA AGTTCGTGCA CGATCTCGAC
GGTCGCCGCG GCGAGGCGCT CAACGCGCTG GTCTCGGGCG GCTGCATCGT CTCCGGATCG
GTCGTGCGCG AGTCGGTGCT GTTCTCCAAC GTGCTGGTGC GCTCCTACAG CACGATCGAG
CAGGCGGTGG TGCTGCCCGA CGTGCAGATC AACCGCCACT GCCGCCTGAA GAAGGTCGTC
ATCGATCGCC ACTGCGTGAT CCCCGAGCGC ACGGTGATCG GCGAGGACGC CGAGGCGGAT
GCGCGCCGCT TCCACCGCAC CGAGGGCGGC GTGGTGCTGG TGACGCGCGA AATGCTCGAC
CGACTGTGA
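
A GC content figure such as the one reported in the gene information can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch, not the database's own method; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only; paste the full sequence to check the whole gene.
first_line = "ATGCCTGCCG AAGCCGCCAA CCGACGACGG ATCCTCACGC GCCGCACCCT CGCCCTCGTG"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```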
 
Protein sequence
MPAEAANRRR ILTRRTLALV LAGGRGSRLR DLTNVRAKPA VHFGGKFRII DFALSNCMNS 
GLRRIGVITQ YKSHSLLRHL QRGWSFLRNE MGEFVDLLPA QQRIDEEQWY QGTADAVFQN
LDIIRNSTPP DYIVVLAGDH VYKMDYSIML EDHAASGRGV TVGCIEVPRE EAKAFGVMAI
DARRHITAFV EKPADPPALP GNPGLSLASM GIYIFSANYL YRLLEDDAKN PDSSHDFGKD
LIPRAVAENQ ALAHPFTLSA IATPPFSGPY WRDVGTVDAY WAANLDLAST TPALNMYDKD
WPIWTYQEQL PPAKFVHDLD GRRGEALNAL VSGGCIVSGS VVRESVLFSN VLVRSYSTIE
QAVVLPDVQI NRHCRLKKVV IDRHCVIPER TVIGEDAEAD ARRFHRTEGG VVLVTREMLD
RL
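
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs in its permitted start codons, and this sketch does not recode alternative start codons to M (not needed here, since the gene starts with ATG). `translate_cds` is a hypothetical helper name.

```python
# Codon table in NCBI order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate_cds("ATGCCTGCCG AAGCCGCCAA CCGACGACGG"))  # MPAEAANRRR
print(translate_cds("CGACTGTGA"))                         # RL
```

The two outputs match the first ten and last two residues of the protein sequence.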