Gene Tmz1t_2048 details


Gene Information

Locus tag: Tmz1t_2048
Symbol:
ID: 7083808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2317393
End bp: 2319111
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 69%
IMG OID: 643699075
Product: alpha amylase catalytic region
Protein accession: YP_002355692
Protein GI: 217970458
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
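
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(2317393, 2319111)
print(gene_len, protein_len)  # 1719 572
```

Both values agree with the Gene Length and Protein Length fields listed in the record.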


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.105277
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCCCT CTGCAGCCCC CTCCGTCGGC CCGCAGCCGG TTTCGAGCCC CGCTGCCCCC 
TGGTGGAAAT CCGCCGTCGC CTACCAGCTC TACCCGCGCA GCTTCATGGA CAGCGACGGC
GACGGCATCG GCGACCTGCG CGGGATCCTC GCGCGGCTCG ACTACCTGCA ATGGCTGGGC
GTCGATGTGA TCTGGATTTG CCCGATCTTC GACTCCCCGA ACGACGACAA CGGCTACGAT
ATCCGCGACT ACCGCAAGAT CCTGTCGACC TTCGGCACCA TGGAGGATTT CGACCGCCTG
CTCGCCGAAG TCCACCGCCG CGGCATGAGG CTGATCCTCG ACCTGGTGCT CAACCACACC
AGCGACGAGC ACCCCTGGTT CCTCGAGTCG CGCGCCTCGC CCGACAACCC CAAGCGCGAC
TGGTACATCT GGCGCGAGGG CCGCGACGGG CATCCGCCCA ACAACTGGGA GAGCATCTTC
CGCGGCCCCG CGTGGGCCCG GGACCCGGCC ACCGGGCAGT ACTACCTGCA CCTGTTCACC
CGCCGCCAGC CCGACCTCGA CTGGACCAAT CCGGAGATGC GCGCGGCCTT CCACGACATC
GTGCGCTGGT GGCTGGACAA GGGCATCGAC GGCTTCCGCC TCGATGCGGT GAGCCACATC
CGCAAGATGC CGGGCCTGCC CGACCTGCCC AATCCGCGCG GCCTGCCCTG GGTGCCGTCC
TTCGCCATGC ACATGAACGT CGACGGCGTG CTCGACACCA TCGGCGAGCT CTGCCGCGAG
ACCTTCCGCC GCTACGACGT GATGACCGTG GGCGAGGCCA ACGGCGTCGG CCCGCGCGAG
GCGGTGGAGT GGGTGGGCAG CGACCGCGGC CGGCTCGACA TGATCTTCCA GTTCGAGCAC
CTGGCGCTGT GGTCGCGCGC GCCCGGCGCG CCGCTCGACG TGGTCGCGCT CAAGAAGGTG
CTGTCGCGCT GGCAGCATGC GCTGCACGGC AAGGGCTGGA ACGCGCTCTT CCTGGAGAAC
CACGACATCC CCCGCGTCGT CTCCAAGTGG GGCGACACCG GCGCGCTGTG GCGCGAGAGC
GCGACCGCGC TGGCGACGAT GTACTTCCTG ATGGAGGGCA CGCCCTTCAT CTACCAGGGC
CAGGAGATCG GCATGACCAA CGGCGTCTTC GAGCGCATCG AGGACTTCGA CGACGTGCTG
GCGAAGAACG ACTACGCGCA GCGCCGCCTG GCCGGCGAGG AGGAGGGCGC GATCGTCGCC
GACCTCGTGC TCACCGGCCG CGACCCGGCG CGCACGCCGA TGCAGTGGGA CGACGGCCCT
CAGGCGGGCT TCACGCGCGG ACGGCCGTGG CTGGCGGTCA ATCCCAACCA TCGCCGCATC
AACGTCGCCC AGCAGCGCCA CGACCCGGCT TCGGTGCTCA ACCACTACCG CCGCCTGATC
GCGCTGCGCA AGGCCGAGCC GGCGCTGGTG CTCGGCGACT ATCGCCTGCT GATGAAGGAC
GACGCGCAGA TCTACGCCTA CCAGCGCCGG CTGGACGGCG AGCGCATTGC GGTGATCGTC
AACCTGAGCC CTCGCCCGGC GCGCTTCGAT CATCCCGGCG TGGTGCTCCG CCACGAACGC
CTGCTGCTCG CCAACCGCGC CGTCGAAGCG CACCCTGCCG CCCACGCGCT CGAGCTCGCA
CCCTACGAGG CCCGCGTATA CCGCGTGGGA AAAATGTAG
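
The GC content reported for this gene could be recomputed from the sequence above. The sketch below is one way to do so, assuming the sequence is pasted as text in the blocked layout used here (groups of ten bases separated by spaces and line breaks). The helper name `gc_percent` is hypothetical. The example call uses only the first ten-base block, so its result is not the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten-base block of the gene sequence only (7 of 10 bases are G or C).
print(gc_percent("ATGCCCCCCT"))  # 70.0
```

Passing the full gene sequence as a single multi-line string would give the whole-gene percentage for comparison with the GC content field.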
 
Protein sequence
MPPSAAPSVG PQPVSSPAAP WWKSAVAYQL YPRSFMDSDG DGIGDLRGIL ARLDYLQWLG 
VDVIWICPIF DSPNDDNGYD IRDYRKILST FGTMEDFDRL LAEVHRRGMR LILDLVLNHT
SDEHPWFLES RASPDNPKRD WYIWREGRDG HPPNNWESIF RGPAWARDPA TGQYYLHLFT
RRQPDLDWTN PEMRAAFHDI VRWWLDKGID GFRLDAVSHI RKMPGLPDLP NPRGLPWVPS
FAMHMNVDGV LDTIGELCRE TFRRYDVMTV GEANGVGPRE AVEWVGSDRG RLDMIFQFEH
LALWSRAPGA PLDVVALKKV LSRWQHALHG KGWNALFLEN HDIPRVVSKW GDTGALWRES
ATALATMYFL MEGTPFIYQG QEIGMTNGVF ERIEDFDDVL AKNDYAQRRL AGEEEGAIVA
DLVLTGRDPA RTPMQWDDGP QAGFTRGRPW LAVNPNHRRI NVAQQRHDPA SVLNHYRRLI
ALRKAEPALV LGDYRLLMKD DAQIYAYQRR LDGERIAVIV NLSPRPARFD HPGVVLRHER
LLLANRAVEA HPAAHALELA PYEARVYRVG KM
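
The protein sequence follows from the gene sequence by codon translation. The sketch below illustrates this with a hand-built codon table rather than a library call. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (bases T, C, A, G at each position),
# paired with the amino acids of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 39 bases of the gene sequence in this record.
print(translate("ATGCCCCCCT CTGCAGCCCC CTCCGTCGGC"))             # MPPSAAPSVG
print(translate("CCCTACGAGG CCCGCGTATA CCGCGTGGGA AAAATGTAG"))  # PYEARVYRVGKM
```

The two outputs match the first ten and last twelve residues of the protein sequence above.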