Gene Tmz1t_1216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1216 
Symbol 
ID7083876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1344720 
End bp1346249 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content73% 
IMG OID643698232 
ProductCHAD domain containing protein 
Protein accessionYP_002354871 
Protein GI217969637 
COG category[S] Function unknown 
COG ID[COG3025] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACG AAATCGAACT CAAGCTCGCG CTGCCCAGGC GCGCGCTCCC GGCGCTGCGC 
CGCCACCCGC TGGTGGCCGC GGCGGAGAAA TGCGGCAACG CCCTCACCCT CGACAACACC
TACTACGACA CCCCGAAGCT GCAGCTCAAG GCGCGCAAGG TGGCCGTGCG CACCCGCCGC
CAGGGCCGCC AGATGCTGCA GACGGTGAAA TGCGCGGCGG TATCGAGCGG CGGGCTGTCG
CAGCGGCCGG AGTGGGAGAC CGCGTGGACC GGCGGCTTCG ATTTCACGCC GGTCGACGAC
CCCGCCACCG CGCGTCTGCT CGAACGCCAC CGCGCAGAAC TGGTGCCGGT GTTCACCACC
CGCTTCCGCC GCGAGACGCG GCGACTCGTC CCGCAGGCGG GCGTGTCCAT CCTGCTGATG
ATCGACACCG GCGCGGTGCA TGTGCGCACC CCGGAGGGCG TCGAGCGCGA GGCGGAGATC
TGCGAACTCG AGCTGGAACT CGAGAGCGGG CGCGCGCAGG ACCTCCTCGA CCTCGCCTGC
ACGCTCGCGC AGGACCTGCC GCTGATGCCG GCCGACCTCT CCAAGGCCGA ACGCGGCTAC
CGCCTTTTCC TCGACACGCC CGCGGGCGCG CTGCGCTCCG AGATCTCCAC GCTCGAGCCC
GGCCAGAACG TGGTCGAGGC CTTCCAGGGC CTGGCACTGT CCTGCGTGCG CCAGTGGCAG
GGCAACGCCG CGACCGCGCT CGCGCAGGGC GACCCCGACG CGATCGACCC CGACAACATC
CACCAGCTGC GCGTCGCCCA TCGCCGCCTG CGCGCGCTGC TCAAGATCTT CGCACCCGCC
CTGCCCGAGA CCTTCGCCGG CACCTGGAAC GCCCGCCTGC GCGACAACGC CAACCGCTTC
GGCGATGCGC GCGACCTGGA TGTGTTCCAC GCCGAGCTGC TCGAGCCGGT GACCCCCGAG
GGCCTCGCCG ACGCAGCCTC GATGGCGGCC CTGCTCGAGA CCGTGCGCAC GGCGCGCGCA
AGCGCCCGCC ACCACGCCGG CGTCAGCCTC GACCTCGCCA CCCAGGGCCG GCTGCTGCTG
GAGTTCACCG CCGCACTCCA CCGCCTGCGT GCCGACAGCC TCGCCGAAGC CGCCGACCTG
CGCAGCTTCG CCCGCCTGCG CCTCGACCGG CTGCGCAAGC GCGCCCGCCG CGGCGCCCGG
GCGGCGGCCA GCCTGGAGCC CACCCGCCTG CACGCGCTGC GCATCGACTT CAAGATGCTG
CGCTACGGCG TCGAGTTCTT CGCGCCGCTG TTCGGCACCA GGAGCATCAC CCGCTATCTC
GACGGCGTGG TGCGCGCGCA GACCACGCTC GGCTTCCTGC AGGACGTCGA CACCGCGCAC
CAGCGCCTGG CCGACTGGTC GCGAACGCAG CCCGCGCTCG CCGCGGCCGC GGCCTTCGTG
CTCGGCTGGC ACGCCCCGCG CTACGCCCGC CTGCGCCGCC GCGTGCTGCG CGAGTGCGAG
CCGCTGCTGT GGGGCGGCAA GCCGTGGTGA
 
Protein sequence
MSHEIELKLA LPRRALPALR RHPLVAAAEK CGNALTLDNT YYDTPKLQLK ARKVAVRTRR 
QGRQMLQTVK CAAVSSGGLS QRPEWETAWT GGFDFTPVDD PATARLLERH RAELVPVFTT
RFRRETRRLV PQAGVSILLM IDTGAVHVRT PEGVEREAEI CELELELESG RAQDLLDLAC
TLAQDLPLMP ADLSKAERGY RLFLDTPAGA LRSEISTLEP GQNVVEAFQG LALSCVRQWQ
GNAATALAQG DPDAIDPDNI HQLRVAHRRL RALLKIFAPA LPETFAGTWN ARLRDNANRF
GDARDLDVFH AELLEPVTPE GLADAASMAA LLETVRTARA SARHHAGVSL DLATQGRLLL
EFTAALHRLR ADSLAEAADL RSFARLRLDR LRKRARRGAR AAASLEPTRL HALRIDFKML
RYGVEFFAPL FGTRSITRYL DGVVRAQTTL GFLQDVDTAH QRLADWSRTQ PALAAAAAFV
LGWHAPRYAR LRRRVLRECE PLLWGGKPW