Gene Tmz1t_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1970 
Symbol 
ID7084438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2221486 
End bp2222556 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content73% 
IMG OID643698995 
Producthypothetical protein 
Protein accessionYP_002355617 
Protein GI217970383 
COG category[S] Function unknown 
COG ID[COG4255] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00552765 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATCC AGCTCCTGAT CCCCGGCCTG CTATGGCCTG TCGCAACCTT GCTCGGGCCG 
GCCTCCGGAC TCGCGCTCGA GGGTCTCGCG ACCCTGCTCG GCCGCGGCCG CCGGGAGCTG
ACCGCCTTCG AGCCTTACGA TCGCCAGCTC GCGCGCCTGT TCGGCGTGCA CGGCGACACC
CTGCCGATGG CGACCCTGCG CCGCCTGGGC GAGGCCGATG CGCCGGCGCC CGAGCCCGGC
AGGCACTGGC TGTGCGCGGA TCCGGTGAAC CTGTCGTTCG CCCGCGAACA CCTGCTGCTG
CAGGCTTTTC CGGACGAAGA GCTGGACGCG GCGGAGAGCG CCGAGCTGGT CGCCGAGCTC
AACGCGATCT TCGCCGACCT CGGCCGCTTC GAAGCCTGCA CGCCCACGCG CTGGTACCTG
CGCCTGCACC GCCCGACCGC GGTCACGCTC TACCCGCTCG ACGACGTGAC CGGGCGCCCG
GTCAAGCACT TCCTGCCCGA AGGCGAAGAT GCCCGGTTGT GGCAGCGCAC CATGAACGAA
GCGCAGATCG TGCTGCACAA CCATGCGCGC AGCCGCGCCC GCGAGGAGGC CGGCCACCGC
GCGGTCAACA GCGTCTGGCT ATGGGGCGCG GGCGCGCTCG ATGCGCCGCC GCGGGCGCCC
GCCCGCCAGG TCCAGGCGAG CGATCCGGTC AGCATCGGCC TCGCGCGCGC TGCCGGGGTG
GCGTTCGGTG CGCCGGATCC CGCTGCAGCG CTTGCGCAGG ACACGCTGGT CGTCCTCGAC
GAGCTGCGCA AGCCCGCACA GCAGCTCGAC CTCGACACCT GGCGGCGCGG CCTCGAGGCG
ATGGAGCGCG ACTGGTTCGG CCCGCTCGCC GAGGCCTTCC GCGCCGGTCG CATCGACACC
CTGCGTCTGA CCGCCCCCGG CGATCGCGGC ACGCTGCAAC TCGAGCTGCG CGCCGGCGAA
CGCTGGAAGT TCTGGCGCAA GCCCTACGCC TTCGACGCGC TGCTGAAGTC CATCGCCCCC
GCGCCGATGC AGATGCCCGA CGCCCCGCGC CCCGCCCATG GCGCCCCATA G
 
Protein sequence
MQIQLLIPGL LWPVATLLGP ASGLALEGLA TLLGRGRREL TAFEPYDRQL ARLFGVHGDT 
LPMATLRRLG EADAPAPEPG RHWLCADPVN LSFAREHLLL QAFPDEELDA AESAELVAEL
NAIFADLGRF EACTPTRWYL RLHRPTAVTL YPLDDVTGRP VKHFLPEGED ARLWQRTMNE
AQIVLHNHAR SRAREEAGHR AVNSVWLWGA GALDAPPRAP ARQVQASDPV SIGLARAAGV
AFGAPDPAAA LAQDTLVVLD ELRKPAQQLD LDTWRRGLEA MERDWFGPLA EAFRAGRIDT
LRLTAPGDRG TLQLELRAGE RWKFWRKPYA FDALLKSIAP APMQMPDAPR PAHGAP