Gene Tmz1t_2039 details

Gene Information

Locus tag: Tmz1t_2039
Symbol:
ID: 7083799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2299831
End bp: 2302218
Gene Length: 2388 bp
Protein Length: 795 aa
Translation table: 11
GC content: 70%
IMG OID: 643699066
Product: RNA binding S1 domain protein
Protein accession: YP_002355683
Protein GI: 217970449
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
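
The length fields in this record agree with each other under two assumptions that the record itself does not state: the start and end coordinates are 1-based and inclusive, and the gene length counts the stop codon while the protein length does not. A minimal sketch of that arithmetic, with illustrative function names that are not part of the record:

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Number of residues, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2299831, 2302218)
print(length_bp)                  # 2388
print(protein_length(length_bp))  # 795
```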


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.883528
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCCCC CGATCGAATA CCGCATCGCC GAAGAGCTCG GCGTCTCGCC GCGCCAGGTC 
ATCGCCGCCG TCCAGCTCAT CGACGATGGC GCCACCGTGC CCTTCATCGC CCGCTACCGC
AAGGAAGTCA CCGGTGGCCT GGACGACACG CAGTTGCGCA CGCTGGAGGA GCGCCTGGGC
TACCTGCGCG AGCTCGAAGA CCGCCGCGCC ACCGTGCTGG CCTCGATCGA GGAGCAGGGC
AAGCTCACCG CCGAGCTGCG TGCCGAGGTC GAGGACGCCG ACACCAAGCA GCGCCTCGAG
GACCTCTACC TTCCCTACAA GCCCAAGCGC CGCACCAAGG CACAGATCGC GCGCGAGGCC
GGCATCGGCC CGCTGGCCGA GACATTGCTC GCCGACCCGA CCCTGACGCC CGAAGTCGAA
GCCGAAAAGT ACATCAACGC CGAGGCCGGT TTTGCCGACA CCAAGGCGGT GCTCGACGGC
GCGCGCCAGA TCCTGATGGA GCACTTCGCC GAGGACGCCA CCCTGCTCGG CGAGCTGCGC
AGCTACCTGC ACGAACATGG CCGGGTGCGC TCGAGCGTGA TCGAGGGCAA GGAGAACGAA
GGCGCCAAGT TCCGCGACTG GTTCGACTTC GCCGAACCGA TCGCCACCAT GCCCTCGCAC
CGTGCGCTGG CGCTGCTGCG CGGGCGCAAC GAAGGCGTGC TGCGCCTGGC ACTCGACGTC
GAGCGCCCCG ACCCCGACGC CCCGCATCCC TGCGAAGGCC GCATCGCCGT GCGCTTCGGC
ATCCGCAGCC AGGGCCGCCC GGGCGACCGC TGGCTCGCCG AGACCGCGCG CTGGGCGTGG
AGCGTGAAGA TCTCCATCCA CCTCGAACTC GAGCTGATGA GCGAGCTGCG CGAGCGCGCC
GAGGAAGAGG CGATCCGCGT CTTCGCGCGC AACCTCCACG ACCTGCTGCT CGCCGCCCCC
GCCGGCACGC GCGCCACCAT CGGCCTCGAC CCCGGAATCC GCACCGGAGT GAAGGTCGCG
GTGGTGGATG CCACCGGCAA GCTGGTCGAC ACCGCCACCG TGTATCCCTT CGAGCCGCGC
CGCGACCGCG AATCGTCGAT CGCCACCATC GCCGCGCTGG CGAAGAAGCA TGCGGTCGAG
CTGATCGCCA TCGGCAACGG CACCGCCTCG CGCGAGACCG ACGCGCTGGT GGCCGAGCTG
ATGAAGCGCT ACCCGGAGCT GCAGCTGACC AAGATCGTCG TCTCGGAGGC CGGCGCCTCG
GTGTATTCCG CGTCCGAACT CGCCGCACGC GAGTTCCCCG AGCTCGACGT CAGCCTGCGC
GGCGCGGTTT CGATCGCCCG CCGCCTGCAG GATCCGCTCG CCGAGCTGGT CAAGATCGAT
CCCAAGTCGA TCGGCGTCGG CCAGTACCAG CACGACGTCA ACCAGGGCCG CCTGGCGAAG
AGCCTGGACG CCGTGGTGGA AGACTGCGTG AACGCGGTCG GCGTGGATGT GAACACCGCG
TCTGCACCGC TGCTCGCACG CATCTCGGGT CTGAACGCCA CGCTCGCCGG CAACATCGTC
GAGTATCGCA ATACCCGGGG CCCCTTCCGC AGCCGCAGCG CATTGAAGGA CGTGCCCCGC
CTCGGCCCCA AGACCTTCGA GCAGGCCGCG GGCTTCCTGC GCATCCCCAG CGGCGACAAC
CCGCTCGACG CCTCCTCGGT GCACCCCGAG GCCTACCCGG TGGTCGAGCG CATCCTCGCG
CGCGTGCGGA AGAGCGTGCG CGAACTGATG GGCAACGGCG GCTTCCTCAA GAGCCTGAAA
CCGGAGGAGT TCACCGACGA GCGCTTCGGG CTGCCCACCG TGCAGGACAT CCTCGCCGAA
CTGGAAAAGC CCGGCCGCGA CCCGCGCCCC GAGTTCCGCA CCGCGACCTT CCGCGAGGGC
GTGGAGACCT TGAAGGACCT CGAGCCCGGC ATGCTGCTCG AAGGGGTGGT CACCAACGTC
ACCAACTTCG GCGCCTTCGT CGACATCGGC GTGCATCAGG ACGGCCTGGT GCATATCTCC
GCGCTGTCCA ACACCTTCGT CAAGGACCCG CACAGCGTGG TCAAGGCGGG TCAGGTGGTG
AAGGTGAAGG TGCTGGAGGT GGACATCCCG CGCCACCGCA TCGCGCTCAC CATGCGCCTG
GGCGACGAGG TGACTGCGAA GCGCGGCGAG TCCGGCGCCC GGCGCGACGA GGGTGCGCCG
CGGCGCGGGG AAGGCGGTCA GGCCCCGCGT AGCGAGCAGC GCCCGCGCGG CGGTGACAAC
CGGCGCCCGC CCGCGCAGGG CAAGGGTGGC GCGCGCCAGG AGGCTTCCCG CCCCGCCGCT
GCCAACGCAC TCGCCGAAGC ATTCGCCCGC GCCCGCGGCA ATCGCTGA
 
Protein sequence
MLPPIEYRIA EELGVSPRQV IAAVQLIDDG ATVPFIARYR KEVTGGLDDT QLRTLEERLG 
YLRELEDRRA TVLASIEEQG KLTAELRAEV EDADTKQRLE DLYLPYKPKR RTKAQIAREA
GIGPLAETLL ADPTLTPEVE AEKYINAEAG FADTKAVLDG ARQILMEHFA EDATLLGELR
SYLHEHGRVR SSVIEGKENE GAKFRDWFDF AEPIATMPSH RALALLRGRN EGVLRLALDV
ERPDPDAPHP CEGRIAVRFG IRSQGRPGDR WLAETARWAW SVKISIHLEL ELMSELRERA
EEEAIRVFAR NLHDLLLAAP AGTRATIGLD PGIRTGVKVA VVDATGKLVD TATVYPFEPR
RDRESSIATI AALAKKHAVE LIAIGNGTAS RETDALVAEL MKRYPELQLT KIVVSEAGAS
VYSASELAAR EFPELDVSLR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDVNQGRLAK
SLDAVVEDCV NAVGVDVNTA SAPLLARISG LNATLAGNIV EYRNTRGPFR SRSALKDVPR
LGPKTFEQAA GFLRIPSGDN PLDASSVHPE AYPVVERILA RVRKSVRELM GNGGFLKSLK
PEEFTDERFG LPTVQDILAE LEKPGRDPRP EFRTATFREG VETLKDLEPG MLLEGVVTNV
TNFGAFVDIG VHQDGLVHIS ALSNTFVKDP HSVVKAGQVV KVKVLEVDIP RHRIALTMRL
GDEVTAKRGE SGARRDEGAP RRGEGGQAPR SEQRPRGGDN RRPPAQGKGG ARQEASRPAA
ANALAEAFAR ARGNR
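
The first line of the protein sequence can be reproduced from the first 60 bases of the gene sequence. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons they permit). The helper name `translate` and the table layout are illustrative, not something the record defines.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino-acid string
# used for the standard code; assumed here to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGCTCCCCC CGATCGAATA CCGCATCGCC GAAGAGCTCG GCGTCTCGCC GCGCCAGGTC"
print(translate(first_line))  # MLPPIEYRIAEELGVSPRQV
```

Passing the whole gene sequence to the same function should yield the full protein sequence, since translation stops at the terminal TGA codon.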