Gene Tmz1t_1691 details

Gene Information

Locus tag: Tmz1t_1691
Symbol: nusA
ID: 7084111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1897156
End bp: 1898631
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 65%
IMG OID: 643698712
Product: transcription elongation factor NusA
Protein accession: YP_002355342
Protein GI: 217970108
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
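
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon of the CDS is a stop codon that contributes no residue. The function names are illustrative only and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length_bp(1897156, 1898631)
print(length_bp)                     # 1476
print(protein_length_aa(length_bp))  # 491
```

Both values agree with the Gene Length and Protein Length fields of this record.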


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.349056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCG AGATCCTGCT GCTTGTCGAT GCCTTGGCGC GCGAGAAGAA CGTCGCCAAG 
GACATCGTTT TTTCCGCGCT CGAAACCGCC TTGGCCTCTG CGACCAAGAA ACGCATCCAC
GACGATGCCG ACGTGGTGGT GTCGATCGAC CGCGACTCCG GCGACTACAC CTCGAAGCGC
CGCTGGCTGG TGATGCTCGA CGAGGAGGTC GCGAACGACG AGGCCGAGAT GGGCATCATC
GATGCCCGCG AGCTGCGCGC CGACGTGCAG ATCGGTGACT ACATCGAGGA AGAGCTCGAG
CCGATCGACT TCGGTCGCAT CGGCGCCCAG GCCGCCAAGC AGGTCATCCT GCAGAAGATC
CGCGACGCCG AGCGCGAGCA GGTGCTCAAC GACTTCCTCG ACCGCAAGGA GTTCCTCGTC
TCCGGCTCGA TCAAGCGCAT GGAGCGCGGC AACGCGATCA TCGAGGTCGG CCGCATGGAA
GCCGTGCTGC CGCGCGACCA GCAGATCCCG CGCGAGAATC TGCGCGTGGG CGATCGGGTC
AAGGCCTTCC TGCTGCGCAT CGACCGCGGC GCGCGCGGCC CGCAGCTGGT GCTGTCGCGC
ACCGCGCCCG AATTCCTCAT GAAGCTCTTC GAGCTCGAGG TCCCCGAGAT CGAGGACGGC
CTGCTCGAGC TCAAGGCCTG CGCCCGCGAC GCCGGCCTGC GCGCCAAGAT CGCGGTCAAG
TCCAACGACC AACGCATCGA CCCGATCGGT ACCTGCGTCG GCCTGCGCGG CTCGCGCGTC
ACCGCCGTGC GCAACGAGAT CGCCGGCGAG CAGATCGACA TCATCGTGTG GTCGCAGGAT
CCCGCCCAGT TCGTGGTCGC CGCGCTGCAG CCCGCCGAGG TCGTCTCCAT CGTCGTGGAC
GAGGAGTCGC ACGCGATGGA CGTGGTGGTC GACGAGAACA ACCTCGCGAT CGCCATCGGC
CGCAGCGGCC AGAACGTCAA GCTCGCCTCC GAGCTCACCG GGTGGACGAT CAACCTGATG
AGCGAGCAGG AGTCGGCCGA AAAGACCGCC CAGGAGCAGC AGGGCCTGCG CGCGCTGTTC
ATGGAAAAAC TGGACGTCGA CGAGGAAGTC GCCGACATCC TGATCGAGGA GGGTTTCTCC
TCGCTCGAAG AGGTGGCCTA CGTGCCGCTC TCCGAAATGC TCGAGATCGA GGCCTTCGAC
GAGGACACGG TCAACGAACT GCGCAATCGA GCGCGCAATG TGCTGCTGAC CGAGGCCATC
GTCACCGAGG AGCAGCTCGA GAAGGTTTCC GACGACTTGC TCGGCCTTGA AGGCATGGAC
AAGTCGCTGG CCGCCACACT GGCCCAGCAG GGCATTCGTA CCCGCGACGA CCTGGCCGAC
CTTGCGGTCG ACGAGCTGGT CGAAATGGCC GGGATCGACG AAGAAAGAGC CAAGGCGCTG
ATTTCCGTTG CGCGCGCCCA TTGGTTCGAA GAATGA
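
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be normalised before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from such text. The helper names are hypothetical, and the rounding the database applies to reach its reported percentage is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full "Gene sequence" block into `raw`; only the first
# two blocks of ten bases are shown here.
raw = "ATGAGCCGCG AGATCCTGCT"
seq = clean_sequence(raw)
print(len(seq), gc_percent(seq))
```

Applied to the full sequence, the cleaned length should equal the Gene Length field.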
 
Protein sequence
MSREILLLVD ALAREKNVAK DIVFSALETA LASATKKRIH DDADVVVSID RDSGDYTSKR 
RWLVMLDEEV ANDEAEMGII DARELRADVQ IGDYIEEELE PIDFGRIGAQ AAKQVILQKI
RDAEREQVLN DFLDRKEFLV SGSIKRMERG NAIIEVGRME AVLPRDQQIP RENLRVGDRV
KAFLLRIDRG ARGPQLVLSR TAPEFLMKLF ELEVPEIEDG LLELKACARD AGLRAKIAVK
SNDQRIDPIG TCVGLRGSRV TAVRNEIAGE QIDIIVWSQD PAQFVVAALQ PAEVVSIVVD
EESHAMDVVV DENNLAIAIG RSGQNVKLAS ELTGWTINLM SEQESAEKTA QEQQGLRALF
MEKLDVDEEV ADILIEEGFS SLEEVAYVPL SEMLEIEAFD EDTVNELRNR ARNVLLTEAI
VTEEQLEKVS DDLLGLEGMD KSLAATLAQQ GIRTRDDLAD LAVDELVEMA GIDEERAKAL
ISVARAHWFE E
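
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below builds the codon table from the NCBI amino-acid string shared by translation tables 1 and 11. Table 11 differs mainly in its set of permitted start codons, which does not matter here because this gene begins with ATG. The function name `translate_cds` is illustrative, and the code assumes the input contains only the bases A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI tables 1 and 11); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS and drop a trailing stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate_cds("ATGAGCCGCG AGATCCTGCT GCTTGTCGAT"))  # MSREILLLVD
```

The printed residues match the first block of the protein sequence. Any internal stop codon would appear as "*" in the output.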