Gene Moth_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1047 
Symbol 
ID3831853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1078015 
End bp1079103 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content56% 
IMG OID637828975 
ProductNusA antitermination factor 
Protein accessionYP_429904 
Protein GI83589895 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000154988 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000137146 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAACAGTG AGTTTATCCA GGCACTACGG GACCTGGAGA GGGAAAAGGG TATTAACGCC 
GATATTTTAC TGGAGGCCAT TGAAGCGGCT TTAATCTCGG CTTACAAGAA GAACTTTGGC
TCCCTGCAGA ATGTCAGGGT GGACATCCAG CGTGATACCG GGGAGATTAA GGTTCTGGCC
CAGCGCCAGG TGGTAGAAGA GGTAACCGAT CCCCGGCAGG AGATCTCCCT CGAGGAAGCT
CGGGCCATCA ACAGTAAATA TGAACTGGGG GACATAGTGG AGAAAGAGGT TACTCCCAGG
GATTTTGGCC GCATCGCTGC CCAAACGGCC AAACAGGTGG TCGTCCAGCG CATCCGGGAA
GCCGAGCGCG GCTTGATTTA TGAAGAATTT ATCGGCCGGG AAAATGACCT CGTTACGGGT
GTAGTCCAGC GCCAGGAGGG CAAAAACATT ATCCTTGACC TGGGCCGGGC CGAGGCGATC
CTGCTTCCCA GCGAACAGAG CCCCGGAGAG ACCTACCGCC AGGGCGAACG CCTGAAGGTC
TATGTCCTGG AGGTCAGGAA GACTAACAAA GGGCCCCAGA TTCTCGTGTC CCGGACCCAT
CCCGGCTTGA TAAAGAGGCT TTTCGAGCTG GAAGTTCCGG AAATCCACGA TGGCATTGTT
GAAATTAAGG GAGTCGCCAG GGAACCTGGG GCGCGCTCCA AGATTGCCGT TCATTCCCGG
GATGAAAAGG TGGATCCGGT GGGCTCCTGC GTAGGTCCCA AGGGGGCACG GGTACAGGCT
GTGGTCCAGG AGCTGCGGGG CGAGAAGGTA GATATCATTA AATGGAGCGA TGACCCGGCT
GTTTATGTGG CCAACTCCTT GAGCCCGGCC CGGGTCCTGG ACGTGACTGT CGACGAAGAA
AATAAGGTGA GCCAGGTCAT CGTTCCTGAT AACCAGCTCT CCCTGGCCAT TGGTAAGGAA
GGCCAGAATG CCCGCCTGGC AGCCAGGATC ACCGGCTGGA AAATCGATAT TAAACCGGAA
TCCGAAGCTG GCGATTGGGA TTCCTGGGAT GCCGACCTGG ATCTTGACGG CACGATAGAG
GAGGAGTAA
 
Protein sequence
MNSEFIQALR DLEREKGINA DILLEAIEAA LISAYKKNFG SLQNVRVDIQ RDTGEIKVLA 
QRQVVEEVTD PRQEISLEEA RAINSKYELG DIVEKEVTPR DFGRIAAQTA KQVVVQRIRE
AERGLIYEEF IGRENDLVTG VVQRQEGKNI ILDLGRAEAI LLPSEQSPGE TYRQGERLKV
YVLEVRKTNK GPQILVSRTH PGLIKRLFEL EVPEIHDGIV EIKGVAREPG ARSKIAVHSR
DEKVDPVGSC VGPKGARVQA VVQELRGEKV DIIKWSDDPA VYVANSLSPA RVLDVTVDEE
NKVSQVIVPD NQLSLAIGKE GQNARLAARI TGWKIDIKPE SEAGDWDSWD ADLDLDGTIE
EE