Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1399 |
Symbol | |
ID | 3831685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1446185 |
End bp | 1447429 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637829335 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_430255 |
Protein GI | 83590246 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAGAG GTAAATTAAA GCTGATAATT GTTTTCCTGT TGGCCTGCCT GGCGGCAGGC GGCGGGTTCC TTGCCGCCAG GCTGTTCTTT TTTCCTCCTG TATCTGGCCA GGAAACGGGG TCCACGGATA CCGGCGGGAG TCAGCCAGGG ACCCTTAATA TTTTACTTTT AGGCACCGAC GCCCGCCCCG GCGAAAAGGT GGGTAATACA GACACCATCA TCCTGGCCCA CTTTGACGGG GAGAGGCTGG CCCTGTTGTC CATTCCCCGC GACACCAGGG TAAATATCCC CGGCCACGGG GTGGACAAGA TCAATGCCGC TTATAGTATA GGCGGTCCCG ATTTAACAAC CAGCATCGTA GCAGACCTAA CCGGTGTGCC CATATCCAAG TACGTCTTAC TTCGCTGGGA CGGTTTCATT AAAATAATCG ATCTCCTGGG GGGGGTGACG GTGAATATCC CCAGGGATAT GTACTACTAT GACCCGGTTG ACGGGCCCCA GTATAAAATC AACTTGAAGA AGGGTCTCCA GCACCTGGAT GGCCACCAGG CCCTGGCCTT TGTCCGCTTC CGGAAGGAAG CCCTGGGTGA CATTGACCGT ACCGGCCAGC AGCAGGAGCT CATCAAAGCC CTGCTGGAAA AGGTCCGGCA GCCGGGTACG TTATTAAAAA TGCCCCGGCT GCTGCCTGAG ATCTATAAGA ACGTCGAAAC TAATATGGGC CTGGACGAAA TGCTAACCAT GGCCAGGGCA GGCTTGCACC TGAAAAATAT GACTGTTGTT AGCCAGACCC TACCAGGGTA TTTTCAAACT ATAAATGGCA TTAGCTACTG GGGAGTGGAT CCTGCTCAGG CCCGGCAGGT AGCCCAGGCC CTGTTTGAAT ACGGGCAAAC AACCAAGCAG GTTGTCCTGG ACGCCCCGGC TTCCCAGACA TCCAGCAACA GTAAGGGGAC AACGTCTACC TTTAAGGCCG ATACCAGGGG AACCTCTACC AGCAACCGTG TATCATCCCC ACGCCCACCC GCTGTAAAGG ATATCATCAT AACAACCCCT ACTCATGATA AAGGTTCAAA CCGTACGCTG TCGTCTAATA ACCCGCCCTC CCAAAAAGCA GGTACAGGTA TAACTACCAA CAATGGCAAG GAACCGCCTG CCAGTGGAAC AAAAAACAGT AATACTGCGA ACCCTGGTAG TACTTCCGGT TCAACGCCAG GCAATGGGCC GGGTAGTACA GATAAATCGG GTTAA
|
Protein sequence | MLRGKLKLII VFLLACLAAG GGFLAARLFF FPPVSGQETG STDTGGSQPG TLNILLLGTD ARPGEKVGNT DTIILAHFDG ERLALLSIPR DTRVNIPGHG VDKINAAYSI GGPDLTTSIV ADLTGVPISK YVLLRWDGFI KIIDLLGGVT VNIPRDMYYY DPVDGPQYKI NLKKGLQHLD GHQALAFVRF RKEALGDIDR TGQQQELIKA LLEKVRQPGT LLKMPRLLPE IYKNVETNMG LDEMLTMARA GLHLKNMTVV SQTLPGYFQT INGISYWGVD PAQARQVAQA LFEYGQTTKQ VVLDAPASQT SSNSKGTTST FKADTRGTST SNRVSSPRPP AVKDIIITTP THDKGSNRTL SSNNPPSQKA GTGITTNNGK EPPASGTKNS NTANPGSTSG STPGNGPGST DKSG
|
| |