Gene Information |
Locus tag | Moth_1380 |
Symbol | |
ID | 3831627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1425155 |
End bp | 1426366 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829316 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_430236 |
Protein GI | 83590227 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000902469 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCAC AAAAATGGCG CCGTTTTCTG GAAATGGCTG CTGCGGGCAA GGGCTTGGTC TCCATTGCCC GCTACCTGGC GGAGGTTAGC GGGCGGCCGG TTGTAATCTG CGACCTAACC CTGCGTATCC TCGCCAGCCA GGCGCTGCCG AGTAAGACCC TGCACTTTGG TGACTACCTG CTAGTGGAAC TCCCCAGGGA GACGGTAGAG GGGGCGTTTT ACCGCGGCCG TTTACAGAAG ACCGGTACTG GAACCCCCTT TCTCATGCTC CCAATTGGTG AGGTTACGCT GTATGGTTAC CTCTTTCTCC TGGAGGTGGG CGAAGACTGG CGCCCTTATC GGGAGCCCCT GAAAGCAGCC GCTCTGGCGG CCATGATTGA GATGTCCCGG GCCCGCATAG CCCAGGAAAC CGAGAGGCGT TACCGGAATG AATTCATCCA GGATATTCTT TATAATAATC TACCAAACCG GGAAGCCATG GTCAACAGGG GTCGCCTCTG GGGCTGGGAT TTAACCCGTC CCCATTTACT GGTGGTCCTA TCCCTGGACC GTCAGGCCCA GCAGGAAAGG GACGAAACCC TGTGGGAACG GTGGCGCCAG TTAATGCAAT ATCACCTGAA ACACCAGGCG CCGGAGATTA TCCTTGCTGA TCGCGGTGAC CAGCTAATCC TCTTGATACC GGTGGTTACA GATGAATTAG GGGTTAATAA AGGAAAAATT GCCGGGCTGA TTAAATCCTT GCAAAAGTCG ACTGCCGCTC ACCTGGAAGG CAGGACCTTC TCCGCCGGTG CAGGGCGCTT TTACGAAGAT GTCACCGATC TCTACCGGGC CTACCAGGAA GCCAAGGTAG CTCTGGAGAT CAGCCGCCTC CTGCGGCGCC GGGGTACTTT GACCTTTTTT GATGAACTCG GAGTGCTGCG GCTGATCTTT AACCAGGGGG AACAGGAACT GGAGGACTAT TACGAAGAAA CCCTGGGTGC AATCCAGAAA TATGATGCCA AGCATAATAC CAATCTCATG GAAACCCTGG CTGCCTACCT GTATGCCTCC GGGGATCATA ACCGGGCGGC CGAGGACATG TTTATCCACG TCAACACCCT GCGCTATCGT TTAAAAAAAA TAGAAGAACT CCTGGGCCAG GACTTGCGCA GGATTGATGT CCAGGTTAAC CTGTATACCG CCCTCCAGGT TAAGGTAATG CTCGGCCGGT AA
|
Protein sequence | MEPQKWRRFL EMAAAGKGLV SIARYLAEVS GRPVVICDLT LRILASQALP SKTLHFGDYL LVELPRETVE GAFYRGRLQK TGTGTPFLML PIGEVTLYGY LFLLEVGEDW RPYREPLKAA ALAAMIEMSR ARIAQETERR YRNEFIQDIL YNNLPNREAM VNRGRLWGWD LTRPHLLVVL SLDRQAQQER DETLWERWRQ LMQYHLKHQA PEIILADRGD QLILLIPVVT DELGVNKGKI AGLIKSLQKS TAAHLEGRTF SAGAGRFYED VTDLYRAYQE AKVALEISRL LRRRGTLTFF DELGVLRLIF NQGEQELEDY YEETLGAIQK YDAKHNTNLM ETLAAYLYAS GDHNRAAEDM FIHVNTLRYR LKKIEELLGQ DLRRIDVQVN LYTALQVKVM LGR
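The length and composition fields in this record are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values listed above but are not stated by the record itself. The function names (gene_length, protein_length, clean_sequence, gc_percent) are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    length = gene_length(1425155, 1426366)
    print(length)                  # 1212, matching the Gene Length field
    print(protein_length(length))  # 403, matching the Protein Length field
    # GC content of the first two 10-base blocks only, as a short demo.
    print(gc_percent("ATGGAACCAC AAAAATGGCG"))
```

Passing the full Gene sequence field to gc_percent would give the GC content of the whole gene, which can be compared with the rounded GC content value reported in the record.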