Gene Moth_1380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1380 
Symbol 
ID3831627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1425155 
End bp1426366 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content55% 
IMG OID637829316 
ProductCdaR family transcriptional regulator 
Protein accessionYP_430236 
Protein GI83590227 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000902469 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCAC AAAAATGGCG CCGTTTTCTG GAAATGGCTG CTGCGGGCAA GGGCTTGGTC 
TCCATTGCCC GCTACCTGGC GGAGGTTAGC GGGCGGCCGG TTGTAATCTG CGACCTAACC
CTGCGTATCC TCGCCAGCCA GGCGCTGCCG AGTAAGACCC TGCACTTTGG TGACTACCTG
CTAGTGGAAC TCCCCAGGGA GACGGTAGAG GGGGCGTTTT ACCGCGGCCG TTTACAGAAG
ACCGGTACTG GAACCCCCTT TCTCATGCTC CCAATTGGTG AGGTTACGCT GTATGGTTAC
CTCTTTCTCC TGGAGGTGGG CGAAGACTGG CGCCCTTATC GGGAGCCCCT GAAAGCAGCC
GCTCTGGCGG CCATGATTGA GATGTCCCGG GCCCGCATAG CCCAGGAAAC CGAGAGGCGT
TACCGGAATG AATTCATCCA GGATATTCTT TATAATAATC TACCAAACCG GGAAGCCATG
GTCAACAGGG GTCGCCTCTG GGGCTGGGAT TTAACCCGTC CCCATTTACT GGTGGTCCTA
TCCCTGGACC GTCAGGCCCA GCAGGAAAGG GACGAAACCC TGTGGGAACG GTGGCGCCAG
TTAATGCAAT ATCACCTGAA ACACCAGGCG CCGGAGATTA TCCTTGCTGA TCGCGGTGAC
CAGCTAATCC TCTTGATACC GGTGGTTACA GATGAATTAG GGGTTAATAA AGGAAAAATT
GCCGGGCTGA TTAAATCCTT GCAAAAGTCG ACTGCCGCTC ACCTGGAAGG CAGGACCTTC
TCCGCCGGTG CAGGGCGCTT TTACGAAGAT GTCACCGATC TCTACCGGGC CTACCAGGAA
GCCAAGGTAG CTCTGGAGAT CAGCCGCCTC CTGCGGCGCC GGGGTACTTT GACCTTTTTT
GATGAACTCG GAGTGCTGCG GCTGATCTTT AACCAGGGGG AACAGGAACT GGAGGACTAT
TACGAAGAAA CCCTGGGTGC AATCCAGAAA TATGATGCCA AGCATAATAC CAATCTCATG
GAAACCCTGG CTGCCTACCT GTATGCCTCC GGGGATCATA ACCGGGCGGC CGAGGACATG
TTTATCCACG TCAACACCCT GCGCTATCGT TTAAAAAAAA TAGAAGAACT CCTGGGCCAG
GACTTGCGCA GGATTGATGT CCAGGTTAAC CTGTATACCG CCCTCCAGGT TAAGGTAATG
CTCGGCCGGT AA
 
Protein sequence
MEPQKWRRFL EMAAAGKGLV SIARYLAEVS GRPVVICDLT LRILASQALP SKTLHFGDYL 
LVELPRETVE GAFYRGRLQK TGTGTPFLML PIGEVTLYGY LFLLEVGEDW RPYREPLKAA
ALAAMIEMSR ARIAQETERR YRNEFIQDIL YNNLPNREAM VNRGRLWGWD LTRPHLLVVL
SLDRQAQQER DETLWERWRQ LMQYHLKHQA PEIILADRGD QLILLIPVVT DELGVNKGKI
AGLIKSLQKS TAAHLEGRTF SAGAGRFYED VTDLYRAYQE AKVALEISRL LRRRGTLTFF
DELGVLRLIF NQGEQELEDY YEETLGAIQK YDAKHNTNLM ETLAAYLYAS GDHNRAAEDM
FIHVNTLRYR LKKIEELLGQ DLRRIDVQVN LYTALQVKVM LGR