Gene Moth_2363 details


Gene Information

Locus tag: Moth_2363
Symbol:
ID: 3832543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2483046
End bp: 2484176
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 68%
IMG OID: 637830282
Product: putative transmembrane transcriptional regulator (anti-sigma factor)
Protein accession: YP_431188
Protein GI: 83591179
COG category: [K] Transcription
COG ID: [COG5662] Predicted transmembrane transcriptional regulator (anti-sigma factor)
TIGRFAM ID:
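
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2483046
END_BP = 2484176
GENE_LENGTH_BP = 1131
PROTEIN_LENGTH_AA = 376


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the span covers 1131 bp, and 1131 bp corresponds to 377 codons, of which the last is the stop.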


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.824355
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000839626
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGTGCC GCGAGGCAAA GGAGTTTTTC TCGCCCTACC TGGACGGGGA ACTCACAACA 
GAGGAAAGGG ATATAGTGCG GCGGCACCTG GCGGCCTGCC CGGCCTGCCG GGAGGAAATG
GCCCGCTGGG AGGAACTCTC CCGGGCCCTG CGGGAGTTAA AGGCGCCGGT GGCGGCGCCG
CCGGGTTTCG CGGCGGCGGT AATGGCGCGG GTCAACCCCG GCCGCCAGGC GCGGTCCTGG
TGGCGGGTGC GGCGCCTGGT GGCCGCAGCG GCGGCGGCGG TCCTGCTGGT CGCCGGCTCC
GTCGGCTATG CCGCCCGGGG CTTGTGGCAG CAGCTGCCGG CCAGCATCGC CGCACTGCAC
CCGGGCCATG ACGGCAACGG GCGCCAGGTT ACCGTCGCAC CCGGCGACAA AACAACTTCA
TCGGAGGCTA GCCAGCCGGG CGCAGGAGCC AACCCCGGCA ACCAGCCGGG CGACGGTACG
GCAACGGGCG CTAACAGCCC GGCGGGGACA ACGAGTAAAG AGTCCGGGAA CGGCGCCACC
GCCCCCGGTA ACGGCCTGGG GAGCAGCCAG CCCGCCGGCA GCGGGAAAAA TAGCAATTCT
CCCGGCAGCG CAGGCCGGCA GCCAGCGGGC GACAGCGGTA ACCGGCAACC TGCCATGGTG
GCAGGCAGCG AGCCTTACCC GGCCCGGACC TTCCTGAGCG CCGGGCGCCA GGTTACCAGC
ACCATGCTGA AGCTCAAGGT GACCGATATG GCCGCCGCCA GGGCTAAGGC GCTGGGTCTT
GGTGGCAGCG CCGGCGCGAC CGCCCAGGTC ATCGCCGTCC AGGACAACGG CCAGGAGAAA
AGGTTCATCT GCCAGTTTAC CGTGGCGGAA GACCGTGCGG CGGTTCTCCT GGCGGGTCTC
AAAGGCCTGG GCCAGGTAAT CTCCCAAAAT ACCAGCACCC AAGACCTGAC CCAGCAGTTC
AGCGCCACCC TGGAGCAGTA CCAGGCCAAA GTGGCCCAGG TGAACGCCGC CACCGACCCG
GCGGAAAAGG AGAAGCTCAC CCGGGAGGCC AAAGCCCTGG AGCAGCAACT GACCTCCTGG
GACGATGCCA GCAAGAAGCA GGTAATCATC TTATGGCTGG AGACGCAATA A
 
Protein sequence
MECREAKEFF SPYLDGELTT EERDIVRRHL AACPACREEM ARWEELSRAL RELKAPVAAP 
PGFAAAVMAR VNPGRQARSW WRVRRLVAAA AAAVLLVAGS VGYAARGLWQ QLPASIAALH
PGHDGNGRQV TVAPGDKTTS SEASQPGAGA NPGNQPGDGT ATGANSPAGT TSKESGNGAT
APGNGLGSSQ PAGSGKNSNS PGSAGRQPAG DSGNRQPAMV AGSEPYPART FLSAGRQVTS
TMLKLKVTDM AAARAKALGL GGSAGATAQV IAVQDNGQEK RFICQFTVAE DRAAVLLAGL
KGLGQVISQN TSTQDLTQQF SATLEQYQAK VAQVNAATDP AEKEKLTREA KALEQQLTSW
DDASKKQVII LWLETQ
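
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below is one way to strip the block formatting, compute GC content, and translate the gene sequence. It assumes the codon assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases. The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

# Codon assignments of the standard code (shared by translation table 11),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    first_blocks = "ATGGAGTGCC GCGAGGCAAA GGAGTTTTTC"
    print(translate(first_blocks))  # MECREAKEFF
```

Pasting the full gene sequence into `translate` and comparing the result with the cleaned protein sequence is a quick way to confirm that the two blocks in the record agree.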