Gene Moth_2273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2273 
Symbol 
ID3831384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2380468 
End bp2381469 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content57% 
IMG OID637830193 
ProductLacI family transcription regulator 
Protein accessionYP_431103 
Protein GI83591094 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0311079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000018406 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTCACTA TTAAAGACGT AGCCAGACAT GCCGGGGTGT CGGTTACTAC GGTCTCCCGG 
GTCCTCAACA ACAGCCAGCA TCCTATCAGT CCTGCTACCA AACAGCGCGT TCTAGCAGCC
ATCGAAGAAC TCGGCTTTTG CCCCAACGCC GCGGCCCGCA GCCTGCAGCT CAATGAAACC
AGGACCATTG GCCTTATCCT GCCGGATATC GCCAACCCCT ACTACCCCGG CATCGTCCGG
GGCGTCGAGG ACGTGGCCCA TGAATCCGGT TACACGGTTA TCCTCTGCAA TACCGACCGT
TCCCGTGAAC GTACTCAGGA ATACTTAAGA GTGCTACGGG AAAAGCGGGT GGACGGAGTA
ATCTTTACCG GCGGCGGGGC CGTGGAAGAC GCCAGCCAGA GCCACTTTTT CGACCAGGAA
AGAATAGCTA CCGTGGTTAT CGGACGTCAC CGTGGCAAAC TGCCGGCGGT GCAGGTGAAT
AATACCCTGG CGGCACGGGA GGCGGTAGAG CATCTGCTAT CCCTGGGACA CAGGCGTATC
GCAACTATCA CTGGACCGGC GACCTCAACT ACAGCCAGTG ATCGCCTGGA TGGTTACCGG
TGTGCCCTGG CCGGGCGAGG TATGAAAGTA GACCCCCTTT TAATCGTTGA AGGTAATTTT
GAATTCGAGA GCGGGTACCA GGCTATTGAC CGGCTGCCCC TCAGGGGCCC CGGGGCTATA
ACGGCTATCT TTGCCCATAA CGATTTGATG GCTATCGGGG CTATGAAGGC CCTTCAGGAA
CGGGGCTTAC AGGTTCCCGG CGATATAGCA GTTATGGGCT TTGACAATAT TCCCCTCGCT
TCCTTTATCA CCCCCCAGCT TTCTACCGTT GCTGTTCCTG TCTATGACCT GGGGGTAACG
GCCATGAAGG TGCTGGCTGA GCTCCTCGCC GGCCGGGAGG TGCCGCCGGT CACCACCCTG
GCTACCAAGC TCCAGGTCCG GGACTCTACT ATAATTAAAT AA
 
Protein sequence
MVTIKDVARH AGVSVTTVSR VLNNSQHPIS PATKQRVLAA IEELGFCPNA AARSLQLNET 
RTIGLILPDI ANPYYPGIVR GVEDVAHESG YTVILCNTDR SRERTQEYLR VLREKRVDGV
IFTGGGAVED ASQSHFFDQE RIATVVIGRH RGKLPAVQVN NTLAAREAVE HLLSLGHRRI
ATITGPATST TASDRLDGYR CALAGRGMKV DPLLIVEGNF EFESGYQAID RLPLRGPGAI
TAIFAHNDLM AIGAMKALQE RGLQVPGDIA VMGFDNIPLA SFITPQLSTV AVPVYDLGVT
AMKVLAELLA GREVPPVTTL ATKLQVRDST IIK