Gene Moth_1262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1262 
Symbol 
ID3833057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1306526 
End bp1307569 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content45% 
IMG OID637829198 
ProductLacI family transcription regulator 
Protein accessionYP_430119 
Protein GI83590110 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000350604 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000118995 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGAACCGT TACCAAGAGG AATGAAAAAT ATGCGCATAA CCATCAAAGA TATAGCCCAA 
AGAAGCGGGG TATCCTGTTC TACGGTTTCA CGTGTTTTAA CAAACCACCC CAACGTTGAT
CCCAAAACCA GGGAACGGGT CAAGCAAGTT ATAGATGAAC TTGGTTACCG CCCCAGTCGC
ATTGCCCGCG GTCTGGTTAT GGGCCAAATT AACGTAGTTG CCCTTATAAT TGGCGATATT
CGTAACCCTT TCTATGCAGA ACTAACCCGT GCCGTAAAAG ATATCCTGAA TAAAGAAGGT
TATATGGTTG TAGTTAGCGA TAGTGATTAT GACCCGCAAA AAGAGGAGAT ATATATTCGG
GCAGCCGAGG AATATGGCTT TGCTGGCATT ATCATGATTA CAGCCATGGA AACCGAGGCT
TTGATCCAAC AGTTAGAAAA ATTACGCTGC CCGGTTGTTT TGCTCAATCG CTACCTACCC
TCCGTCGAGA CAGATGTTAT CTCCGTAGAT AATTATCTAG GCGGTTACCT GGCCGCTGAG
CACCTCATCA AGCTAGGGCA TCGTAATATC GCTCACCTGG CTGGTTTTAA AAACTCCAGT
GCTACCCGAG ATCGCCTTCG TGGCTTCATC GACGCCCATG TCCATTATGG TATCCAGTTA
AATCAAGAAA GAATAGTTTA TGGTAATTTG CAGATGGAAG CAGGGTATAA ATTCGCTAAA
GAATATTTAA GCCAGAACGA AGACATTACT GCAGTCTTTT GTGGAAACGA TCTCATGGCC
CTGGGATTGA TAGAAGCCCT GTACGAGGAG GGGAAAGAAA TACCCCGAGA TATAAGTGTT
ATAGGTTACG ATGATATTGA CATGGCTTCT CTGGCAAGGG TAAAACTCAC GACCATCCGC
CAGCCCCAGT ATGAAATGGG CCAGACAGCT GCCGAGGTCT TAATTGACAG AATGAAAGGT
AAAATAGGAG CACCAAAACG TATTATCTTT ACACCTAAGC TGATCATCCG TGAGAGTACC
GCCGAGTATA AACCCGGTAA ATAA
 
Protein sequence
MEPLPRGMKN MRITIKDIAQ RSGVSCSTVS RVLTNHPNVD PKTRERVKQV IDELGYRPSR 
IARGLVMGQI NVVALIIGDI RNPFYAELTR AVKDILNKEG YMVVVSDSDY DPQKEEIYIR
AAEEYGFAGI IMITAMETEA LIQQLEKLRC PVVLLNRYLP SVETDVISVD NYLGGYLAAE
HLIKLGHRNI AHLAGFKNSS ATRDRLRGFI DAHVHYGIQL NQERIVYGNL QMEAGYKFAK
EYLSQNEDIT AVFCGNDLMA LGLIEALYEE GKEIPRDISV IGYDDIDMAS LARVKLTTIR
QPQYEMGQTA AEVLIDRMKG KIGAPKRIIF TPKLIIREST AEYKPGK