Gene Moth_1635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1635 
Symbol 
ID3831264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1670275 
End bp1671585 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content53% 
IMG OID637829560 
Producthypothetical protein 
Protein accessionYP_430480 
Protein GI83590471 
COG category[S] Function unknown 
COG ID[COG2855] Predicted membrane protein 
TIGRFAM ID[TIGR00698] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.972059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACG GTAATAAGCG CAGTCCCTTT CTTGCCAGTG AAGACTGGTG GTCAGTTTAC 
CTGGGCCTTT TTCTGGTGCT ATTAACATAC TTCGCTTTTA AAGCCGGTTC TTCCCTGGAT
TTTTTAAAGG CTGCCATGCC AGTTGAATGG CCGACCAAAA GCCTGGGGGC CCACTTCGCT
GCTAATATCG GCGCCTATAT TGCCATGTAC TTTATTCTGC TGGTACTAAC CACCATTGCC
GTAGCCGTTA TGGGCGGCAA GGTGGGCAAT TATATTGCTT CCTTTACAGT TTTATTCATA
GCTTCCCTCA TTATTTTAAT TATTGGGAGC CAGCATACGA TCAAGCATTA TGGACTGGAG
TACCCCTTCT GGTCCCTGGT AATAGGACTC ATTATTGGAA ACTTTACGAC ATTACCTGAG
TGGTTCCAGG AAGGCGCCAA AAGAACGGAA TTCTTTATTA AGACCGGTAT TGTCCTGCTG
GGTGCCGGCT TGCCCTTTAC CGTCATCGTT TCTGGCGGCG TCTGGGGCTT CCTGGAGGCC
ATATGCATTG TTGCCATTGG CTTCACTGTG GCCTTTACCA TTGCCCGCAG GCTGGGTTAT
GATCCCCGTT TTGCGGCCGT CCTGGGTGCT GGCGGTTCCG TCTGTGGCGT TTCGGCAGCC
ATTGCCGTCG GGAGTTCGGT TAAAGCTGAA GAGAAGCATG TCGGTTACGT GGTTTCTTTG
GTGGTCCTGT ATGCGCTAGT TCTCATTTTC CTCTTGCCAG TATTGGGTAG ACTTTTCGGC
CTCAACGAGT ATGTCACCGG GGCCTGGATT GGCGGTTCCG AACTGGCCGA TGCTGCCGGC
CTGGCTGCAG CAGCCATGGT TTCAGATAAT GCCGTCAAAG CCTTTACCCT GGTGAAACTC
AACCGCGACG TGATGATCGG TGTCCTGTCC TTTATCTTTG CTACCCTGGC GGTCACTCGC
TGGGAGGTTG CAGCCAGCGG CGAGCGGCCC AGCGCCATGG TTATCTGGGA GCGTTTCCCC
AAGTTCGTCC TGGCCTTCCT GGTAGCTTCG TTCATCACGA CTTCATGGGT CGTGTCCCTG
GGGAAACCCG CTGTTGATGC CCATATCTCG GCCAACCTCA CTACCATCCG CACCTGGCTC
TTCGTCCTGG CTTTCCTGTG CATCGGCCTG AACACCAAGA TCCAGGATAT CCGGGCCATG
GGCCGGAAAC CCATCATTGC CTTCACTACG GTGGTCCTGG TTAACGTCAT CGTGGGCTTC
ATTGTTGCCA ACCTCTTCTT TGGCGGTATC ATTGCCGCAC CGCTGCATTA A
 
Protein sequence
MPNGNKRSPF LASEDWWSVY LGLFLVLLTY FAFKAGSSLD FLKAAMPVEW PTKSLGAHFA 
ANIGAYIAMY FILLVLTTIA VAVMGGKVGN YIASFTVLFI ASLIILIIGS QHTIKHYGLE
YPFWSLVIGL IIGNFTTLPE WFQEGAKRTE FFIKTGIVLL GAGLPFTVIV SGGVWGFLEA
ICIVAIGFTV AFTIARRLGY DPRFAAVLGA GGSVCGVSAA IAVGSSVKAE EKHVGYVVSL
VVLYALVLIF LLPVLGRLFG LNEYVTGAWI GGSELADAAG LAAAAMVSDN AVKAFTLVKL
NRDVMIGVLS FIFATLAVTR WEVAASGERP SAMVIWERFP KFVLAFLVAS FITTSWVVSL
GKPAVDAHIS ANLTTIRTWL FVLAFLCIGL NTKIQDIRAM GRKPIIAFTT VVLVNVIVGF
IVANLFFGGI IAAPLH