Gene Moth_1414 details


Gene Information

Locus tag: Moth_1414
Symbol:
ID: 3832242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1458303
End bp: 1459580
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 53%
IMG OID: 637829350
Product: hypothetical protein
Protein accession: YP_430270
Protein GI: 83590261
COG category: [S] Function unknown
COG ID: [COG3681] Uncharacterized conserved protein
TIGRFAM ID:
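
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the gene sequence in this record ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1458303, 1459580)
print(length, protein_length_aa(length))  # prints "1278 425"
```

Both values agree with the Gene Length and Protein Length fields listed in the record.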


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000941621
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTGC TTGATCAGCA AACTTTGATA AATTTATTAC ATCAAGAAGC CGATGTAGCA 
ATCGGGTGTA CGGAACCGGT AATGGTGGCC CTGGCCGCTG CTAAAACCAG GGATATGCTG
GGTACTTTAC CGCGGCTGGT GGACATTTCC GTGAGTTCGG CGGTTTGGAA GAATGCTCGC
CGGGTTGGTT TGCCGGGAAC CGGAGAGAAG GGCCTGGCAA TGGCGGCGGC CATGGGATTG
CTGGCGCCGG TAGAGGCAGG CCAGCGCCTG CTGGCCGCTT TAACTCCCGT GCAGGTGGAA
CAGGCAAAGA TATTGGTCCG GGAGGGAGTT GTCAAGGTCG GGGTTGTTGC CGCCAAGGAG
GGTTTATATG CCCGGGCTGT GGCCCGGTCC AACCAGCATG AGGCCATAGT CGAGTTAAAC
GGCAGCCATA AAAACTTCTC GGCCTTATGG TTGGATGGGA GGATGGCAGG AGGCGCAGGA
GAGAATTTAA ATTTAAAACT GGAAGCGTTG TTGGCGCAGG ACTACCAGTC CCTGCTAAAA
CAAGTTTTAT CCCTATCACC GGAGGAGCTA TATTTTTTAT ACCAAGGGGC TGAAGATATT
CTAACCTTTG CCCGGGAAAT CCATCAAGGC GGTAGGAATC CCCTTTCCGC CATGGCTTCG
TTTTTCAGGC GAACAGAAAG TGGAGGGGAA AGTTTAGAAG TACTTATCCG TAACCTCACA
GGTATCGCGG TGGCAGAGCG GATGGCGGGA GCTACATACC CCGTCTTGAC CTGCGCCGGC
AGTGGGAACC AGGGTATCTT GGCAGCAGTA TCGTTGCTAT TAGCAGGCCA GGAATTGCGA
GCCGGTCCGG AGAGTGTGAC CCGGGCCCTG GCAATAGCTC ACTTTACCAA CATGTATCTG
AAGGCCTATA CCGGGAAGCT ATCACCATTA TGCGGGGCAG TGACCGGGGG TGCCGGTGTG
GCGGCAGCCA TCTGCTGGCT TTTGGAAGGT AGCTGCCAGC AAATCATTAA CGCTATGCAA
ATCGTATTGG GTAATCTTTG CTGCGTTATA TGCGACGGAG CCAAGGAAAG CTGTGCTTTA
AAAATAAGCA CTGCAGCCGT TGAAGCAGTC CGGGCAGGCT ACATGGCATG TCAGGGGATA
AACCTGGAGG CCGGTACGGG TATTGTGGGC AAAAAGTTGG AGGATACCAT GGAGCTGGTT
AGAAAGGTGT ACCAGGGAGG GCTGGGCGAA ATAGATTACT ACTTGGGCAA GGTCGATTAT
CTTCTGTCAA CCAACTAA
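
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch (the helper names are hypothetical, and this is not necessarily how the database derives its GC content field), the following joins the blocks and computes the percentage of G and C bases. Only the first display line of the sequence is used in the usage example; the full sequence would be passed in the same way.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, as an example input.
first_line = "ATGAACTTGC TTGATCAGCA AACTTTGATA AATTTATTAC ATCAAGAAGC CGATGTAGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```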
 
Protein sequence
MNLLDQQTLI NLLHQEADVA IGCTEPVMVA LAAAKTRDML GTLPRLVDIS VSSAVWKNAR 
RVGLPGTGEK GLAMAAAMGL LAPVEAGQRL LAALTPVQVE QAKILVREGV VKVGVVAAKE
GLYARAVARS NQHEAIVELN GSHKNFSALW LDGRMAGGAG ENLNLKLEAL LAQDYQSLLK
QVLSLSPEEL YFLYQGAEDI LTFAREIHQG GRNPLSAMAS FFRRTESGGE SLEVLIRNLT
GIAVAERMAG ATYPVLTCAG SGNQGILAAV SLLLAGQELR AGPESVTRAL AIAHFTNMYL
KAYTGKLSPL CGAVTGGAGV AAAICWLLEG SCQQIINAMQ IVLGNLCCVI CDGAKESCAL
KISTAAVEAV RAGYMACQGI NLEAGTGIVG KKLEDTMELV RKVYQGGLGE IDYYLGKVDY
LLSTN