Gene Moth_2512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2512 
Symbol 
ID3832784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2618215 
End bp2619405 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content57% 
IMG OID637830435 
Producthypothetical protein 
Protein accessionYP_431337 
Protein GI83591328 
COG category[S] Function unknown 
COG ID[COG1641] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00299] conserved hypothetical protein TIGR00299 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0156591 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGATAG CCTACTTTGA TTGTTTTTCC GGAATTAGCG GTGATATGTG CCTGGGAGCG 
TTAATAGCCT GTGGCCTCAG CCAGGACGAG CTAACCTCTG GCTTAAAAGG ACTGGGGCTC
GAGGGATGGG AATTAAGGGT TAGGGAGGTA AAGCAACACA GCATTGCCGC CACCGATGTC
GCTGTCCAGG TGACAGGGAG CCAGCCCCAC CGCCACCTGG CGGATATCCT GGGGCTGATC
AATAACAGTT CTTTGCCTGC CCCGGTTAAG GAAAAATCTG CGGCGGTATT TAAAAACCTG
GCCCGGGCCG AAGGCCAGGT ACATGGCATC GACGCCAGCC AGGTCCACTT TCATGAAGTC
GGGGCGGTAG ACGCCATTAT CGACATCGTC GGCAGTATCC TGGGGTTGCA CCTCCTGGGT
ATAGAGAAAG TCATCTCCTC CCCCTTACCT GCTGGTTCCG GCTGGGTGGA CTGCCGGCAC
GGCAAATTAC CAGTTCCCGC CCCGGCAACC CTTTACCTTC TCCAGGGCTA CCCGGTTTAT
GGTACTGAAG ATAAAGCCGA GCTGGTAACC CCTACCGGCG CGGCCTTGAT TACCACCCTG
GCCGACAGCT TTGGCCCCTT TCCAGCCATG AACCTGACCA GGGTCGGTTT CGGTGCCGGA
AAAACCGAAC TTCCCCATCC CAACCTCCTG CGCCTGGCCC TGGGTGAGAT CAACAGCGGG
CAGCTGGAAG GAGAGGAAAG CAGCCTGGTT ATCGAAACAA CCATCGACGA TATGAACCCC
GAATTCTTTC CCGCCCTCCT TGAGGAGACC ATGGCCGCCG GCGCTGTTGA TGCCTTCTTC
ACCCCGGTAC AAATGAAAAA AGGCCGACCC GGGATCCTCT TTACGGCCCT CTGTCCGGAG
AATAAACTGG CCGCTGTTGC GGCTGCCATC TTTACCCATT CCAGCACCCT GGGGTTACGT
TTTCGCCGGG ACCAACGCCT GGTATGCCAG CGACGGATGG CTGAGGTAGT CACCCCTTAT
GGCACTGTCC CCGTTAAACT GGGCCTCTAC CGTGATCCCA CAGGACAGGT TATTACCAAC
ATCGCACCCG AATATGAATC CTGCCGTCAG ATTGCCAAGT CTGCCGGCGC CCCCCTGAAG
GAAGTCTATG CTGCTGCCCT GGCCGCCGCC AGGGCGCTAA AGGCTTTTTA A
 
Protein sequence
MKIAYFDCFS GISGDMCLGA LIACGLSQDE LTSGLKGLGL EGWELRVREV KQHSIAATDV 
AVQVTGSQPH RHLADILGLI NNSSLPAPVK EKSAAVFKNL ARAEGQVHGI DASQVHFHEV
GAVDAIIDIV GSILGLHLLG IEKVISSPLP AGSGWVDCRH GKLPVPAPAT LYLLQGYPVY
GTEDKAELVT PTGAALITTL ADSFGPFPAM NLTRVGFGAG KTELPHPNLL RLALGEINSG
QLEGEESSLV IETTIDDMNP EFFPALLEET MAAGAVDAFF TPVQMKKGRP GILFTALCPE
NKLAAVAAAI FTHSSTLGLR FRRDQRLVCQ RRMAEVVTPY GTVPVKLGLY RDPTGQVITN
IAPEYESCRQ IAKSAGAPLK EVYAAALAAA RALKAF