Gene Moth_1019 details

Gene Information

Locus tag: Moth_1019
Symbol:
ID: 3832639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1047621
End bp: 1049006
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 48%
IMG OID: 637828947
Product: hypothetical protein
Protein accession: YP_429876
Protein GI: 83589867
COG category: [S] Function unknown
COG ID: [COG1434] Uncharacterized conserved protein
TIGRFAM ID:
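
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based, inclusive coordinates and the final codon is a stop codon. The following is a minimal Python sketch of that arithmetic under those assumptions; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1047621, 1049006)
print(length)                  # 1386
print(protein_length(length))  # 461
```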


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000506367
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000514024
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAAAAAA ACATATCCAT AGCAATAATT TTTGCCGCCT TAATTACCTT GATATTAGGT 
AGTACGTACT GGTATATCGA ATGGTTTGGG TGGAACACGG CTCCCAAGCG AGCTGATGTG
ATTATAGTGT TAGGTGCTGC TGTTTGGGCG AATGGTCCAA GCCCGGCTTT AATGGAACGT
ATTACACTGG CCGAAACTCT TTATCAACAG GTATATGCAG CGACGCTAAT TACGACAGGT
GGCATTGGGA GATCTAATCC TACCCCGGAA GGAAGCGCCG CCCGGCAGGT TCTTATTTCC
CATGGTATAC CTGCCAATGT AATTTATGAG GAAGTCACTT CGAGCAACAC CAGGGAAAAC
CTGGTTGGGG CCCTCAACAT TATGCGCAAA CACGGTTGGA AAAGTGCCGT TATCGTCACC
CATGATTTTC ACCTTTTACG GGCCATGACC GAAGCTCGCC GGCTAGGCAT AGAGGTTTCC
GGCGCCGGTG TCCATGAAAC AGCCATGTTT AGGCCGCCGC TGGTACTACG AGAGGTAATC
GCTAACCTGG TTAAAGCGAT CGGGTATAAC TTGCAAATGT ACCGCATGGA AGAAGGGGAT
AATGTGCTCG CCGGCAACAA TCGGATCCTG TTGAAAGCAT TAATCGTTAT GACCTTACTA
TCAATCCTCG CCTTTACAGC CTGCGCCCGA TCTTCTCAAT CGCCAAAAGA AATCGAGGAG
GCAGTAGCCC TTGTAAATGG TCAACCAATA AATAAGGAAG CTCTGGAAAA AGAAATGCTT
AGAATGCAAT TAATGGCTGA AATGAGGGTT CAATCAGGAA CTGTTTCTAT AGACGAGTTT
CTTAAGCAAT CCGGACGGGA CTGGTCCAAG ATGTCGCCAG AAGAAAAGCG TTACTACCTG
CGGGCAAAAC GTCAAAGCGA AATGACAGGG GAGAAGAATG AGGCTTTTAA CCGGCTGGTG
CGGGAAGAGG TTCTGTACCA GGAAGCGGTT AAAGAGGGAT ATGAAGTTTC TATAGACGAG
GCCCGGCGGC GTTACCAGGA AATAGAGACC CTTTCCCAGG AATCCCTAAA AGAGGCGCTT
AAAGACGCAA AGGCCAAGGA AGAGATAGAA AGGCTGCAAG AGGTTGAAAA GAAGTTCATG
GAATTGATGG GCTTTACCAG TCCGGAAGCG CTAACAGAAT ACCGGGTGCA AAGGCTCATG
CGAACCATGC CCATTAGCCG TTTACGGGAA AAGTTTAAAG CGGATTGGGG CAATAAACAC
CCGGAAATCC GCGGGGACGA GTTCCGGTAC ATGGTTGAAA ATCGCTGGGA AGATTATACT
AACGAACTTT TGCGCCAGGC TAATATTCGT ATCAAAGACA AAGACCTCGA GGTTATTTAT
GAGTAG
 
Protein sequence
MKKNISIAII FAALITLILG STYWYIEWFG WNTAPKRADV IIVLGAAVWA NGPSPALMER 
ITLAETLYQQ VYAATLITTG GIGRSNPTPE GSAARQVLIS HGIPANVIYE EVTSSNTREN
LVGALNIMRK HGWKSAVIVT HDFHLLRAMT EARRLGIEVS GAGVHETAMF RPPLVLREVI
ANLVKAIGYN LQMYRMEEGD NVLAGNNRIL LKALIVMTLL SILAFTACAR SSQSPKEIEE
AVALVNGQPI NKEALEKEML RMQLMAEMRV QSGTVSIDEF LKQSGRDWSK MSPEEKRYYL
RAKRQSEMTG EKNEAFNRLV REEVLYQEAV KEGYEVSIDE ARRRYQEIET LSQESLKEAL
KDAKAKEEIE RLQEVEKKFM ELMGFTSPEA LTEYRVQRLM RTMPISRLRE KFKADWGNKH
PEIRGDEFRY MVENRWEDYT NELLRQANIR IKDKDLEVIY E
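
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration, not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares for all non-initiation codons; `translate`, `CODON_TABLE`, and the other names are hypothetical helpers. Only the first row of the gene sequence is used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
# Table 11 differs from the standard table only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed in the record (spaces are ignored).
first_row = "ATGAAAAAAA ACATATCCAT AGCAATAATT TTTGCCGCCT TAATTACCTT GATATTAGGT"
print(translate(first_row))  # MKKNISIAIIFAALITLILG
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with the full protein sequence (after removing spaces) would extend the same check to the whole record.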