Gene Moth_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0445 
Symbol 
ID3830969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp446035 
End bp447084 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content63% 
IMG OID637828380 
Productradical SAM family protein 
Protein accessionYP_429319 
Protein GI83589310 
COG category[R] General function prediction only 
COG ID[COG2516] Biotin synthase-related enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCAA CCAGTCCGGA GTACGTTAAA ACCAGTACCG CGGCGGCCAT AACCCTGGGC 
TTTCAGCCCG GTAGCTTTCA CCGCGACGCC CGGCTCACCG GCCTCAACCT CCTTCTGACT
TATAACGAAC CCTGTGCCGG CCGCTGTGCC TACTGCGGTC TCTCCGGTAA CCGCCTACCT
GACGCCGAAC CGACCTTTAT TCGCGTTGAC TGGCCGGTTT ATGCCCTGGA CGCGATCCTT
AAGGAAGTCC GGCGGGACTC TCGCGGCCTG GAGCGGGTGT GCATCGGCAT GCCGACCCAC
CGCCGGTCGT GGGATGACCT CCTTAAAGTC GTCAACCGCT GGCACCGGGA AAGCGATCTC
CTCATCAGCG CCCTCCTGAC TCCTACCGCC TGCCGCGGCC GGGATTTTTT TGAACTGCGT
GCAGCGGGTG CGGACATGGT CGGTATTGCC ATCGATTGCG CCACACCGGA ACTATTTGAA
CGTTACCGCG GCAGGGGGGT CAAAGGTCCC CATCGCTGGG AGGAGTACTG GGAGGGGGTC
TCCCGGGCCG TAACCGTCTT TGGCCGCGGC CGGGTCGGCA TCCATCTTAT CGTTGGCCTG
GGGGAAACCG AGGCCGAGAT GATCCAGACC ATCCAGAGGG CCCAGGATAT GGGGGTCAGA
ACCCACCTCT TCAGCTTTTT CCCGGAAACC GGCACGATTC TGGCCCGCCG CCGCCAGCCG
CCCCTGGGCC AGTATCGCCG GGTCCAGCTG GCCCGTTATA TCATTAACGA GGGCCTGGGG
CGGGCTGAGG ACATGACCTT TAATGACGCC GGCCAGGTGA TGGATTTCGG GATGGATATC
ACCCCCCTGG TCAAAGCCGG GGAAGCCTTC CGGACCTCCG GTTGTCCGGG GAAGGATGGT
CGCACAGTAG CCTGCAACCG GCCCTACGGT AACGAACGTC CCTCCCAGGC CATCCGCAAT
TTCCCTTTTG CCCCGGAACC CGGGGATATC CGGGCCGTCG AGCGCCAGCT CCGGCAGGGT
CTTAAGGGGG CCGTTGCCCA TGCCGGTTGA
 
Protein sequence
MPATSPEYVK TSTAAAITLG FQPGSFHRDA RLTGLNLLLT YNEPCAGRCA YCGLSGNRLP 
DAEPTFIRVD WPVYALDAIL KEVRRDSRGL ERVCIGMPTH RRSWDDLLKV VNRWHRESDL
LISALLTPTA CRGRDFFELR AAGADMVGIA IDCATPELFE RYRGRGVKGP HRWEEYWEGV
SRAVTVFGRG RVGIHLIVGL GETEAEMIQT IQRAQDMGVR THLFSFFPET GTILARRRQP
PLGQYRRVQL ARYIINEGLG RAEDMTFNDA GQVMDFGMDI TPLVKAGEAF RTSGCPGKDG
RTVACNRPYG NERPSQAIRN FPFAPEPGDI RAVERQLRQG LKGAVAHAG