Gene Moth_0443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0443 
Symbol 
ID3830967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp443402 
End bp444994 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content64% 
IMG OID637828378 
Productbiotin/lipoate A/B protein ligase 
Protein accessionYP_429317 
Protein GI83589308 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0095] Lipoate-protein ligase A 
TIGRFAM ID[TIGR00545] lipoyltransferase and lipoate-protein ligase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCAGT GGCGTTTGCT GGATACCGGC AGCCGGACGG CGGCCGAGAA TATGGCCCTG 
GACGAGGTCC TGTTAACAGC CCGTTCCCAG GGGCAGGCCC CGGATACTTT GCGCTTCCTG
CAGTTTAACC CGCCCTGCGT CCTGGTGGGC TTTCACCAGG TGGTAGAGCA AGAAGTAAGG
CTGGAGTACT GCCGCCGGGA GGGGATAGAG ATCAACCGCC GCATTACCGG CGGCGGCGCC
CTTTTCTGGG ACACCAACCA GCTGGGATGG GAAGTTATAA CCACCCTCGA CTACCCCGGG
GTTGCCAGGC GGCTGGAGGG GCTCTATGCC CAGCTCTGCG GCGCCGTGGT CCGGGCCCTT
AAGCGGTTGG GGGTACCGGC GGCCTACCGC CCCCGCAACG ACATCGAGGT CGGGGGCCGC
AAGATCTCCG GTACCGGCGG GACAGAACTG GGAGGAGCCT TTCTCTACCA GGGCACCCTG
CTCATCGACT TTGACGTCGA AACCATGCTC CGGGCCCTGC GCATTCCCAC AGAAAAGCTT
AAGGCCAAGG AGATCGCCTC CCTTAAAGAG CGGGTAACCT GCCTCAAGTG GGAACTGGGC
CGGGTGCCAC TCCTGGAGAC TATCAAACAG GTTATAGCCG AAGAGTTCTG CCGCGAGTTC
GCCATGGAAC TGATCCCCGC CGGCCTGACG CCGGCGGAAG AAGCTCTCCT GGCCCATCAA
CTGCCCTATT TCCAGTCGGA GGAGTGGATT AACGCCGTCC AGGGGCCGGA GGGGCGGACG
GAACTGCGGT CCAGCCGGCG CACCCGTGGT GGTTTCTTGC GCAGTTCCCT GGTCCTGGGG
CCGGGCAACA GCCGCATCGA GAGCCTCTAC CTGACGGGGG ACTTCTTCGC CCATCCCAGG
CGGTCCATTT ACGACCTGGA GGCGAGGCTT AAAGGCCTGC CGGCCGACCC GGTGCTTATC
AGCCGGCAGG TAGAAGAATT CTTCCGGGAA AGCGGCGCCC GCCTGCCGGG CATTAAAGCC
GCGGAGGTGG CAGCTGCCAT CAACGACGCC CTGGTCAAGA AGGATTACCC CCGGCAGGGG
ATACCGGCGG CGGCTGTCAA TGACGTCTTC ACCGTCGTTA AACCCCTGGA GGAAATAACG
GCGGCGCCGG TAGTCCTCTT GCCCTATTGC GCCAAACTGC CCACCTGCCG CTTCCGCGGC
CGCCAGGGGT GCAGCGAGTG CGGCCGTTGC GACATTGGTA CAGCCTATGC CCTGGCCAGG
CAATATGGCC TGGAACCCCT GACCATCCAG AATTACGAAA TGCTGGCCCG GGTCTTACGC
CGGCTGCAAC GGGAGGGGGC GCCGGGATTT CTGGGCAGTT GCTGCGAGGC CTTCCTGGCC
AAGCACCGGC GGGACCTGGA ACGGATCGGC CTGCCGGGTA TCCTCCTGGA TATTGACAGC
TCCACCTGCT ACGAACTGGG CCAGGAGCGC GCCGCCCATG CGGGCCGTTT TGAAAACCAG
ACTACCCTCA AGCTGGATCT CCTGGAGCTC CTCATGGCCC GGGTAGCCCC CGGTAAGGCC
CGGCGGCAGG TGGCGGTGGC AGCCCATGCT TAA
 
Protein sequence
MRQWRLLDTG SRTAAENMAL DEVLLTARSQ GQAPDTLRFL QFNPPCVLVG FHQVVEQEVR 
LEYCRREGIE INRRITGGGA LFWDTNQLGW EVITTLDYPG VARRLEGLYA QLCGAVVRAL
KRLGVPAAYR PRNDIEVGGR KISGTGGTEL GGAFLYQGTL LIDFDVETML RALRIPTEKL
KAKEIASLKE RVTCLKWELG RVPLLETIKQ VIAEEFCREF AMELIPAGLT PAEEALLAHQ
LPYFQSEEWI NAVQGPEGRT ELRSSRRTRG GFLRSSLVLG PGNSRIESLY LTGDFFAHPR
RSIYDLEARL KGLPADPVLI SRQVEEFFRE SGARLPGIKA AEVAAAINDA LVKKDYPRQG
IPAAAVNDVF TVVKPLEEIT AAPVVLLPYC AKLPTCRFRG RQGCSECGRC DIGTAYALAR
QYGLEPLTIQ NYEMLARVLR RLQREGAPGF LGSCCEAFLA KHRRDLERIG LPGILLDIDS
STCYELGQER AAHAGRFENQ TTLKLDLLEL LMARVAPGKA RRQVAVAAHA