Gene Moth_1735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1735 
Symbol 
ID3833035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1786255 
End bp1787307 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content65% 
IMG OID637829659 
Productbiotin synthase 
Protein accessionYP_430579 
Protein GI83590570 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00262623 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.855022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGACCGG AATTTGCTGC AAGCTGGCAG CGGGCGGCAG CAGGTGAAGA ACTCGATCGT 
GAAGATATTG TTAATCTCCT CGCCGCCACC CCCGGTGAAG AGGAGGAAGC CCTCTACCGG
CTGGCGGACG GCGTCCGCGC CCGGATGGCG GGGGACGAGG TCCACCTGCG CGGTGTCATC
GAGTTTTCTA ATTACTGTCG TCGTCGCTGT TGCTACTGCG GCCTCCGGGC TGATAACGCC
CGGTTGCACC GCTACCGCCT GATGCCGGAG GATATAGTAG CTGTGGCCAG ACAGGGAGTA
GAGCTGGGCT ATGGTACCAT CGTCCTCCAG TCCGGGGAGG ACCCCTGGTA TACGGCCCCG
GTCCTGGCAG GGATCGTCCG GGAGATTAAA GAAATGGGGG TGGCCGTGAC CCTGTGTGTA
GGCGAGCGCT CCCGGGAGGA GTACGCCCTG TGGCGGGAGG CCGGGGCTGA CCGCTACCTC
TTGAAACACG AAACGGCCAA CGAAGAACTT TACGCCCGCC TGCACCCCGG CATGAGCTGG
CAGGAACGCC TCCAATGCCT CCAGTGGCTG CGGGAACTGG GCTACCAGGT GGGCTCCGGC
AACATCATCG GCCTGCCGGG CCAGACCCTG GCCGACCTGG CCGACGACTT GCTCCTCCTG
CGACAGCTGG ATGTGGAGAT GGCGGGTTTG GGGCCCTTTA TCCCCCACCC GGCAACACCC
CTGGCCGGGG AGCCGGCCGG GAGTTTGGAA CTAACTTACC GGGTGGTGGC TACGGCCCGC
CTGGTCATTC CTTTCGCCCA CCTGCCGGCC ACCACAGCGG TGGGTACCCT GGCGCCCAAC
GGCCGCCAGA AGGCCCTGCA GCGGGGGGCC AACGTCATCA TGCCCAACCT GACCCCCACC
CGCTACCGGG CCGACTACCA GATCTATCCC AACAAGATTT GTATCAATGA GGGACCGGAG
GATTGCCGCT ACTGCCTGGA GGGCATGGTC CGCGCCTTAG GGCGACGCCT GGGCCGGGGA
CCGGGGCATA CCTTGAAGCC CATTCCTGCT TAG
 
Protein sequence
MRPEFAASWQ RAAAGEELDR EDIVNLLAAT PGEEEEALYR LADGVRARMA GDEVHLRGVI 
EFSNYCRRRC CYCGLRADNA RLHRYRLMPE DIVAVARQGV ELGYGTIVLQ SGEDPWYTAP
VLAGIVREIK EMGVAVTLCV GERSREEYAL WREAGADRYL LKHETANEEL YARLHPGMSW
QERLQCLQWL RELGYQVGSG NIIGLPGQTL ADLADDLLLL RQLDVEMAGL GPFIPHPATP
LAGEPAGSLE LTYRVVATAR LVIPFAHLPA TTAVGTLAPN GRQKALQRGA NVIMPNLTPT
RYRADYQIYP NKICINEGPE DCRYCLEGMV RALGRRLGRG PGHTLKPIPA