Gene Athe_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2647 
Symbol 
ID7407011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2779187 
End bp2780242 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content39% 
IMG OID643717016 
Productbiotin synthase 
Protein accessionYP_002574485 
Protein GI222530603 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000036845 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAA AAGATATTTT GGAAAAAGCA TGTTATGAGA ATATTCTCAC AAAAGATGAG 
ATAAAACTTT TGCTGATGGC AGAAGGTGAT GATAAAGAAC TTCTTTTCAA AACAGCTGAT
AGTGTAAGAA AAGAACATGT TGGAGATGAG GTCTTTTTAA GAGGGCTTAT TGAATTTTCA
AGCTACTGCA AAAACGACTG TTTTTACTGT GGTCTGAGAC GAAGCAATAG CCAAGCTCAG
CGTTACAGAA TGCAGGAAGA TGAGATTGTA GAGGTTGCGA AAAGGGCGTA TCAGATGGGG
TACCGCACGG TTGTATTGCA GTCTGGTGAG GATATGTATT ACACCAAAGA CATGCTGTGT
TCAATTATAA AAAAGATAAA AAGTAGCGTG GATGTTGCTA TAACACTTTC AATTGGTGAA
AGGTCATATG ATGAGTACAA GGCATTCAAA GATGCCGGAG CAGACAGGTT TTTGATGAGA
TTTGAAACTT CAAACAAAAA GCTATATAGA AAATATCATC CCGGAATGAG CTTTGAAAAC
AGGATAGAAT GTCTCAAATG GATAAAAAAT CTTGGGTATG AGCTTGGGAC AGGTTTTTTG
ATAGGTCTTC CGGGGCAAAC TATTGATGAT TTGGCACAGG ATATACTTCT TGTAAAAGAG
CTGGATGCAG ATATGATAGG CATAGGACCT TTTATTCCTC ATCCACAGAC GCCTCTAAAA
GATGCAGAGG AAGGTTCGGT GGATTTAACT TTAAAGAGCA TTGCCATTTT GAGGCTTTTG
ATTCCAGATG CTAATATTCC TGCAACAACT GCGCTTGGCA CTTTAGACCC TCTTGGAAGA
CAAAAAGGTC TCATGTGCGG TGCAAACATT GTGATGCCAA ATGTAAATGA CCTTGAGTAC
AAGCTCAAAT ATGAGTTGTA TCCTGGAAAG ATTTGCATAA ATGAAGATGC GACAAAGTGC
AGAGGTTGTA TTGAGTCAAT TATAGTTTCG CTTGGTAGAA AAGTTGGACA GGGAAAAGGA
CAAAGCAGGC ATTACAAAAG AGCTGCTGCG TCTTAA
 
Protein sequence
MKVKDILEKA CYENILTKDE IKLLLMAEGD DKELLFKTAD SVRKEHVGDE VFLRGLIEFS 
SYCKNDCFYC GLRRSNSQAQ RYRMQEDEIV EVAKRAYQMG YRTVVLQSGE DMYYTKDMLC
SIIKKIKSSV DVAITLSIGE RSYDEYKAFK DAGADRFLMR FETSNKKLYR KYHPGMSFEN
RIECLKWIKN LGYELGTGFL IGLPGQTIDD LAQDILLVKE LDADMIGIGP FIPHPQTPLK
DAEEGSVDLT LKSIAILRLL IPDANIPATT ALGTLDPLGR QKGLMCGANI VMPNVNDLEY
KLKYELYPGK ICINEDATKC RGCIESIIVS LGRKVGQGKG QSRHYKRAAA S