Gene Athe_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2541 
Symbol 
ID7409411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2659674 
End bp2660687 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content34% 
IMG OID643716905 
Productbiotin synthase 
Protein accessionYP_002574382 
Protein GI222530500 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00179392 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGAATT TTCTTCAATC AGTTCAATTT GTCAAAGAAG TTGAAAAAAA GATAATTGAA 
TATGACAAAG ACATTGCCTT CAATGAGGCT ATCATACTTT ATGAAATCGC AAAACATGAT
GCAGATTTGG TAAAAAATCT CGCCAGTACG ATAAACCAAC ATTATTTTAA AAATACTATT
GAGCTTTGCT CCATTTATCC TGCAAAGGTA GGACTTTGCC CGCAAGATTG CAAGTTCTGT
TCCCAGTCTA TCCATCACAG TTGTTTAATT GAAATAAAAG ATCTTGCTGC GCTTGATGAA
GTAATAGAGT ATCTTGAGTA TGTAATATCT TTTCCAATAA AAAGATTTTG CTTAGTCACA
AGTGGTGAAA AGCTTGATGA CTCAGAATTT GAAAAAATTT TAGACATCTA TTCACACATC
TCAAAGAATT ATAACATACT TCTGTGCGCA TCACTCGGCT TTCTTACTCA AGAAAGGGCA
AAAAAGCTAC TTAAAGTTGG AGTTGTAAAG TATCACAATA ACTTAGAGAC ATCCAGCACA
TATTTTAAAA ATATCTGCTC CACCCACACT CAACAGCAAA AGATAGAGAC TTTAAAAATT
GCAAAAGAGG CAGGGCTTCA AATCTGTAGC GGTGGAATAA TCTCAATGGG TGAGGACATG
ATTGAAAGAA TCAAACTTGC ATTTGAACTA AGAGAATTAG ATGTTGACTC TGTTCCAATC
AACATATTAA ACCCAATAAA AGGCACGCCT TTAGAAGATA TAAAGATCAT AGACAAAAAC
GAAATTTTTA TCACCCTGGC ACTATTTAGG ATTGTGCTAC CAAAAAAGAC AATTCTTCTT
GCAGGTGGAA AAGAAAATGC GCTTGGAGAT ATGGAAAAAA TGGCATATGA GTGTGGCGTA
AATGGTTGTA TGGTTGGAAA TTATCTTACA ACAAGGGGAA TGGGAATAAG AGAGAAGATT
GAGATGTTGG AATCCTTGGA TTTAAAGTTT CAAACCAATA TGCATAATAA TTAG
 
Protein sequence
MLNFLQSVQF VKEVEKKIIE YDKDIAFNEA IILYEIAKHD ADLVKNLAST INQHYFKNTI 
ELCSIYPAKV GLCPQDCKFC SQSIHHSCLI EIKDLAALDE VIEYLEYVIS FPIKRFCLVT
SGEKLDDSEF EKILDIYSHI SKNYNILLCA SLGFLTQERA KKLLKVGVVK YHNNLETSST
YFKNICSTHT QQQKIETLKI AKEAGLQICS GGIISMGEDM IERIKLAFEL RELDVDSVPI
NILNPIKGTP LEDIKIIDKN EIFITLALFR IVLPKKTILL AGGKENALGD MEKMAYECGV
NGCMVGNYLT TRGMGIREKI EMLESLDLKF QTNMHNN