Gene Anae109_3562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3562 
Symbol 
ID5378141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4177100 
End bp4178206 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content73% 
IMG OID640845084 
Productbiotin synthase 
Protein accessionYP_001380727 
Protein GI153006402 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0246986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGAGA CCGCCAGGAC CCTTCCGCCG GACATCGTCC CGATCACCGT CGAGGAGGCG 
AGGACGCTCA TCCGCCACAC CGGCGGCCCG GCGCTCGACG CCCTGCTCGA CCGCGCCGAC
GCCGTGCGGC GCGCCGTCCA CGGAGACGAG GTCGCGCTGT GCGGCATCAC CAACGCGAAG
AGCGGGCGCT GCCCGGAGAA CTGCGGCTTC TGCTCCCAGT CCGCGCACTT CCCGGGCGCG
GACGCCCCGG TGTACCCGCT CGTGCCGGCC GAGGAGATGG TGGCCCAGGC GAAGGTGGCC
GAGCGCGCCG GAGCGCGCGA GTTCTCCATC GTCACCTCGG GCACGCGCGT CGCGAAGGAG
AGCGAGCTCG CCACGATCGA GGAGGCGGTG CGCAAGCTCC GGGCGGAGAC GGCCGTCGAG
CCGTGCGCGT CGCTCGGCCT CGTCCGCGAG CCGGAGCTCG TCCGGCTCAA GGCTGCCGGG
CTCATGCACT ACCACCACAA CCTCGAGACG GCCCGCAGCT TCTTCGAGAA CGTCTGCACC
ACGCACACGT ACGACGAGCA GCTCGAGACC ATCCGCGCCG CGAAGCGGCA GGGCCTGAAG
CTCTGCTCGG GCGGGATCCT CGGCATGGGC GAGACCCCGG AGCAGCGGGT CGAGCTCGCC
GCCACGATCC GCGAGCTCGG GATCGACTGC GTGCCCATGA ACTTCCTGAA CCCTCGTCCC
GGCACGCCGA TGGAGCACGT GCAGGCGATC ACGGCGGAGG AGTGCCTCGC CGCCGTGGCG
GTGTTCCGGC TCATGATGCC CGCGGCGCAC ATCTTCGTGA TGGGCGGGCG CGAGGTGAAC
CTGGGCGGGC GGCAGCACCT CATCTTCCGC GCCGGCGCGA ACGGCACCAT GGTCGGCAAC
TACCTGACGA GCGCAGGACG GGGACCGGGC GAGACGGTGC GGATGGTGGA GGAGCAGGGC
CTGCGGCTCA GGGCGCCGGA CACCGGGCGC GAGTGGGCCT TCGACGGGAG CGCCCCCGCC
GAGGCGGAGT GGAACCGCCG CGCCGCCGAG CCGGGCGGCA AGCGCGGGCT GCCGGTGGTG
GGCCCGCCGC GCGGCGGCTG CGCCTAG
 
Protein sequence
MCETARTLPP DIVPITVEEA RTLIRHTGGP ALDALLDRAD AVRRAVHGDE VALCGITNAK 
SGRCPENCGF CSQSAHFPGA DAPVYPLVPA EEMVAQAKVA ERAGAREFSI VTSGTRVAKE
SELATIEEAV RKLRAETAVE PCASLGLVRE PELVRLKAAG LMHYHHNLET ARSFFENVCT
THTYDEQLET IRAAKRQGLK LCSGGILGMG ETPEQRVELA ATIRELGIDC VPMNFLNPRP
GTPMEHVQAI TAEECLAAVA VFRLMMPAAH IFVMGGREVN LGGRQHLIFR AGANGTMVGN
YLTSAGRGPG ETVRMVEEQG LRLRAPDTGR EWAFDGSAPA EAEWNRRAAE PGGKRGLPVV
GPPRGGCA