Gene Lferr_0436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_0436 
Symbol 
ID6876388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp413215 
End bp414165 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content62% 
IMG OID642788309 
Productbiotin synthase 
Protein accessionYP_002218897 
Protein GI198282576 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000726909 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACAACA GCACAGCACT GCAGACTCTC GATGCCATCC TCGAAATCTA TGCCCGCCCA 
TTCAACGATC TGATTTATGC GGCGCAACAG GTGCATCGGC TACACTTTGA CCCAAACGCC
ATCCAGTGCT CCACCCTCCT CTCCATCAAG ACCGGCGGTT GCCCCGAGGA TTGTGGCTAC
TGCTCCCAGA GCGTCCATCA CCAGACGGCC CTCCAGGCCG AACCGCTGAT GGATCTTGAA
CAGGTGCGTG CCGCCGCGCG CGAGGCCAAA GCCAATGGGG CCCAACGCCT GTGCATGGGG
GCTGCCTGGC GCTCGCCCCA CGACCGGGAT ATCGAAAAAG TAGCCGCTAT GATCGGCGTG
GTCAAAGAGT ATGGCCTGGA AAGCTGCGTG ACGCTGGGGA TGCTCAAACC GGGGCAGGCG
GAACGCCTAC AGCATGCTGG CCTCGACTAC TACAACCACA ATCTCGATAC CTCCCCCGAG
TTTTATGGTG AGGTCATCCA CACCCGCAGT TATCAGGACC GCCTCGACAC CCTGGAGGCG
GTGCGTGACG CCGGTATCCG GATTTGCAGC GGCGGCATCC TCGGTATGGG AGAATCCCGC
CGGGACCGGG CGCGGATGCT GCAGGTGCTC GCGCAGTTGC CTCAGGCTCC GGAGAGTATC
CCCATCAACG CCCTGGTGCC CATTCCCGGC ACCCCGCTGG AGGCTGCGGA GCCCATTGAC
GGCTTCGAAT TCGTGCGCAC CGTCGCGGTC ACCAGAATCC TCTTCCCCAA GGCCTACGTA
CGGCTTTCGG CAGGGCGCGA AGCCATGAGC GACGAATTAC AGGCACTCGC CTTCCTGGCC
GGTGCCAACA GCATTTTTCT TGGGGATCGC CTGCTGACGA CAGGGAATGC CAGCACAGGG
CATGACCAGG CCCTGTTCAA CCGACTGGGC CTGCACCGCA GCGCGGACTG A
 
Protein sequence
MNNSTALQTL DAILEIYARP FNDLIYAAQQ VHRLHFDPNA IQCSTLLSIK TGGCPEDCGY 
CSQSVHHQTA LQAEPLMDLE QVRAAAREAK ANGAQRLCMG AAWRSPHDRD IEKVAAMIGV
VKEYGLESCV TLGMLKPGQA ERLQHAGLDY YNHNLDTSPE FYGEVIHTRS YQDRLDTLEA
VRDAGIRICS GGILGMGESR RDRARMLQVL AQLPQAPESI PINALVPIPG TPLEAAEPID
GFEFVRTVAV TRILFPKAYV RLSAGREAMS DELQALAFLA GANSIFLGDR LLTTGNASTG
HDQALFNRLG LHRSAD