Gene Afer_1859 details

Gene Information

Locus tag: Afer_1859
Symbol:
ID: 8323954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 1945777
End bp: 1947426
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 68%
IMG OID: 644952991
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_003110446
Protein GI: 256372622
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
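
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1945777, 1947426)
print(length_bp)                  # 1650
print(protein_length(length_bp))  # 549
```

Both results agree with the Gene Length and Protein Length values listed in the record.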


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCA TTCCTGTCGC GTCATTGCGG CGCCGTCACG TCGACGAAGC GCCAACCGAT 
CACGTCGTAC AAGCTGCACA CCCGCAGCAG CGCCGCGTGG TTCGTGAGGC TCCGCCACTG
CCCCATGACC TCTCTGGCGA CGACGATACC GTCCTCGTCC CCTTCTTGGA GGTGGCGACC
GCCGAGGGTC CCGTCATCCT GCCGACGAGC CTGCCCGGTG CCCCACTCGG ATCAGACGGC
CTACCACAGG TCCGGGCACA GGTCCTCCGC CATCGCTATG GCGCCACCGT GACCCGTACG
CTGCCTGGGG GGCCGCTGCC CAACGGTGCT CCCGCCCGTC CCACGTACGC CGCCAACCAG
ATCCTGACGC AGCGGACCCT CGCGCGCCGC GGATTTGGCT CGTGGGAGCT GCGCTTCGTG
GCAGCCCGAG AGCAGGTCAC GCTCGAGCGC GTGCTCGACG ACGTCGCGAG CGGGCGGGCG
GTCGTGCCGG CAAATCCCAA CCATCCGGAA CTCGAACCGA CGATCATCGG CCGGAACTAC
CGGACCAAGG TGAACGTCAA CCTCGGGGCG TCGAGCGTAC GCGCCGACAT CGACGAAGAA
CTCGCCAAGG TCGCGGTGGC GCTCCAGGGT GGCGCCGACA CCATCATGGA CCTCTCCACC
GGGACCAAGC TCGCCTGGGT GCGCGAGTGG ATCCTGCGCA ACAGCTCGGT TCCGGTGGGC
ACCGTGCCGA TCTACGAAGC GCTCGATCGC GTCGGTGGTC GGCCCGAGCG GCTGAGCTGG
TCGCTGTTCG CCGAGGTGCT CGAGGAGCAG GCGCTCCAGG GGGTCGACTA CGTCACGGTG
CACGCCGGCC TGCTCCGCGA CTACGTGCCG CTCGCGGCAC GCCGGACCAC CGGTATCGTC
TCGCGCGGCG GCTCCCTCAT GGCCGCATGG AGTCAGCATC ACCAACGCGA GAACTTCTTG
TTCGAGCACT TCGATGAGCT GCTTGCTATC GCCGGTCGCT ACGACCTCAC GCTCTCGCTC
GGCGACGGCC TGCGGCCCGG CTCCACCGCA GATGCCAACG ACGAAGCTCA GCTGGCCGAG
CTCCGCACGC TCGGCGACCT CGCACGACGA GCTGGAGAGG TCGGCGTCCA GGCCATGATC
GAAGGGCCAG GCCACGTCCC ACTCCACAAG ATCGCGGAGA ACGCACTGCT CGAGCAAGAA
CTCTGCGACG ATGCGCCGTT CTACACCCTC GGACCCCTCG CGATCGATGT TGCGGCGGGG
TGGGATCACG TCTCATCGGC GATCGGAGCT GCCGTGATCG GAGCGCACGG GGCAGCCATG
CTGTGCTACG TCACGCCCAA GGAGCACCTC GGCCTCCCGA ACCCGGAGGA CGTGCGTGCG
GGGCTCATCG CCTATCGCAT CGCCGCGCAC GCAGCCGATC TCGCCAAGGG GATCAACGGC
GCACAGGCGT GGGACGATGC CATGAGCCGA GCCCGCTACG AGTTTCGCTG GAACGACCAG
TTTGCCCTCG CCATCGATCC CGCAGGAGCG CAGGCACACC ACGACGAAAG CCTGCCAGCG
CGCGCAGCCA AGGATGCCGA GTTCTGCTCC ATGTGTGGGC CGAACTTCTG CGCGATGGCA
CACTCGCATC GCGCGCTTCA CGACGCCTGA
 
Protein sequence
MNTIPVASLR RRHVDEAPTD HVVQAAHPQQ RRVVREAPPL PHDLSGDDDT VLVPFLEVAT 
AEGPVILPTS LPGAPLGSDG LPQVRAQVLR HRYGATVTRT LPGGPLPNGA PARPTYAANQ
ILTQRTLARR GFGSWELRFV AAREQVTLER VLDDVASGRA VVPANPNHPE LEPTIIGRNY
RTKVNVNLGA SSVRADIDEE LAKVAVALQG GADTIMDLST GTKLAWVREW ILRNSSVPVG
TVPIYEALDR VGGRPERLSW SLFAEVLEEQ ALQGVDYVTV HAGLLRDYVP LAARRTTGIV
SRGGSLMAAW SQHHQRENFL FEHFDELLAI AGRYDLTLSL GDGLRPGSTA DANDEAQLAE
LRTLGDLARR AGEVGVQAMI EGPGHVPLHK IAENALLEQE LCDDAPFYTL GPLAIDVAAG
WDHVSSAIGA AVIGAHGAAM LCYVTPKEHL GLPNPEDVRA GLIAYRIAAH AADLAKGING
AQAWDDAMSR ARYEFRWNDQ FALAIDPAGA QAHHDESLPA RAAKDAEFCS MCGPNFCAMA
HSHRALHDA
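
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below shows one way to reproduce that translation and to compute a GC percentage from the sequence blocks as printed here (groups of 10 bases separated by spaces). It is an illustration under stated assumptions, not the method used to generate this record: the helper names are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs only in which codons may act as start codons, which is not handled here).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence above.
print(translate("ATGAACACCA TTCCTGTCGC G"))  # MNTIPVA
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the protein sequence and the GC content field in the record.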