Gene Lferr_1911 details

Gene Information

Locus tag: Lferr_1911
Symbol:
ID: 6877896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1902088
End bp: 1903179
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 63%
IMG OID: 642789781
Product: Dihydroorotate oxidase
Protein accession: YP_002220339
Protein GI: 198284018
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
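
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not (the record does not state either convention). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1902088, 1903179)
print(length)                  # 1092
print(protein_length(length))  # 363
```

Both results match the Gene Length and Protein Length fields of the record.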


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTACG GCCTGCTGCG CCCTCTGCTT TTTACGCTGG ACCCGGAGCG CGCCCACACC 
CTGAGTATCG CGGCGCTGGA AGCACTGGGA CACATGCCAC GTGGCCTCGC CCGTGTGGCC
CGGCGCTACA CCGCCCACGA CCCGCGTCTG GCGCAGGACT TTTGGGGCCT GCATTTTGCC
AATCCCGTCG GCCTCGCCGC CGGCTATGAC AAGGATGCCC GCGCCACCGC CGCGCTCCCC
GCTCTCGGCT TCGGCTTCAT CGAAATCGGC ACAGTGACCC CGCGCCCGCA GCCTGGTAAT
CCGCGCCCCC GGGTCTTTCG TTATCCGGCG CAGCAGGCGG TCATCAACCG CATGGGTTTC
CCCGGTGAAG GGGCTGCGGC GGTTGCCCGA AGACTGGCAG CATTACCTGG CCATCCGGTG
CCTATCGGCA TCAATCTCGG CAAAAACAAG GACACCCCGC TGGAGCGGGC GCAGGACGAC
TATGTCGCCG CGCTGGAGTT GCTCTTTCAC TATGGCGACT ATCTCTGCAT CAATGTCAGT
TCACCCAACA CGCCGGGTTT GCGCTTGCTG CAGGGTGAAG AAGCCTTACG GGGACTGCTC
AAGGCCGTCG CCGCAGCCAA CCAGCGTCTG GCCCTACAGC ATCAGCGCCC GCCCCTGCCT
CTGCTCCTCA AGATCGCACC GGATCTGGAC AACGATGATC TCAACGCCAT CGGTAGTCTG
GCCTTAGGCA CGGCACCTCT GGTGAATGGT TTCATCGCCA CCAATACCAC CATAGAACGC
CCGGCCTCTC AACCCGGACT CTCCGAAAGC GGGGGCCTGA GCGGTGCACC ATTGCTGCAG
CAATCCAATG CCGTCATCGC GCAACTCTAT CGTGCGACCC AGGGACAGGT GCCCATCATT
GGCGTCGGCG GCATTCTGAG CGCGGCAGAC GCTTATGCCA AAATTCTGGC CGGGGCCAGC
CTGGTACAAG TCTACAGCGG CCTGATTTTC CGCGGACCCG GGCTGGTACG GGAGATTCTG
GAAGAACTGC CGGGGCTTTG GTTAAAGGAT GGTTATCCCG ATCTTGCTCA TGCGCGGGGT
AGTACCGCCT GA
 
Protein sequence
MSYGLLRPLL FTLDPERAHT LSIAALEALG HMPRGLARVA RRYTAHDPRL AQDFWGLHFA 
NPVGLAAGYD KDARATAALP ALGFGFIEIG TVTPRPQPGN PRPRVFRYPA QQAVINRMGF
PGEGAAAVAR RLAALPGHPV PIGINLGKNK DTPLERAQDD YVAALELLFH YGDYLCINVS
SPNTPGLRLL QGEEALRGLL KAVAAANQRL ALQHQRPPLP LLLKIAPDLD NDDLNAIGSL
ALGTAPLVNG FIATNTTIER PASQPGLSES GGLSGAPLLQ QSNAVIAQLY RATQGQVPII
GVGGILSAAD AYAKILAGAS LVQVYSGLIF RGPGLVREIL EELPGLWLKD GYPDLAHARG
STA
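
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below is one possible way to strip the spacing, compute GC content, and translate a coding sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in the set of permitted start codons, which this sketch does not handle). All names are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...), '*' = stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        c1, c2, c3 = seq[i:i + 3]
        index = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence, with the original block spacing.
print(translate("ATGAGCTACG GCCTGCTGCG C"))  # MSYGLLR
```

The translated prefix agrees with the first seven residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.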