Gene Lferr_1907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1907 
Symbol 
ID6877892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1897779 
End bp1899344 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content62% 
IMG OID642789777 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002220335 
Protein GI198284014 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAGA TTACAAGGGC GCTCATCAGC GTTTCCGACA AGCGGGGCGT TGTAGAATTT 
GCCCGGCGGT TGCAGGATTT TGGGGTAGAG ATACTCTCTA CCGGCGGTAC CGCCAAAGCT
CTGATGGCCG ATGGTGTCGC GGTGCAGGAA GTGGGCGACT ACACGGGTTT CCCGGAACTA
CTGGAGGGCC GCCTCAAAAC CCTGCACCCC AAAATCCACG GCGGACTGCT GGCGAAGCGC
GACGACAGCA GCCACACCCG GCAGATGGCC GAGTACGGCA TCCCCGCGAT CGATCTCCTC
TGCGTCAATC TCTACCCCTT CGCCGAGACC ATCGCCAGTG CCGATTGCAC ACTGGAAGAA
GCCATGGAAA ACATCGATAT CGGCGGCCCG ACCATGCTCC GTGCGGCGGC GAAGAACTGG
GAGGGCGTCA CTGTCCTCGT CGACCCCGAT GACTATGCCG CTGTGTTGCA GGAAATGGAA
CAGAGTTACG GCGGCGTCGG CGCCAGTACC CGCTTCCGCC TGGCCACCAA GGTCTTCGCC
CACACGGCGC GCTATGACGG TGCTATCGCC AACTATCTCT CCAGCCTGGG TCCCGATGGC
AACCGGACAA CCTTCCCGCA GACTCTGTCC CTGCAATTTA AGAAAGCGCA GGATCTGCGC
TACGGCGAAA ATCCTCATCA GGCCGCCGCC TTCTACCGCG ATGGCAGCGG CGGCGGACTG
GCGGACGCCC ATCAGTTGCA AGGCAAGGAA CTGTCTTACA ACAATATCGG GGACGGTGAT
GCCGCCGTCG CGCTGGTGAT GGAATTTGCC GAACCCGCCT GTTGCGTGGT GAAGCATGGC
AATCCCTGCG GCGTGGCCGT GGGGCCGGAT CTGCTCGGTG CCTATCAGCG CGCATGGGCC
GGCGATCCGA TATCCGCCTT CGGCGGCATC GTCGCCTGTA ACCGGCCGCT GGATGCACAG
ACTGCCGAAC TCATTAGCGA TCAGTTCATC GAGATGGTAC TGGCGCCCGC TATTTTGCCC
GATGCCCGGC CCATTCTGGC CAAAAGGAAA AACCTGCGGG TGCTCGCCTT TGACGATGGC
CGCGCCTGGC GGCGGACAGG CTGGGATTAC AAGCGTGTGC GGGGGGGGTT GTTGGTACAG
AACTTTGACC AGGCCATGGA AGCGGAAACG GACTGGAAAG TGGTCTCGGA ACGCGCACCG
ACGGTACAGG AAGCCCGTGA TCTCGCCTTT GTCTGGCGGG TCGGTAAATA CGTGCGCTCC
AACGCCATTG TCTATGGCCG AGAAGGCCAG ACCGTCGGCA TCGGTGCAGG ACAGATGAGC
CGGGTGGACG CGGCCAGATG CGGCGTAGCC AAGGCCCTGG AACTGGGCTT CGATCTGCAC
GGGGCAGCGC TGGCTTCTGA CGCGTTCTTC CCCTTCCGCG ATGGGATCGA TGCGGCGGCG
GCTGCGGGCG TAAAGGCGAT CATTCAACCC GGCGGCTCCA TCCGCGATGA AGAAGTCATC
GCCAGCGCCA ATGAACACGG CATCGCCATG GTCTTCACCG GCGTGCGCCA TTTCCGACAT
GGTTGA
 
Protein sequence
MGEITRALIS VSDKRGVVEF ARRLQDFGVE ILSTGGTAKA LMADGVAVQE VGDYTGFPEL 
LEGRLKTLHP KIHGGLLAKR DDSSHTRQMA EYGIPAIDLL CVNLYPFAET IASADCTLEE
AMENIDIGGP TMLRAAAKNW EGVTVLVDPD DYAAVLQEME QSYGGVGAST RFRLATKVFA
HTARYDGAIA NYLSSLGPDG NRTTFPQTLS LQFKKAQDLR YGENPHQAAA FYRDGSGGGL
ADAHQLQGKE LSYNNIGDGD AAVALVMEFA EPACCVVKHG NPCGVAVGPD LLGAYQRAWA
GDPISAFGGI VACNRPLDAQ TAELISDQFI EMVLAPAILP DARPILAKRK NLRVLAFDDG
RAWRRTGWDY KRVRGGLLVQ NFDQAMEAET DWKVVSERAP TVQEARDLAF VWRVGKYVRS
NAIVYGREGQ TVGIGAGQMS RVDAARCGVA KALELGFDLH GAALASDAFF PFRDGIDAAA
AAGVKAIIQP GGSIRDEEVI ASANEHGIAM VFTGVRHFRH G