Gene Lferr_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1117 
Symbol 
ID6877089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1093170 
End bp1094645 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content58% 
IMG OID642788998 
Productprotease Do 
Protein accessionYP_002219566 
Protein GI198283245 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0590546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAAATT GTGTGCTGGG TAGGCAGCGC CTTGTGATGG CGCTTTGTAT GGGTTTGGGA 
CTGAGCATAG GGGCGATGAC GCCTGCTTGG GCGGATTCCG GTGCCAGCGG TCAGGCGTTG
GTGGGGCTGC CTGATTTTAC GCCCATCGTC AAGCAGTATG GACCTGCAGT AGTGAACATC
AGCACGACGG AGACCAGAGT GGCGCGTGGT GTGACTTCCC CCTTCCCGCC GAACTCTCCC
CTGAATCAGT TTTTCGCCCC CTTCTTTGGC GCACCGGGGC AACCTGGAGC ACCTGGTGGC
GGAGCCGGAC AGAAATATCA GGTGCAATCC CTGGGTTCCG GTTTTGTCAT CAGCTCCGAC
GGCTATATCG TGACTGCGGC GCATGTGGTG AAAGGGGCTC AGAAGATCAT TGTCAGTCTC
ACCAATCATC ATCAATATGC AGCTCACCTG GTGGGCCTGT CGGCGCGTAT GGATGTGGCG
TTGCTCAAAA TTGACGCGAA GAATCTGCCG GTGGTACAGA TTGGTGACTC CAGCAAGCTG
GAGGTCGGAC AGTGGGTGCT GGCGGTGGGT TCGCCCTTCG GCTTTGAGAA CAGCGTCACC
CAGGGTGTGA TCAGTGCGAC CTCGCGGCCT TTGCCGGATG ATCCCTACAT CCCGTTCGTT
CAAACGGATG TGCCGATCAA CCCTGGTAAC TCCGGTGGCC CGCTATTCAA TATGCGCGGT
CAGGTCATCG GCATCAACGA CCAGATCTAT ACCAATAGCG GTGGCTACAT GGGGTTGTCT
TTCTCTATCC CCATCAATGT CGCCATGGAT GCGGTCAAAC AGTTAAAGCT GCATCAGAAA
GTGCATTTTG GCTGGCTCGG GGTCATGATT CAGGATGTCA GCATGGATCT CGCCAAGTCC
TTCCACATGA AAGAGCCGGT GGGTGCCTTG GTGTCACAGG TTGTGCCTGA CGGTCCGGCT
GCCAAGGCGG GGTTACGTCC GGGAGATGTC ATTGTCTCCT TTGACGGTCA GGCCATCTAT
AACTCTGGTC AATTACCGCC GCTGGTGGGA GTATTGCCCG CCGGTTTCAA GGCGAAGCTG
GGGGTTATCC GTGATGGCAA GCCCATGAGC CTCAACATCG TGGTGGAGAG TCTGCCCGGC
AACCTGGAGA ATACGGTGGA ATCCGCCGCA TCCGGCGGTC CGGCGCAGGA AGGTGAAGTC
AAACGACTGA ATGTGCAGGT GGGTCCGCTG ACTGCGGAGG CACGTAAGCA ACTGCACGTG
AATACTGGTG TCCTGGTCCT CGGGGTTGGT GTGGGGCCGG CGGCAGAAGC CGGTATTCGT
CCCGGTGATG TGATCTTGCA GGTGGCACAG CAGCAGATTA CCAATGCCGC CGACTTGCAG
AAGCTGGTGG CTGCCTTGCC GGCGGGCCAG CCGATCCCGG TGCTGGTGCG ACGTGGTGAG
GGGAGTTTCT ATCTGGTGCT TTCGCTGCCG CATTGA
 
Protein sequence
MKNCVLGRQR LVMALCMGLG LSIGAMTPAW ADSGASGQAL VGLPDFTPIV KQYGPAVVNI 
STTETRVARG VTSPFPPNSP LNQFFAPFFG APGQPGAPGG GAGQKYQVQS LGSGFVISSD
GYIVTAAHVV KGAQKIIVSL TNHHQYAAHL VGLSARMDVA LLKIDAKNLP VVQIGDSSKL
EVGQWVLAVG SPFGFENSVT QGVISATSRP LPDDPYIPFV QTDVPINPGN SGGPLFNMRG
QVIGINDQIY TNSGGYMGLS FSIPINVAMD AVKQLKLHQK VHFGWLGVMI QDVSMDLAKS
FHMKEPVGAL VSQVVPDGPA AKAGLRPGDV IVSFDGQAIY NSGQLPPLVG VLPAGFKAKL
GVIRDGKPMS LNIVVESLPG NLENTVESAA SGGPAQEGEV KRLNVQVGPL TAEARKQLHV
NTGVLVLGVG VGPAAEAGIR PGDVILQVAQ QQITNAADLQ KLVAALPAGQ PIPVLVRRGE
GSFYLVLSLP H