Gene Lferr_1346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1346 
Symbol 
ID6877320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1298504 
End bp1299988 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content51% 
IMG OID642789220 
Productprotease Do 
Protein accessionYP_002219787 
Protein GI198283466 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.62004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAA TCCAATGGAA AAGACCACGC TATATAGTAT CCCTTGCTGT TGCCGTTGCG 
TTGGGTTTCG GACTGGGAGC AACGGGATGG GTTTTTGGGG AATCAGGTCA TACCATTACG
CCTCTGCCCT CACCAAGTGA GGCAGAAACA AGGCCAAGAT TGGTCAAGAT ACCGGACTTT
AGCCCAATCG TAAAAAAATA TGGCAACGCA ATCGTTAGAA TCAGCGATTC TGACACAAAA
ATAGTTCATG AACATCAGGG GTTTTTTAAT CCATTTCCGA AAGACTCGCC GTTTTTTGGT
TTCTTTCGTG GGCTACCAAA TTACACGCCC CCGGAAAAGG AGGTGACCAA GGCTCTCGGG
TCTGGATTCA TCATCTCTCA CAACGGTTAC ATCGTCACGG CGGGACATGT CGTGAGAGGC
ATGCACCATA TTATGGTGAC CCTGAATAAT CATCACGCGT ATCGAGCCAA AGTGGTCGGA
TTATCCGTCC ATTATGATAC CGCGTTGCTG AAGATTCATG CCCACGATCT GCCTATCGTA
CAACTGGGGA ACTCAAAGAA CCTTCAGGTC GGTCAGTGGC TGCTGGCTAT TGGTATGCCG
TTTGGACTCT ATAACACCGT AACCCAAGGC GTGGTCAGCG CCATGAATAG ATCACTACCT
CATGATAATC AGTACATACC ATTCATTCAA AGCGATGTGC CCATCAATCC TGGAAATTCT
GGCGGTCCGC TTTTCAACAT GCGTGGACAG GTCGTCGGGA TCAATGATCA GATTTATACT
AACGATGGCG GCTACATGGG GTTATCCTTC AGCATCCCGA TTGATACCGC AATGCGTGCC
GTTCATGCAT TCGAGCGCCA TCAGAAAGTA AAATTTGGTT GGCTGGGTGT CGAAATTCAG
TCGGTGACGC CACAAATGGC GCAGGCGATG CACCTTCCGG AACCAGTAGG CGCATTGATC
GCGCAGGTTA TGCCCTCGAG TCCTGCGGCA AAAGCGGGCA TTAAGTCCGG GGAAGTGATC
GTGGCTTATG ATCACCGTCC TATTTACAAC GTTAGCACGC TCCCTCCATT AGTAGGTGAC
ACACCACCAG GCAGGATTGT GCCCATCGGC ATCCTCGATC ACGGGAAGCC CAGGACATTG
CAGGTTCAGG TCGGCGAGAT GCCGCAAAAG ATGCTGGTGG CTGCCGATCA ACAGAGCATC
GACATTCGTC GTCTAGGGGT ACGCGTTGGC ACCTTGGGAC CAAAGGAGCA GCAGAAACTC
GGGGTGGATC ACGGCGTATT GATTCAGTCC GTCTATCCGG GGCCAGCATC TTTTATTGGC
CTACGGAGTG GAATGGCGAT TTTGTCCATC AACCAACTGC GGGTAACCAG CCCTGAACAA
TTGGCTCAAC TGGTGAAATC TCTCCCTGCG AATACACCCA TTTCCATGCG TATTCGCAAT
CACCATGGGA GTATTTTCGT CGTGATTACG CTGCCCACAC GATAG
 
Protein sequence
MRKIQWKRPR YIVSLAVAVA LGFGLGATGW VFGESGHTIT PLPSPSEAET RPRLVKIPDF 
SPIVKKYGNA IVRISDSDTK IVHEHQGFFN PFPKDSPFFG FFRGLPNYTP PEKEVTKALG
SGFIISHNGY IVTAGHVVRG MHHIMVTLNN HHAYRAKVVG LSVHYDTALL KIHAHDLPIV
QLGNSKNLQV GQWLLAIGMP FGLYNTVTQG VVSAMNRSLP HDNQYIPFIQ SDVPINPGNS
GGPLFNMRGQ VVGINDQIYT NDGGYMGLSF SIPIDTAMRA VHAFERHQKV KFGWLGVEIQ
SVTPQMAQAM HLPEPVGALI AQVMPSSPAA KAGIKSGEVI VAYDHRPIYN VSTLPPLVGD
TPPGRIVPIG ILDHGKPRTL QVQVGEMPQK MLVAADQQSI DIRRLGVRVG TLGPKEQQKL
GVDHGVLIQS VYPGPASFIG LRSGMAILSI NQLRVTSPEQ LAQLVKSLPA NTPISMRIRN
HHGSIFVVIT LPTR