Gene Lferr_2438 details


Gene Information

Locus tag: Lferr_2438
Symbol:
ID: 6878436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 2400344
End bp: 2401684
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 60%
IMG OID: 642790295
Product: peptidase M24
Protein accession: YP_002220840
Protein GI: 198284519
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
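
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length), and that the protein length excludes the single terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both counted (1-based, inclusive)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
gene_length = inclusive_length(2400344, 2401684)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1341 446
```

Both results agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields.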


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000518121
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000118983
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCAAGC TTGATTTGCC TCGCCCCGAT CATGCCGCCC GCCGTCGTCT GCTGATGCAG 
AAGATGCACA CTGCCGGTGT CGCGATCATA CCCACCGCCA CACCCAAGAG CCGTAACGGC
GATGTACAGT ACCCATTTCG CGGCGACAGC GATTTCCTCT ATCTGACCGG CTTTGCAGAG
CCGGAGGCTA TACTGGTGCT GGCACCGGGA CACGCCGACG GGGAACAGAT CTTGTTCTGC
CGTCCCCGTG ATCCCGAGCG GGAAACCTGG GATGGGCGTC GCGCTGGGCT GGAGGGTGCT
CTTGAGCAAT GTCAGGTGGA TCGTTGCCTC TCGATTCATG ACCTGAACGA CGTTCTTCCA
CAACTGCTGG AAAATCGGGA ATTGCTGTTT TATCCCATGG GTCAGTCCAC GGATTTCGAC
GCGCGGGTCA TGCACTGGCG CAATGTTGCC AAGAGCAAGA TTCGCCAGGG TGTGCGCTAC
CCACTCGAAG TGGTCGATGT TGCCGACCTG ATCCATGAGA TGCGGTTGTT CAAAGACCCC
GAAGAGATCG AAATCCTGCG GGCGGCGGTA GGCATCAGCG GTGCCGGGCA TCGCCATGGC
ATGCGCCAAT GTCGTCCGGG TATGCTGGAA TACGAACTGG CCGCCGAGAT CGAGCATGTT
TTTCGTCGCC TCGGATCGCC CAGCGTGGCG TATCCCAGTA TCGTCGGTGG CGGAATCAAC
GGTTGCATTC TGCACTACAC GGAAAATGAT GCGGAACTGC GCGATGGCGA TCTGGTACTG
ATCGACGCCG GTGCAGAGGT CGGCGCCTAC GCCGGAGACA TCACCAGGAC CTTGCCGGTG
AACGGGGTTT TCAGCCCCGC ACAACGGGAA GTTTACGAAG TCGTTCTGGC CAGTCAGAAA
GTGGCAATCG CCGCCGTACA GGTAGGGCGC TCCGTTACCG ATTATCATGA CGAGGCCGTC
AAGGTGCTGG TGGACGGCTT GTTGGAGCTG AAGATACTTT CCGGCAGTCG CGACGCAGTC
ATCGAGCAGG GCAGCTACAA GGCCTTCTAT ATGCACCGGA CGGGGCACTG GCTGGGCATG
GACGTTCATG ACGTCGGCCA CTATCGCAGC GCGGATCAGT CCTGGCGGAA ACTGGAGGCG
GGGATGGTGC TGACGGTGGA GCCGGGCCTG TACTTCTCGC CGGATAATCC ATCGGTACCG
GAGCGTTGGC GCGGTATCGG GGTTCGCATC GAAGACGACG TGCTGGTGAC CACCGGCGGT
CCGGATGTTT TGTCGAGCGA AGTACCCAAG GAGGTTGCCG AGGTCGAAGC CATGATGGCA
GCAGGATGGG CGGAAGGCTG A
 
Protein sequence
MPKLDLPRPD HAARRRLLMQ KMHTAGVAII PTATPKSRNG DVQYPFRGDS DFLYLTGFAE 
PEAILVLAPG HADGEQILFC RPRDPERETW DGRRAGLEGA LEQCQVDRCL SIHDLNDVLP
QLLENRELLF YPMGQSTDFD ARVMHWRNVA KSKIRQGVRY PLEVVDVADL IHEMRLFKDP
EEIEILRAAV GISGAGHRHG MRQCRPGMLE YELAAEIEHV FRRLGSPSVA YPSIVGGGIN
GCILHYTEND AELRDGDLVL IDAGAEVGAY AGDITRTLPV NGVFSPAQRE VYEVVLASQK
VAIAAVQVGR SVTDYHDEAV KVLVDGLLEL KILSGSRDAV IEQGSYKAFY MHRTGHWLGM
DVHDVGHYRS ADQSWRKLEA GMVLTVEPGL YFSPDNPSVP ERWRGIGVRI EDDVLVTTGG
PDVLSSEVPK EVAEVEAMMA AGWAEG