Gene Lferr_1548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1548 
SymbolpepN 
ID6877523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1503276 
End bp1505903 
Gene Length2628 bp 
Protein Length875 aa 
Translation table11 
GC content60% 
IMG OID642789411 
Productaminopeptidase N 
Protein accessionYP_002219977 
Protein GI198283656 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00280197 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATATCT CGGGATCGAA AGCGACGGTA ATCCGGCGTG GAGACTACCA GGCTCCCAGC 
TATCAAGTCT CCGAAATCGC CTTGGATGTG CGCCTGGATC CAGACAATAC CGAAGTACAC
ACCCGCCTTC AGTTGCATCG CATCGCACCG GAGCCGGTCG CGGAACTGCA TCTGGATGGC
GAGTCTCTGG AACTGTTGGG TCTCCAGCGG GATGGCCAGG CGCTGGCAGA AAGCGCATAT
CGGCTGACCG AGGGTGGTTT GCTCTTGCTG AATCCTCCGG AAGCCTTCAT TCTGGAAAGC
AGGGTACGTA TCCATCCTCG GGCCAACACC GCGCTTTCCG GTCTCTATCA TGCGGGCGGG
CAGTTTCTGA CCCAATGTGA AGCGGAGGGA TTCCGGCGTA TCACCTATTA CCTGGATCGC
CCCGACTGCC TCGCCCGCTT CACGGTGACC CTGCACGCCC CGCAGGACAG TTGCCCGGTA
TTGCTCGCCA ATGGCAACTG TATGGCCACG GGTGTTGAAG AGGGTGGCTG GCACTGGGCC
CGCTGGGAAG ACCCCTATCC CAAGCCGGCC TACCTCTTCG CCATGGTGGC CGGGGATCTG
GCGGTAGTCC GCGACCGGTA TCGCACTGCT TCCGGCCGGG AAGTAGCCCT GGAAATCTAT
GTGGCCGAAC GCGATACCGG TGCCTGCGCC CAGGCCATGG ACAGCCTCAA ACACGCCATG
CGCTGGGACG AAGAGGTCTA CGGCCGGGAA TACGATCTAA ACCGCTATAT GATCGTCGCT
ACCGACAGCT TCAATATGGG CGCGATGGAG AATAAAGGCC TCAATATTTT TAATGCCAAG
TATGTGCTCG CTAGCCCGGA AACAGCTACG GACAGCGACT ATCAGGGTAT CGAGTCGGTT
ATCGCCCACG AATATTTCCA CAACTGGACC GGCAATCGGG TGACCTTGCG CGACTGGTTT
CAGCTCAGTC TCAAAGAAGG CCTGACGGTG TTTCGCGATC AGGAATTCAG TGCCGATCAA
AATTCCCGCG GTGTCCAGCG CATCGGCGAT GTACGCCGCC TGCGTGCCGC CCAGTTCCCG
GAAGACGCTG GTCCGCTCGC CCACCCGGTG CGGCCCGATG CCTATTCCGA GATCAACAAT
TTCTACACCG CCACGGTTTA TGAAAAAGGC GCCGAATTGG TGCGAATGAT GCACACCCTG
CTTGGCAATG TGCCATTTCG GAAGGGTATG GATCTGTATT TCGAGCGTCA CGACGGCCAT
GCGGTAACCA TAGAGGACTT TATTGCCGCC ATGGAAGACG CCAACCAGCG TGACCTCTCC
GGATTTCGGC GCTGGTATGG CCAGGCGGGG ACGCCCGTGG TACGGGCTAC AGGCAGCTAC
GATCCTGCCC GGCACAGCTA TACCCTTACC CTGCATCAGG AAACCCCGGC AACCCCCGGT
CAACCCGTGA AAGAGGCGGT GCCGATCCCG GTTCGTATGG CGTTGCTCAA CACCCAAGGG
CAACGTGTGC CTCTGGAGGT TGTCGGGGGT GCCTCCGAAA CGGTATTGTT GCTGGAGCAG
TCGGAGCAAT CCTGGAACTT TGCGAACTTG CCGGGTCCGG TGATTCCTTC CCTTCTGCGC
GGCTTCTCCG CCCCGGTGCG GCTGCAGGAC TCTCTGGATG ACGACGCCCA TGGTTTTCTG
GCACGCCACG ATGATGATCC CTTCAATCGC TGGGAAAGCA TGCAGGACCT TGCTGTAAAA
GCCTTGCTCG CCGCTGTGGC GGATTCATCG GTCGCACCAT TGCCCATCAC ATTACGGAAC
GCCGTCGCAG CGACGCTGGC GGACCGTCAG GTGGACCCCG CCTTTTGTGC CGAACTGTTG
ACGCTCCCCG GCGAGGATTA TATTGGTGAA CAGATGCCTG TAGTCGCCGT CGAGGCCATC
CACCGGGCCC GGGACGGGAT GATGCGTGCC ATCGGCAGAC ATTTCGCAAA GGAATGGACC
GGTCTGTATC ACGATCTTGC TGCCGCCTAT CAGCGCGACG GTCTGTCCAT CGGTCGCCGG
CGCCTGCGCA ACCTGGCTCT GGCTTATCTC ATCGCCGGTG AGGGCAACAG CCGCGAAGGA
GCGATCCTGC ACGCCCGTCA GCAATATATA CAGGCCGATA ACATGACGGA CCGACTGGCA
GCTTTCCAGT TGCTGGCGCA ACACCGGCAT CATGATGCCG AAGACGTCAT CCTCGACTTT
TACCAAAGGT GGCACGAATA CCCGCTGGTG ATCGACAAAT GGTTTGCCAT ACAGGCGGCT
GCGCCCTACC CGAAAACCTT GCGGCAGGTG GAACATCTGC TGGTGCATCC CGCCTTCGAC
TGGCGGGTGC CCAATCGGGT GCGTGCGGTA CTCGGTGCGT TTGCCGCCAA TCCCACGGTT
TTCCATGCGG CGGATGGCTC CGGTTATACG TTCTTTGCCG AGCAGATCCG CCGTCTGGAT
GACATCAATC CGCAGACGGC GGCGCGCCTG GCGACACCCC TCAGTCGTTG GCAGCGTTAC
GATGCCCCGC GTCAGCAGGC GATGGTGACG GCCTTGAAAA TATTGGCTGG TAAGCCTGGG
CTTTCCCGCG ATCTGGCAGA AGTCATACAG CGTTCCCATC CAGAGTAG
 
Protein sequence
MNISGSKATV IRRGDYQAPS YQVSEIALDV RLDPDNTEVH TRLQLHRIAP EPVAELHLDG 
ESLELLGLQR DGQALAESAY RLTEGGLLLL NPPEAFILES RVRIHPRANT ALSGLYHAGG
QFLTQCEAEG FRRITYYLDR PDCLARFTVT LHAPQDSCPV LLANGNCMAT GVEEGGWHWA
RWEDPYPKPA YLFAMVAGDL AVVRDRYRTA SGREVALEIY VAERDTGACA QAMDSLKHAM
RWDEEVYGRE YDLNRYMIVA TDSFNMGAME NKGLNIFNAK YVLASPETAT DSDYQGIESV
IAHEYFHNWT GNRVTLRDWF QLSLKEGLTV FRDQEFSADQ NSRGVQRIGD VRRLRAAQFP
EDAGPLAHPV RPDAYSEINN FYTATVYEKG AELVRMMHTL LGNVPFRKGM DLYFERHDGH
AVTIEDFIAA MEDANQRDLS GFRRWYGQAG TPVVRATGSY DPARHSYTLT LHQETPATPG
QPVKEAVPIP VRMALLNTQG QRVPLEVVGG ASETVLLLEQ SEQSWNFANL PGPVIPSLLR
GFSAPVRLQD SLDDDAHGFL ARHDDDPFNR WESMQDLAVK ALLAAVADSS VAPLPITLRN
AVAATLADRQ VDPAFCAELL TLPGEDYIGE QMPVVAVEAI HRARDGMMRA IGRHFAKEWT
GLYHDLAAAY QRDGLSIGRR RLRNLALAYL IAGEGNSREG AILHARQQYI QADNMTDRLA
AFQLLAQHRH HDAEDVILDF YQRWHEYPLV IDKWFAIQAA APYPKTLRQV EHLLVHPAFD
WRVPNRVRAV LGAFAANPTV FHAADGSGYT FFAEQIRRLD DINPQTAARL ATPLSRWQRY
DAPRQQAMVT ALKILAGKPG LSRDLAEVIQ RSHPE