Gene Afer_0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_0452 
Symbol 
ID8322511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp451746 
End bp452804 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content67% 
IMG OID644951604 
Productsortase family protein 
Protein accessionYP_003109093 
Protein GI256371269 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3764] Sortase (surface protein transpeptidase) 
TIGRFAM ID[TIGR01076] LPXTG-site transpeptidase (sortase) family protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGGGA AGGGCCGGGG AGCGCACAGC CGACAGCGAG GTCGGGGAAT CGCGATCGTA 
GGGGTCGTGG CGATGCTCGC TGGTCTCGGC CTCATCGGCT CGATCGTCGC CTTCTACGTG
CGCTCCTCCC TTGTGGGTGG CGGACTCATC CAGCAAGCGC AGAAGGCCCG GACGGTCGCG
GCGTGGCCGC GATCGCTGCT CGCGATCGTG CGTATCCCCT CGATCGGGCT CGTCGCGCCG
GTGGAGCAGG GCACCGGCCA GTCGGTGCTC GCTGTGGCGG TGGGTCATCT CACGACGAGC
GCGCTCCCTG GGAAACCAGG CACGTCGGTG CTCGCGGCGC ACAACGTCAG CTGGTTCTCG
GGCCTCGGTG GTCTCGGCTC GGGATCTCTC ATTGAGGTCG ATACACCGTA CGGGCAGCAG
GTCTATCGTG TGGCCTGGCA TCGCGTCGTG CACGTCGGTG CGCCCGTGGC CAACACCGCC
GCACCGACTC TGGTGCTCGA AGCGTGCTGG CCGCTCAATG CGCTCTACTT GACGCCCGAG
CGCTACCTCG TTGGTGCCAC CTTGGTGGCG ACGACGAAGA TCGCGGTCAC GCCGGTCACG
CCGTCGTCGG ACAGCTACCA GCCGCTCGGG CTTGCGCCGA CGCTCGCGCA CGAGAACCTC
TCGCTCGCGG CCAACGACCT ACCGATGGGG GTGCTCGCCA CCGTTGGCTC GCCTGCTGCA
GCATGGACGA GTTCACAGCG ACCCTACAAC TTCGCTGGAG CGGAAGTGAC GTGGACCATT
GCGTTGTTGC ATGCGCTCGA AGCTCACGAC CTCGTGCTCG TCGAATCGGT GACCCACGAG
CCAGCAAGCG TGGTCGCACC ATTGCTCAGC TGGGACGGAG GCTTCGCGAG CCTCGACGAC
CTCACCGAGG TCGTCGATGG TGTCACGGCG TCGGCTGGCT CCTCGCGAGT GTCGCTCCAG
ACCGATCACG GGCCGCTCGT CGTTACCTTG CGTTTTCGGG TCATCGGGCA TGGGGTCGAG
GTAGCTGGCG CTGCGGTCGG GACGTCGCAG GGCTCGTAG
 
Protein sequence
MRGKGRGAHS RQRGRGIAIV GVVAMLAGLG LIGSIVAFYV RSSLVGGGLI QQAQKARTVA 
AWPRSLLAIV RIPSIGLVAP VEQGTGQSVL AVAVGHLTTS ALPGKPGTSV LAAHNVSWFS
GLGGLGSGSL IEVDTPYGQQ VYRVAWHRVV HVGAPVANTA APTLVLEACW PLNALYLTPE
RYLVGATLVA TTKIAVTPVT PSSDSYQPLG LAPTLAHENL SLAANDLPMG VLATVGSPAA
AWTSSQRPYN FAGAEVTWTI ALLHALEAHD LVLVESVTHE PASVVAPLLS WDGGFASLDD
LTEVVDGVTA SAGSSRVSLQ TDHGPLVVTL RFRVIGHGVE VAGAAVGTSQ GS