Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_0378 |
Symbol | |
ID | 8322433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | - |
Start bp | 388694 |
End bp | 391510 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644951526 |
Product | protein of unknown function DUF470 |
Protein accession | YP_003109019 |
Protein GI | 256371195 |
COG category | [S] Function unknown |
COG ID | [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.173445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGTCGG CGCCCCTCGA GGTGCGCCTC GCGCCCGGCG CGCGCGTCGT CGTCCTCGGC AATCTCGACC TCCACGATCG GCGCCCAGGC CCCGATCGTG ACGAGCTCGC ACGCATCATC GGGGAGCTCG AGCCCGATGA CCTGCTCGTC CTCGCTGGGC GCGTCACCGC ACCCGAGCCG AGTCTCGACC CACGCTCCGC ACTCGGGCAC CACCACGACG TCGTCGACCT GCTCACGAGC GCCCGCGCAC GCACCGTACG CATCACCTCC GTGCGCGACG AGCCTCTGGC AATGGGGGCC GAGGTAGAAC ACACCGCCGA GCTGATCCTC CTCGGGCCAG GGCCCCGCGG CCATCGCGTC CGCATGAAGC CGGTCGGCAC GGATCTCGCA CGACGGCGCA CGCTCGAAGC GCTCGAACGA CTCCCAGGCT TCCGTGGCGC TGCCTGGCTC GATCCGGCCG CCTCCGTGAC CCACTTCGTC GTGGCGCGTT CGTTCGCACG GCGTGCACGC ATGCTCGCGT TGTGGGTCTT CGTCCCGCTC GTCCTGCTCG AGCTGGCCCG GCTGCCGTTC GTGCTCAGGG CCGCACCGCT CGTCGCACAC CTCCACCACC AGGCATCGCG CCACATCGCC CTCGTCGCGA GTTTCCTCGT CCTCGATGCC CTCATCGTGC TCCTTGCCAG CGTCCTCGCT TCGCGGTCGG TGACCCACCT CATCGAGGAG CCGATGGAGT CGATGGTGAG CGCACTGGCC ACCAACCAAG CCGCGCGTGA TCTGTTCGAG AGCCTCCCAC CGCACACCTC GCTCATCGCC GCCCAGGCCG AGCTCGCCGA ACTCCAACAC GACGACGACG CCTTCCTCGC CTCGCCAGGA CCGACCACGG CCGTGCTCAC GCCCGTTGCC GGCAGAGGTC TGCTCCCCAC CCGCTACGAG CCCATGTACC ACACGACGTG GATCGAGCTC GAGGTCGGCG CGGTCGTGCG CGCCTACCTG CGCCGGACGC GCCGTCGCGT TCGCCCGCCG AACGCGCTCT CGCGCCTGCT CTGGCCCGAT CAGGCCCTCC TCGATCATCA GGTGACGCTC GCGGGCGCCC CTGACGGCGG TTCGTGGCCC CCCGCGCCCA GGCTCGCCTC CGAGACCTCG CTCACGGTGC GCCGTGTCGG GGGACTTGCG GTCAGCGTCG CCGGTCTCGT GGAGGTGATC TCGGCGTTCG CACCCCCATT GGCACGACAC CTGTCGATCG TCACCACACT CTTCCCTACC CTCGGACCGA TCCCTCGCTA CGCCGACGCG ATCTCGGCGG CCGCCGGCAC GCTGCTGCTC GGCATCGCCC AAGGGCTCCG AGCCGGTCAA CGACGGGCGT GGCGGATCGC CATGGTGGTC CTCGCAGCAG CGGTCGCCTC CAACCTGGTC CGCCGCAGCG ACCTCATCAC GACCGGCCTC CTCAGCCTCG CGCTCGTCGC GCTGGCCATC TGGCGAAGCG CCTTTCGACA ACCCGCCCCC AGGGGCCGAC GAGTGTGGAG GGCACTCTCG GTCGTGGGCG CCGGCGTCTT GGTCGTCCTC GTCGCCGAAG TCGTCGCGAT CGGCGATGCC CTCGTACACC ATGCACCTCT CCGTCCCCTC GCGGCCCTCG GAGGGATCAT CGGCACCGTG CTCGGTCTTC CGGTTCAAGC ACCGCCGCCA TTCGCGGTCG GTGAGTTGCA AGACGCACTC ACGCTGGTCG GCGTCGCACT CCTCGTCCTC ATCGTGTGGC GCCTCGTGGC ACCCGTGCGC GATCGCCTCA GCGCGCAGCT CGCCTCCCTC AGAGCGCCCC GCAATCCCGC ACAGATCCTG CGAGCCCACC CACAGAGCAC ACTGGACTAC TTCGCGCTGC GCTCCGACAA GGAGCACCTC GTGCGTCATG GCGGCCTCGT GGCCTACGGC GAGTTCGGCT CCGTCGTCAT CGTCTCCCCA GACCCCATCG GAGCGACCGC CTCGGCACGC CTCGCCTTCC TCGAGCTCTT CGAGGCCACG TCTCGCGCCG GGAAGGCACT GTGCGTCCTC GGGGCGAGCG AGACCTGGAG CGAGTGGTAC CGCGAGGTCG GCTTGCATCC CCTCTACCTC GGTGACGAGG CGATCGTCAC GCTCGGTCAA CTCGACCTTG CCGGCAAGCG CCACAAGAGT CTGCGCCAGG CAGTCAATCG GATGCGCCGT TACGGCTACC GCGTGCGGGT CGTCGCTCCG CTCGAGCTCA CCGAAGAGGA GCGGCGCGAC GTGCTTCGCG TCATGAGCGA GTCACGCCGC GGCGGCCGAG AACGCGGGTT CTCGATGACG CTGGGGCGCG TCTTCGACCC CCGCGACACC GACGTCCTCA TGAGCGTGTG CACAGGACCC GACGGCCGGA TCGTCGGGTT CGTGCAGTGG GTCCCCGCGC CGAGCATCGA GGGCTACTCG CTCGACCTGA TGCGTCGCGA CCTCGGCAAC CATCCGAACG GCATGTTCGA CCTGCTCATC GTCGAGACCA TGACGCAGCT CCAAGCCCGC CACGTCAAGG CGATCAGCCT CAACTTCGCT GCAATGCGGG GCGTGCTCGC CGGTGAACGC GGCGGCGAAC TCCTGAGCGC CCGAGTCGAG CGCTGGGTGC TCGATCGGCT CTCGTCGAGC ATGCAGATCG AGTCGCTCTG GCGCTTCAAC GCGAAGTTCG AGCCCCGCTG GGAACCCCGC TACCTCGTCG TCGACGCCTA CGAGCACCTC GCCGCGATCG CGATCGCCGC AGCACGCGCC GAGTCACTCT GGGACCTGCC GCTCGTCGGA CGCTTCCTGG CGGACCCCCA TGCATGA
|
Protein sequence | MESAPLEVRL APGARVVVLG NLDLHDRRPG PDRDELARII GELEPDDLLV LAGRVTAPEP SLDPRSALGH HHDVVDLLTS ARARTVRITS VRDEPLAMGA EVEHTAELIL LGPGPRGHRV RMKPVGTDLA RRRTLEALER LPGFRGAAWL DPAASVTHFV VARSFARRAR MLALWVFVPL VLLELARLPF VLRAAPLVAH LHHQASRHIA LVASFLVLDA LIVLLASVLA SRSVTHLIEE PMESMVSALA TNQAARDLFE SLPPHTSLIA AQAELAELQH DDDAFLASPG PTTAVLTPVA GRGLLPTRYE PMYHTTWIEL EVGAVVRAYL RRTRRRVRPP NALSRLLWPD QALLDHQVTL AGAPDGGSWP PAPRLASETS LTVRRVGGLA VSVAGLVEVI SAFAPPLARH LSIVTTLFPT LGPIPRYADA ISAAAGTLLL GIAQGLRAGQ RRAWRIAMVV LAAAVASNLV RRSDLITTGL LSLALVALAI WRSAFRQPAP RGRRVWRALS VVGAGVLVVL VAEVVAIGDA LVHHAPLRPL AALGGIIGTV LGLPVQAPPP FAVGELQDAL TLVGVALLVL IVWRLVAPVR DRLSAQLASL RAPRNPAQIL RAHPQSTLDY FALRSDKEHL VRHGGLVAYG EFGSVVIVSP DPIGATASAR LAFLELFEAT SRAGKALCVL GASETWSEWY REVGLHPLYL GDEAIVTLGQ LDLAGKRHKS LRQAVNRMRR YGYRVRVVAP LELTEEERRD VLRVMSESRR GGRERGFSMT LGRVFDPRDT DVLMSVCTGP DGRIVGFVQW VPAPSIEGYS LDLMRRDLGN HPNGMFDLLI VETMTQLQAR HVKAISLNFA AMRGVLAGER GGELLSARVE RWVLDRLSSS MQIESLWRFN AKFEPRWEPR YLVVDAYEHL AAIAIAAARA ESLWDLPLVG RFLADPHA
|
| |