Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_0844 |
Symbol | |
ID | 8322907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 863891 |
End bp | 866728 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644951978 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003109463 |
Protein GI | 256371639 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.752183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAGT GGATCCGCGT TCGCGGCGCG CGAGAGCACA ACCTCCAGAA CGTCTCGGTG GACGTGCCAC GCGATCGTCT CGTGGTCATC ACGGGTCTGT CGGGCTCGGG CAAGAGTTCG CTCGCGTTCG ACACGATCTT CGCCGAGGGA CAGCGTCGCT ACGTCGAGTC GCTGTCGTCG TATGCCCGGC AGTTCCTCGG TCTCATGGAG AAGCCCGACG TCGATGTCAT CGATGGGCTC TCACCTGCCA TCGCGATCGA CCAGAAGTCG GCTTCGCACA ATCCCCGCTC GACCGTCGCG ACCGTCACGG AGATCTACGA CTACCTGCGT TTGCTCTACG CGCGCGTCGG TCATCCCCAC TGCCCATCGT GCGGACGTGA GGTGAGCCGG CAGACGCCCC AGGACATCGT GGACTGGGTC CTCGCACGTT ATGAGAACAG GCGTTTGCTG GTCGTGGCGC CGCTCGTTCG CGGCCGCAAG GGAACCTACG ACGAGCTGCT CGACCGACTG GCGCGCGAGG GATGGTCGCG CGTGCTCGTG GACGGGGTGC TGTGGTCGCT CGACGACCGA GGCTCCATCA ACCTGGCGCG CTACGAGGCG CACACCATCG CGGTGGTCGT CGACCGGATC GTGCCCAAGT CCGCCGACCG CCAGCGTCTG ACCGAGTCGG TGGAGACGGC GCTCAAGGCC GCTGGCGGCG TCGTCGAGGT CGTGCCGGTC GGCGACGACG GCGAGCTCGG AGCGGGCGAG CGCTTCTCCG AGACGCTCGC CTGTGCCGTC TGCGGCATCT CGCTGGGCGA GCTCGAACCT CGGAGCTTCT CGTTCAACAG CCCCTTCGGT GCCTGTCCGG CTTGTGGGGG GCTCGGGGTC CGCTTCGTCG TCGACGAGGA TCTCGTCATG CCCGACCCGA ATCGCTCGGT CCTCCAGGGC GGCGTCGTCG CGCTCGGGAC GCTGCGGGCC GATCTCGTTC GTCGGCAGAT GGAGCGCGCC CTGCGCGCCG CGGGTTTCTC GCTCTCGACT CCGGTCGGCA AGCTTCCCGA GGCGGCCCGA CGCCTCATCC TCGAGGGCGC TTCCGAGCCG ATCGAGGAGC GCTGGGTCGA TCGTGCCGGT CGTGAGCGCA GGGCCCAGGT GACGTTCCAG GGCGTCGCCG CCTTCCTCGA GCGACGGCGG AGCGAGGCCG AGTCGGAGGT GGCTCGTCAG CTCGCCGAGA GCTTCATGCG TGAGGTGCCG TGCGATGCCT GCGGCGGCAC CCGGTTGCGG CCAGAGGCCC GGAGCGTGAC CGTCGATGGG GTGAGCCTCG ACACGCTCGT CAACGCCTCG ATCCGCCGTG CGGTCGAGGT GGTGGATGCG ATGCGCTTCG GGGAGCGCGA GGCGACCATC GCCACCCCTG TGCTGCGCGA GATTCGCGCA CGGCTGCAGT TCCTCTGCGA CGTGGGGCTC GACTACCTGA CGCTCGGTCG GGCGGCGAGA ACGCTGTCGG GCGGGGAGGC CCAGCGCATC CGGCTTGCCT CGCAGATCGG CTCGGGCCTC GCCGGGGTGC TCTACGTGCT CGACGAGCCG TCGATCGGGC TCCACCAGCG GGACAATCGA CGCTTGATCG ACACGCTGGT GCACCTACGG GATCTCGGCA ACACGGTGAT CGTCGTCGAG CACGACGAGG AGACGATTCG CGCCGCGGAC TGGGTGGTCG ATGTCGGACC GGGCGCGGGC GAGCACGGCG GCACGATCGT GCACGCCGGC ACGGTCGCCG AACTCGAGAC GGTCGAGTCC TCCCTCACCG GTGCCTACCT CTCGGGGCGC CGTGCGATCG ACGTGCCGGT GGCTCGGCGC ATGCCCGAGC GGGGATGGTT GCGGGTGCTG GGCGCTCGCG AGCACAACCT GGCCGACATC GACGCTGCGT TCCCGATCGG CCTTTTGACC TGTGTGACCG GGGTATCGGG GTCGGGCAAG TCGACGTTGG TGAACGAGAT CTTGTTCCGG GCACTCAGGG CGCAGCTGCA CCGCTCGCAC GACGTCCCCG GTCGCCACAA GGCCATCGAG GGCATCGAAC TCGTCGACAA GGTGATCGAC ATCGATCAGG CACCGATCGG CCGCACCCCG CGCTCGAATC CTGCGACCTA CACCGGTCTG TTCGACCACG TTCGCCGTCT CTTCGCCGAG ACACCCGAAG CGCGGGCTCG TGGCTATCGA CCAGGTCGCT TCTCGTTCAA CGTCAAGGGG GGGCGCTGCG AGGCGTGCCA AGGCGAGGGA ACGGTTCGCA TCGAGATGAA CTTCCTCGCC GACGTGTACG TGACCTGCGA TGTCTGTGGC GGCTCGCGCT ACAACCGTGA CACCCTGGAG ATCCGCTACC GTGGGCTCAC CATCGCCGAC GTGCTCGCGC TCTCGGTCGA GGAGGCGTTG GCGTTCTTCG CGAAGCAGCC GGCGATCGCG CGCCACCTCA CGACGCTGGC CGAGGTGGGA CTTGGTTACG TCCGTCTCGG TCAGGCGGCC ACGACCCTTT CGGGCGGCGA GGCCCAGCGC GTCAAGCTCG CAACCGAACT CGCGCGGCGA GCGACCGGGC GAACGGTCTA CATCCTCGAC GAGCCGACGA CGGGACTGCA CTTCGAGGAC GTGCGCAGCT TGCTCGGGGT GCTGCACTCG CTGGTGGATC AGGGCAACAC GGTGATCGTG ATCGAGCACA ACCTCGACGT GATCAAGACG GCCGACTGGA TCATCGATCT CGGACCCGAA GGTGGCAGTG GCGGCGGCAC CGTCGTGGCG CAGGGTACGC CGGAGGACGT GGCCCAGGTG CCCAGCTCGC ACACGGGGGC CTTCCTCGCT CCGTTGCTTG GGCGATGA
|
Protein sequence | MPEWIRVRGA REHNLQNVSV DVPRDRLVVI TGLSGSGKSS LAFDTIFAEG QRRYVESLSS YARQFLGLME KPDVDVIDGL SPAIAIDQKS ASHNPRSTVA TVTEIYDYLR LLYARVGHPH CPSCGREVSR QTPQDIVDWV LARYENRRLL VVAPLVRGRK GTYDELLDRL AREGWSRVLV DGVLWSLDDR GSINLARYEA HTIAVVVDRI VPKSADRQRL TESVETALKA AGGVVEVVPV GDDGELGAGE RFSETLACAV CGISLGELEP RSFSFNSPFG ACPACGGLGV RFVVDEDLVM PDPNRSVLQG GVVALGTLRA DLVRRQMERA LRAAGFSLST PVGKLPEAAR RLILEGASEP IEERWVDRAG RERRAQVTFQ GVAAFLERRR SEAESEVARQ LAESFMREVP CDACGGTRLR PEARSVTVDG VSLDTLVNAS IRRAVEVVDA MRFGEREATI ATPVLREIRA RLQFLCDVGL DYLTLGRAAR TLSGGEAQRI RLASQIGSGL AGVLYVLDEP SIGLHQRDNR RLIDTLVHLR DLGNTVIVVE HDEETIRAAD WVVDVGPGAG EHGGTIVHAG TVAELETVES SLTGAYLSGR RAIDVPVARR MPERGWLRVL GAREHNLADI DAAFPIGLLT CVTGVSGSGK STLVNEILFR ALRAQLHRSH DVPGRHKAIE GIELVDKVID IDQAPIGRTP RSNPATYTGL FDHVRRLFAE TPEARARGYR PGRFSFNVKG GRCEACQGEG TVRIEMNFLA DVYVTCDVCG GSRYNRDTLE IRYRGLTIAD VLALSVEEAL AFFAKQPAIA RHLTTLAEVG LGYVRLGQAA TTLSGGEAQR VKLATELARR ATGRTVYILD EPTTGLHFED VRSLLGVLHS LVDQGNTVIV IEHNLDVIKT ADWIIDLGPE GGSGGGTVVA QGTPEDVAQV PSSHTGAFLA PLLGR
|
| |