Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_1033 |
Symbol | |
ID | 8323097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 1067388 |
End bp | 1069154 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644952160 |
Product | protein of unknown function DUF1446 |
Protein accession | YP_003109644 |
Protein GI | 256371820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.62804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCC CGGTTCGCAT CGCCAACGTG AGTGGCTTCT GGGGCGATCG CGTGGCCGCC GCGCGCGAGA TGGTCGACGG CGGGCCGGTA GATGTGTTGA CGGGGGACTG GCTCGCCGAG CTCACCATGG TCATCCTCGC GCGCCAGCGG GCCCGAGATC CGCTGGCCGG CTTTGCGACC TCCTTTCTCA CCCAGGTCGA GGACGTGCTC GGCACCTGTC TCGATCGCGG GATCCGCTTC GTCGCCAACG CAGGCGGTCT CGCACCCGAG CGTTGCGCAG CGGCGGTGGA GGCGATCGCA GCCCGGCTCG GACTCGCACC ACGCGTGGCG TTCGTCGACG GCGACGACCT CGCAGGTTCG CTCGAGGCGA TCGCAGCCGG CGGGACGCCA CTCGTGCACG CCGAGACGGG AGAGGCCCTC GGTCAGCGGA CGGTCCTCAC GGCGAACGCC TACCTCGGGG GCTGGGGCAT CCGCACCGCG CTCGACGACG ACGCCGACGT CGTCGTCACT GGACGCGTCG CCGACGCATC GCTCGTCGTA GGGGCGGCAG CGTGGCACCA CGGTTGGGCT CGCACCGATT TCGACCGAAT CGCCGGCGCC ATCGTGGCTG GGCACGCGAT CGAGTGCGGC ACCCAGGTCA CCGGGGGCAA CTACGCCTTC TTCGACGAGA TCCCGAACGC CCTCCACCTC GGCTTCCCGA TCGCCGAGGT CGCAGACGAC GGGAGTGCCA TCATCACCAA GCACCCCGGT CACGGTGGTG CGGTCACCGT CGGGACCGTG ACCGCACAGC TGCTCTACGA GATCGGGGCG CCGAGCTACG TCGGACCGGA TGCGATCGCG CGCTTCGACA CGATCGTGCT CGAGGACCTC GGACACGATC GAGTCCGCAT TCACGGCGTG CGCGGGGAGC CACCTCCCCG AACCGCCAAG GTTGGGCTCG CCGTCGCAGA CGGATGGCGC CTCCGCCTCG GGGTTGCGAT CACCGGACTC GACGTCGAGG CCAAGGCCGC TCTCTTCGAG CGGCAGCTGC GCGCAGCCAC CGAGGGTCTC GGGCTGCGTC GACTCGAGGC GCACCTCGTG CGCACCGACA AGCGCGATCC GCGCTCCAAT GAGGAGGCGA CCGCCACTCT CGAGATCTCC GCCGACGCCG ACGACGAGGC GGTGGTCGGG CGAGCTCTCC GCGCTCGCGT GACCGAGCTC GCACTCGCGT CCTTCCCGGG CCTCTGGGTG AGAGGGGTCA CGGCCCGCCC AGAGCCCTTG GTGCGCTTCT GGCCGACCTT CGTCGGCTGG GAATGGATTC ACGAACGCGT CACCACGCCA TCGGCGACGA TCACGCTGGA GCCGCCGCCG TGGAGCGACG GCGCGCATCG CGAGGTCGCC CAACCCGAGC CCGCCGTCGA TGCGACCGTC CAGCCGAGCG ATGCGTTCGG CCCGTGCCGG ATCGTGCCGC TCGGTCGGCT CGTCGGAGCG CGCTCGGGCG ACAAGGGGGG CTCGGCGAAC CTCGGGGTGT GGGCCAGAAC GGAGGAAGCC TACGCCTGGC TCGCTGGCTT CTTGGACGTC GACGAGCTCC GAAGGCTGCT GCCGGAGGTT GCGCCGCTCC AGGTCGAGCG CTGTGCACTC CCCAACCTGC GAGCGCTCAA CTTCGTCATC CATGGACTGC TCGGGGAGGG CGCCTCAGCA CCGCGACGGG CGGATGCGCA GGCCAAGAGT CTTGGCGAAT GGCTGCGTGC CCGACTCGTC CCTATTCCGC TACGGTTCCT CGTGTGA
|
Protein sequence | MARPVRIANV SGFWGDRVAA AREMVDGGPV DVLTGDWLAE LTMVILARQR ARDPLAGFAT SFLTQVEDVL GTCLDRGIRF VANAGGLAPE RCAAAVEAIA ARLGLAPRVA FVDGDDLAGS LEAIAAGGTP LVHAETGEAL GQRTVLTANA YLGGWGIRTA LDDDADVVVT GRVADASLVV GAAAWHHGWA RTDFDRIAGA IVAGHAIECG TQVTGGNYAF FDEIPNALHL GFPIAEVADD GSAIITKHPG HGGAVTVGTV TAQLLYEIGA PSYVGPDAIA RFDTIVLEDL GHDRVRIHGV RGEPPPRTAK VGLAVADGWR LRLGVAITGL DVEAKAALFE RQLRAATEGL GLRRLEAHLV RTDKRDPRSN EEATATLEIS ADADDEAVVG RALRARVTEL ALASFPGLWV RGVTARPEPL VRFWPTFVGW EWIHERVTTP SATITLEPPP WSDGAHREVA QPEPAVDATV QPSDAFGPCR IVPLGRLVGA RSGDKGGSAN LGVWARTEEA YAWLAGFLDV DELRRLLPEV APLQVERCAL PNLRALNFVI HGLLGEGASA PRRADAQAKS LGEWLRARLV PIPLRFLV
|
| |