Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_1971 |
Symbol | |
ID | 8324071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 2071341 |
End bp | 2074595 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644953098 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003110548 |
Protein GI | 256372724 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAGTC TCCTGCGCTT CCCCGTCGGT CAGGCGGTTT TTCAGCTTCC AGAACGGCTT ACCGAGGTCA ACTCATGGCA CGGGCACATC CCCTTCGCCT TCTGGATCGT CGAAGCACTC GAGCCGTCGG TCTTCGTCGA GCTCGGTGTC CACCGCGGGG ACTCCTACTT CGCCTTCTGC CAGGCCGTCA AGTCTCTCGG ACTTGACACC AGGTGCTACG GCGTGGACAC CTGGAAGGGC GACGCTCACG CCGGGTTCTA TGGGGAAGAG ATCTACGAAG ACTTCTCTGC GTACAACCGC GAGCACTATC AAGACTTCTC AAAGCCATTA CGCACCACAT TTGCCGAGGC TATCGAGCAG TTCGAGGACG GCAGCATCGA CCTTCTCCAC GTGGACGGCT ACCACACCTA CGAGGCTGTC AGGTCGGACT TTGGATGCTG GCTCCCCAAA CTGAGTGAGC GGGCCGTCGT GCTCTTCCAT GACATCGCAG TCACCGACCG GGGGTTTGGC GTCTGGCGCT TCTGGGAGGA GATCGCTGCG CAGTACCCTT CCTTCGGCTT CATGCATTCC TTTGGTCTCG GAGTGCTCGG CGTCGGCAAG GAGCTGCCAG ACAGCCTAGC TAGCTTCTTC GAAGACGCCA AAGCGAACCC TGAGATCCTG CACTCCTTCT ATGAGGCGCT TGGAACGAGA TGCCAACTCT TCGGCGATCT CCAGCGCGCT CGAGATGAGC TTGCGAATAC CACTGCAGCC CCAGCTATTT CCGAGGAAGT AGCCACCTTG CGCCAACAAG TACTCGACCT CACCTACCGC TACGAACGAG CGCTCGAGCG TAAGGAGGCG GAGGCCGAAC AGTTAGAAGC CAAGGTGGTA GACCTCGAGG CACGCCTTGG CCAGTCGTCG CACTCGGCCG CAGCCCTCGC AGAACACCTT GCGTCAGTCA CGGCTCAACG CGACGAGATC CTCCAATCGG AGACCTGGAA ACTCACCGCT CCCGCTCGCG GCTTCCTCTG GTGGATGCGT CGAGTGGCGA ACTGGAGACG CTTCACACAG ACGTTCCGTG TCACACTGCA GCCCCTTCAG GGTGTTGGCG AGTCACTCTT CGAGACCGAC AGCTTCGTGG CTCTCGGTGG CAGAATGCGC TTTGCAATCG AAGGCGCACC TCGGCCTCCC GGGTGGTACG AGCTGACTTG CACGGTCACC ACCACCTCCG ACCTCTCGAA AATGCGCCCC TACATCATCA CCGAGACACA AGACCACCAA CGCTACAGCC AACAGATTCC AGGCAAGGTC GACCCAGAGG GCCAGATCCG AGTCCTGTTC CACGTCAACA AGCAAGCTGC TCGCCACGAG CTGCTGCTCG TGGGCCTCAA CGGGATCACC TCCATTTCCG CACCACGCGT CAAGCCAGCG CTGCACCTTG GCGAGCCGAT GGCTCGCATA CTTGCCGCCG CGATTGTTCC CACAATCGCG TACGCAGACC TGCCACCCAT CGCAGCTCAT CAAGAGCTCC TCCCGCAGGA GGACGAGTTC TCCCGCTGGA TCGAACGCAA CGAGCGGATC AATCAAGACG ACCGAGAGCG CGTCGCCCGA GAGCTCGCCA CCTGGGAACA CCCACCACTG ATCTCCGTAC TCATGCCCGT CTACAACACG CCAATACGGC ACCTCGTAAC CGCGATCGAG TCGGTACGGG CGCAGTGGTA TCCCCACTGG GAACTGTGCA TCGCCGACGA TGCTTCAACC GATCCAGAGA TTCGGCCGAT CCTTACCCGC TATCAGGAGG CGGATCCCCG CATCAAGGTC GCCTTCCGAG ACGAGAATGG CGGTATCTCG GCAAACTCGA ATACTGCACT CACGCTCGCG AACGGCAAGT TTGTTGCATA TCTGGATGCC GACGACGAAA TCTCCGAAGT CGCACTCCTC CACTACGCTC GAGAGATTCA TGAGTATCCT GGAGTCGAGT TGCTCTTCTG CGACGAAGAC AAGATCACTG AAGATGGTGA TCGCTCCGAC CCCTACTTCA AGCCGAGCCT CTCCCCCGCG CTCCTTCTCG GGAAGAACTG CGTCACTCAC CTCGGCGTCT ATCGGACCGA CACCGTCCGC CGCCTCGGCG GAATGCGCTC AGAGTTTGAC GGATCCCAGG ACTGGGACCT AGCACTACGC TTTCTTCCGA TCGTCGGTAT AGACTTCACG CGCGCACGTC GCATCCCGCG GCTGCTCTAT CATTGGCGGC GGATCCATGG CTCGACGGCA ACCACACTCC GATCGAAGAG CTGGGCCGTT CTGGCGGGGC GGCATGCCGT GCAAGACTAC TTGGATACAG CAGTGCCAGG TGCGAAGGCT GAGCCCATTC CGCGCGCCTC CAATCTCAAC CGACTCGTTC TCCCGACTCC GGATCCAGCT CCATTAGTTT CGATTCTTCT GCCTACCGCG GGAAACTATC AGCTACTTCG CGGCTGCCTC AGCTCGCTCC TCGAACGCAC CGACTACCCC CGCTTCGAAG TGCTTATCAC GATCGACTCC GACAACCCCG ACGCGGACTC GCTCGCATAC CTTGACACCC TTGAGCACAC CGGAAAGGTC CGGGTGATCC GGCGTCGGCG CCCGCCTGGC GAAACCTTTA ACTACTCCCG GATAGTCAAC AACCTCGCTC GCTACGCAGC AGCTGACCTC CTCCTGCTCC TCAACGACGA CACCGAGGTC ATCAACGCTG GTTGGCTCAC CGAGATGGTC GCGGTACTAT CGCTGCCCGA CGTGGGCGTC GTTGGCGCGC ACCTCTACTA CGCCGACGGC AGTATCCAGC ACGCGGGGGT GATGACTGGA CACCACAGGG CGCTACATCT TTACAGCGGG CTGCCGGGCG CAAGCTGGGG ATACTATGCA GACTTACTAC TCGCGCGCAA CGTGAGTGCG GTCACTGGTG CTTGTCTCCT GACCTCGCGT CGAGTCTGGG ACGAGGTCGG TGGGTTGGAC GAGCAGCTTG CGGTCAGTTT CAACGACGTC GCCTACTGCC GCGCAGCGGG CGCACTGGGA TATCAGATCA TCGTCACCCC GCATGCCAGG CTCAAGCATT TCGAGTCAGT GACCCGAGGC TTCGACGACT TGACGTTGCC CCGCAGGTCA CGGCTCGCAT CAGAATTTCA GCGACTCGCC ACACTGTTCC CCGACATTGC CGCAGCTGAC CCGTTCTACA ACCCCAACCT CGTCCCCGAA GGGCAATTTC GGCTTCAGTA TGAATCGCCG ATACCGGTGG TCTAA
|
Protein sequence | MPSLLRFPVG QAVFQLPERL TEVNSWHGHI PFAFWIVEAL EPSVFVELGV HRGDSYFAFC QAVKSLGLDT RCYGVDTWKG DAHAGFYGEE IYEDFSAYNR EHYQDFSKPL RTTFAEAIEQ FEDGSIDLLH VDGYHTYEAV RSDFGCWLPK LSERAVVLFH DIAVTDRGFG VWRFWEEIAA QYPSFGFMHS FGLGVLGVGK ELPDSLASFF EDAKANPEIL HSFYEALGTR CQLFGDLQRA RDELANTTAA PAISEEVATL RQQVLDLTYR YERALERKEA EAEQLEAKVV DLEARLGQSS HSAAALAEHL ASVTAQRDEI LQSETWKLTA PARGFLWWMR RVANWRRFTQ TFRVTLQPLQ GVGESLFETD SFVALGGRMR FAIEGAPRPP GWYELTCTVT TTSDLSKMRP YIITETQDHQ RYSQQIPGKV DPEGQIRVLF HVNKQAARHE LLLVGLNGIT SISAPRVKPA LHLGEPMARI LAAAIVPTIA YADLPPIAAH QELLPQEDEF SRWIERNERI NQDDRERVAR ELATWEHPPL ISVLMPVYNT PIRHLVTAIE SVRAQWYPHW ELCIADDAST DPEIRPILTR YQEADPRIKV AFRDENGGIS ANSNTALTLA NGKFVAYLDA DDEISEVALL HYAREIHEYP GVELLFCDED KITEDGDRSD PYFKPSLSPA LLLGKNCVTH LGVYRTDTVR RLGGMRSEFD GSQDWDLALR FLPIVGIDFT RARRIPRLLY HWRRIHGSTA TTLRSKSWAV LAGRHAVQDY LDTAVPGAKA EPIPRASNLN RLVLPTPDPA PLVSILLPTA GNYQLLRGCL SSLLERTDYP RFEVLITIDS DNPDADSLAY LDTLEHTGKV RVIRRRRPPG ETFNYSRIVN NLARYAAADL LLLLNDDTEV INAGWLTEMV AVLSLPDVGV VGAHLYYADG SIQHAGVMTG HHRALHLYSG LPGASWGYYA DLLLARNVSA VTGACLLTSR RVWDEVGGLD EQLAVSFNDV AYCRAAGALG YQIIVTPHAR LKHFESVTRG FDDLTLPRRS RLASEFQRLA TLFPDIAAAD PFYNPNLVPE GQFRLQYESP IPVV
|
| |