Gene Afer_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_1971 
Symbol 
ID8324071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp2071341 
End bp2074595 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content60% 
IMG OID644953098 
Productglycosyl transferase family 2 
Protein accessionYP_003110548 
Protein GI256372724 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAGTC TCCTGCGCTT CCCCGTCGGT CAGGCGGTTT TTCAGCTTCC AGAACGGCTT 
ACCGAGGTCA ACTCATGGCA CGGGCACATC CCCTTCGCCT TCTGGATCGT CGAAGCACTC
GAGCCGTCGG TCTTCGTCGA GCTCGGTGTC CACCGCGGGG ACTCCTACTT CGCCTTCTGC
CAGGCCGTCA AGTCTCTCGG ACTTGACACC AGGTGCTACG GCGTGGACAC CTGGAAGGGC
GACGCTCACG CCGGGTTCTA TGGGGAAGAG ATCTACGAAG ACTTCTCTGC GTACAACCGC
GAGCACTATC AAGACTTCTC AAAGCCATTA CGCACCACAT TTGCCGAGGC TATCGAGCAG
TTCGAGGACG GCAGCATCGA CCTTCTCCAC GTGGACGGCT ACCACACCTA CGAGGCTGTC
AGGTCGGACT TTGGATGCTG GCTCCCCAAA CTGAGTGAGC GGGCCGTCGT GCTCTTCCAT
GACATCGCAG TCACCGACCG GGGGTTTGGC GTCTGGCGCT TCTGGGAGGA GATCGCTGCG
CAGTACCCTT CCTTCGGCTT CATGCATTCC TTTGGTCTCG GAGTGCTCGG CGTCGGCAAG
GAGCTGCCAG ACAGCCTAGC TAGCTTCTTC GAAGACGCCA AAGCGAACCC TGAGATCCTG
CACTCCTTCT ATGAGGCGCT TGGAACGAGA TGCCAACTCT TCGGCGATCT CCAGCGCGCT
CGAGATGAGC TTGCGAATAC CACTGCAGCC CCAGCTATTT CCGAGGAAGT AGCCACCTTG
CGCCAACAAG TACTCGACCT CACCTACCGC TACGAACGAG CGCTCGAGCG TAAGGAGGCG
GAGGCCGAAC AGTTAGAAGC CAAGGTGGTA GACCTCGAGG CACGCCTTGG CCAGTCGTCG
CACTCGGCCG CAGCCCTCGC AGAACACCTT GCGTCAGTCA CGGCTCAACG CGACGAGATC
CTCCAATCGG AGACCTGGAA ACTCACCGCT CCCGCTCGCG GCTTCCTCTG GTGGATGCGT
CGAGTGGCGA ACTGGAGACG CTTCACACAG ACGTTCCGTG TCACACTGCA GCCCCTTCAG
GGTGTTGGCG AGTCACTCTT CGAGACCGAC AGCTTCGTGG CTCTCGGTGG CAGAATGCGC
TTTGCAATCG AAGGCGCACC TCGGCCTCCC GGGTGGTACG AGCTGACTTG CACGGTCACC
ACCACCTCCG ACCTCTCGAA AATGCGCCCC TACATCATCA CCGAGACACA AGACCACCAA
CGCTACAGCC AACAGATTCC AGGCAAGGTC GACCCAGAGG GCCAGATCCG AGTCCTGTTC
CACGTCAACA AGCAAGCTGC TCGCCACGAG CTGCTGCTCG TGGGCCTCAA CGGGATCACC
TCCATTTCCG CACCACGCGT CAAGCCAGCG CTGCACCTTG GCGAGCCGAT GGCTCGCATA
CTTGCCGCCG CGATTGTTCC CACAATCGCG TACGCAGACC TGCCACCCAT CGCAGCTCAT
CAAGAGCTCC TCCCGCAGGA GGACGAGTTC TCCCGCTGGA TCGAACGCAA CGAGCGGATC
AATCAAGACG ACCGAGAGCG CGTCGCCCGA GAGCTCGCCA CCTGGGAACA CCCACCACTG
ATCTCCGTAC TCATGCCCGT CTACAACACG CCAATACGGC ACCTCGTAAC CGCGATCGAG
TCGGTACGGG CGCAGTGGTA TCCCCACTGG GAACTGTGCA TCGCCGACGA TGCTTCAACC
GATCCAGAGA TTCGGCCGAT CCTTACCCGC TATCAGGAGG CGGATCCCCG CATCAAGGTC
GCCTTCCGAG ACGAGAATGG CGGTATCTCG GCAAACTCGA ATACTGCACT CACGCTCGCG
AACGGCAAGT TTGTTGCATA TCTGGATGCC GACGACGAAA TCTCCGAAGT CGCACTCCTC
CACTACGCTC GAGAGATTCA TGAGTATCCT GGAGTCGAGT TGCTCTTCTG CGACGAAGAC
AAGATCACTG AAGATGGTGA TCGCTCCGAC CCCTACTTCA AGCCGAGCCT CTCCCCCGCG
CTCCTTCTCG GGAAGAACTG CGTCACTCAC CTCGGCGTCT ATCGGACCGA CACCGTCCGC
CGCCTCGGCG GAATGCGCTC AGAGTTTGAC GGATCCCAGG ACTGGGACCT AGCACTACGC
TTTCTTCCGA TCGTCGGTAT AGACTTCACG CGCGCACGTC GCATCCCGCG GCTGCTCTAT
CATTGGCGGC GGATCCATGG CTCGACGGCA ACCACACTCC GATCGAAGAG CTGGGCCGTT
CTGGCGGGGC GGCATGCCGT GCAAGACTAC TTGGATACAG CAGTGCCAGG TGCGAAGGCT
GAGCCCATTC CGCGCGCCTC CAATCTCAAC CGACTCGTTC TCCCGACTCC GGATCCAGCT
CCATTAGTTT CGATTCTTCT GCCTACCGCG GGAAACTATC AGCTACTTCG CGGCTGCCTC
AGCTCGCTCC TCGAACGCAC CGACTACCCC CGCTTCGAAG TGCTTATCAC GATCGACTCC
GACAACCCCG ACGCGGACTC GCTCGCATAC CTTGACACCC TTGAGCACAC CGGAAAGGTC
CGGGTGATCC GGCGTCGGCG CCCGCCTGGC GAAACCTTTA ACTACTCCCG GATAGTCAAC
AACCTCGCTC GCTACGCAGC AGCTGACCTC CTCCTGCTCC TCAACGACGA CACCGAGGTC
ATCAACGCTG GTTGGCTCAC CGAGATGGTC GCGGTACTAT CGCTGCCCGA CGTGGGCGTC
GTTGGCGCGC ACCTCTACTA CGCCGACGGC AGTATCCAGC ACGCGGGGGT GATGACTGGA
CACCACAGGG CGCTACATCT TTACAGCGGG CTGCCGGGCG CAAGCTGGGG ATACTATGCA
GACTTACTAC TCGCGCGCAA CGTGAGTGCG GTCACTGGTG CTTGTCTCCT GACCTCGCGT
CGAGTCTGGG ACGAGGTCGG TGGGTTGGAC GAGCAGCTTG CGGTCAGTTT CAACGACGTC
GCCTACTGCC GCGCAGCGGG CGCACTGGGA TATCAGATCA TCGTCACCCC GCATGCCAGG
CTCAAGCATT TCGAGTCAGT GACCCGAGGC TTCGACGACT TGACGTTGCC CCGCAGGTCA
CGGCTCGCAT CAGAATTTCA GCGACTCGCC ACACTGTTCC CCGACATTGC CGCAGCTGAC
CCGTTCTACA ACCCCAACCT CGTCCCCGAA GGGCAATTTC GGCTTCAGTA TGAATCGCCG
ATACCGGTGG TCTAA
 
Protein sequence
MPSLLRFPVG QAVFQLPERL TEVNSWHGHI PFAFWIVEAL EPSVFVELGV HRGDSYFAFC 
QAVKSLGLDT RCYGVDTWKG DAHAGFYGEE IYEDFSAYNR EHYQDFSKPL RTTFAEAIEQ
FEDGSIDLLH VDGYHTYEAV RSDFGCWLPK LSERAVVLFH DIAVTDRGFG VWRFWEEIAA
QYPSFGFMHS FGLGVLGVGK ELPDSLASFF EDAKANPEIL HSFYEALGTR CQLFGDLQRA
RDELANTTAA PAISEEVATL RQQVLDLTYR YERALERKEA EAEQLEAKVV DLEARLGQSS
HSAAALAEHL ASVTAQRDEI LQSETWKLTA PARGFLWWMR RVANWRRFTQ TFRVTLQPLQ
GVGESLFETD SFVALGGRMR FAIEGAPRPP GWYELTCTVT TTSDLSKMRP YIITETQDHQ
RYSQQIPGKV DPEGQIRVLF HVNKQAARHE LLLVGLNGIT SISAPRVKPA LHLGEPMARI
LAAAIVPTIA YADLPPIAAH QELLPQEDEF SRWIERNERI NQDDRERVAR ELATWEHPPL
ISVLMPVYNT PIRHLVTAIE SVRAQWYPHW ELCIADDAST DPEIRPILTR YQEADPRIKV
AFRDENGGIS ANSNTALTLA NGKFVAYLDA DDEISEVALL HYAREIHEYP GVELLFCDED
KITEDGDRSD PYFKPSLSPA LLLGKNCVTH LGVYRTDTVR RLGGMRSEFD GSQDWDLALR
FLPIVGIDFT RARRIPRLLY HWRRIHGSTA TTLRSKSWAV LAGRHAVQDY LDTAVPGAKA
EPIPRASNLN RLVLPTPDPA PLVSILLPTA GNYQLLRGCL SSLLERTDYP RFEVLITIDS
DNPDADSLAY LDTLEHTGKV RVIRRRRPPG ETFNYSRIVN NLARYAAADL LLLLNDDTEV
INAGWLTEMV AVLSLPDVGV VGAHLYYADG SIQHAGVMTG HHRALHLYSG LPGASWGYYA
DLLLARNVSA VTGACLLTSR RVWDEVGGLD EQLAVSFNDV AYCRAAGALG YQIIVTPHAR
LKHFESVTRG FDDLTLPRRS RLASEFQRLA TLFPDIAAAD PFYNPNLVPE GQFRLQYESP
IPVV