Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_0746 |
Symbol | |
ID | 8322806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 755601 |
End bp | 758456 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644951881 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003109369 |
Protein GI | 256371545 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.38831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACCG CTCCACGGGT CGCGGTCGTC GTCACGGCGT TCCGTCCTGG TCGCTACGTC GACGAGTGTC TCGCAGCCCT TGCGCATCAG AGCTACCCAG ATCTCGACAT CACGGTGGTC GACGGCGCCG GTGACGATCG TGAATTGCGC GACGCCGTGG CCATGGCGGC ACCCAGGGCG CGCATCGTCG ATGGAGGAAG CTGTGGGGGG TACGGAGCCT GCGCGAACGC GGGGGCGCGG GTGCACCCCG ACACACCGTT CCTCGTCTTC GTCCATGACG ACGCCGTGCT CGCGCACAAC GCCATCGCGT CCATGGTCGA GGTGGCCTAT GCCGCCAACG CCGGCATCGT GACGCCGAAG CTGGTCGCCT TCGACGATCC ACGCAGTCTG GTGACGGTCG GGTGGGATCT CGACCCGTTC TTCAACCCGA CCGCTCGGGT GGAGGCGGGC GAGCTCGACC AGGGTCAGCT CGACGAAGTC GTGGACGTGG ACGCCGCACC GGGTGCTGCA ATGTTGGTTC GTCGTGACCT CTTCGACGCC CTCGGTGGCT TCGACGAGGT CTTCGGCCTC ATCGGGGAAG ACGTCGACCT GAGCGTGCGC GCTCGACTCG CTGGTGCTCG CGTCGTCACC GCTCCGCACG CCCGCGTCCG ACATCGAGGG GTGCGCATGG AGCGGCTGCG ACGCGCGCGC CGACGCCGCC AGGTCGGCGC CCAGGTCCTC GAGGAGCGCG TCGGTCTGCA AGATGCCGAG CGGGTCTGGC GCAGGCGCCG CGCCTCGCGC ATCATGATGT CGAGCCTGTA CGACGGCCCG ATCCGCTTGG TGCTGCTCCT CCTGTTTGCC ACGGAGCGCA TCGGCGAGCT GGTCTGGAGG GCCCTCGCGG GTTCGCCGTC GGCGGGTGTC GCGGGACTGC GGGCGCTCGT GCTGTCGCGC GACGACCGCG TGGCGCTCCG GCGTCGTCGT CATGGCATCG GCGAGCGCGC TCGGTCGCAT CGCGTCGTCG CGACGCGCAC CTGGACACCG GGGGAGCGTC TCGTGGCCGT CGGGGCCCAA GCAGGTACTC TCTCACCTGG GCCCACGGAT GCGCCCCCCA AGCGGCTCGC ACGGTGGGTG CCGCGCTCGG GCTCGGCTCG CTGGTTGCTC CTCGTCGCGA TGGTGATTGG CCTCGTCTCA GCGCGAGGTG CGCTGACGGC CTCCTCGCCC GGGGGGAGCC TCCTGGGTGG AGTGAGCGGC CTCGCGTTGC TCGATGCCTG GGTCCACGCT CGGGCCGTTC CGGCGGTCCT CGGGGCCGGC GTGGCGCCGG TGGGGACCGC GTTCATCGGG CTCGTGTCGC TGGTGTTCGG CGGTGACGTC GGACTCACCG TGCGATTGGC GGTACTCGTC GGCGCACTCG TGGCGCCCGT CGCGGTGGCT CGGCTCGCTC GGCGCGAGTT CGACGACGTG CGCTCCAGTG CGCTCGCGCT CGTGTGGTCG CTCGGTCCGG CCTTGGCACT CGGCATCGCC AGGTCCTCTC TGTCGCTCGT CGCTGCGTCG GCGCTCGCAC CGGTGCTCGT CGGCGCCACG CTCGCTGCCA CCCAAGGACA GCGTCTCAGT CCGCGTCGCG CCAAGCGTGC CCAGCGGCGG CTCGGGCTGG TTGCGGCCGT CGCGCTGGCG TTCGCGCCAG AGCTCGTCGC GGTGTGGCTG GTGGTCATCG CTGCCGAGGC GCTGTGGTGG GCCCGTGCGG CAGACGCCTA CCGACTCCGG CGTCTCGGGC GCGCGCTCGG CATCGCGGGC CTCATGGCGA TCGGCATCAA CCTGCCCTGG CTCGCGGCCC TCGTCCTGTG GCATCCTGGG GTCGCTGTCG TGGCCCACGG CGTGCCGGGA GCGCGAGTGC AGTCGCTCGG GGCGCACCTG TTCGGCGGAG GCTACCTCGC TGGCCCAGCA CCGTGGCTGT ACGTGGCGCT CGGTGCGATC GCCCTCGCTG CGGGCGTCGA GCGCAGCGAA CGAGCCAGAC TCGCGGTGGC TCGGGCCGTC ACGGGTCTTG CCCTCAGTGG CGTCGGGGCG CTGGCGATGC TCGGTGGCCT TGGCTCGACG CCGATCCAGC CGGCCTACTT CGACGTGCTC GGTGGCCTCT TGATCGTGCT GGCGGTGCCG ACTGGTGTCA GCGCGGCCCA GGCGTGGCTG CGCCGCAGAC GCCTCGGACT CTGGCATCTC GTGGGGGTGG TTGCGGCGCT CCTCGTCGCC ATCCTCGGCC TTGGTTCGGC GCTGTCGGCG CTCATCGCGC CGGCGGAGCC GATCGCGCTG CCGACCGCCA ACCTCGCGCT CGCTGGCGGG TTCTTCGCCC GTCCGGTGCC GACGTTGTGG ATCGAGGTCG GGCCCGGCGG ACCTGTTGGT GGCGCAGCGG TCGCCGCCAA CGTGACCGTT GCGGTCACCA CGGGTGCGGA GCCGAGCTTC CTCGGCCAGT TCGGTCCGCC GGTGACGGAG GGGTATCGGC GCATCGTGCC CTCGATTCTC GATGCGCTCT CTGGTCACAC GGTGCAACTC GGCGCGGTCC TGCGAAGGCT CGGCCTTGGT GAGGTCGTGG TGCTGGACGC ATCGCAGTCT CCGCTCGCCA GTGCAGTCAC GCTCGGCGTC GAGCGCCAGG TCGACCTGCG TCAGCTCCTC GGAACCTCGT CGATGACTAT CGCGGCGACG ACGGGGGTGC CTCGGCCGGT CGCCGCGGCG GCAACACCGC CGTGGTTCGT GGTGAGCGAG CTCATCGCAG CGGGGGTGCT CGTCTGGTCG GCAGCGAGCG TGTTCGGGAT CGAGGAGCGG ATGGTGAGGC GGCCTCGGCT GCGCATGCCT GCTGCCCCCA CGACCAGGGT CGAGGTGCTC CAGTGA
|
Protein sequence | MSTAPRVAVV VTAFRPGRYV DECLAALAHQ SYPDLDITVV DGAGDDRELR DAVAMAAPRA RIVDGGSCGG YGACANAGAR VHPDTPFLVF VHDDAVLAHN AIASMVEVAY AANAGIVTPK LVAFDDPRSL VTVGWDLDPF FNPTARVEAG ELDQGQLDEV VDVDAAPGAA MLVRRDLFDA LGGFDEVFGL IGEDVDLSVR ARLAGARVVT APHARVRHRG VRMERLRRAR RRRQVGAQVL EERVGLQDAE RVWRRRRASR IMMSSLYDGP IRLVLLLLFA TERIGELVWR ALAGSPSAGV AGLRALVLSR DDRVALRRRR HGIGERARSH RVVATRTWTP GERLVAVGAQ AGTLSPGPTD APPKRLARWV PRSGSARWLL LVAMVIGLVS ARGALTASSP GGSLLGGVSG LALLDAWVHA RAVPAVLGAG VAPVGTAFIG LVSLVFGGDV GLTVRLAVLV GALVAPVAVA RLARREFDDV RSSALALVWS LGPALALGIA RSSLSLVAAS ALAPVLVGAT LAATQGQRLS PRRAKRAQRR LGLVAAVALA FAPELVAVWL VVIAAEALWW ARAADAYRLR RLGRALGIAG LMAIGINLPW LAALVLWHPG VAVVAHGVPG ARVQSLGAHL FGGGYLAGPA PWLYVALGAI ALAAGVERSE RARLAVARAV TGLALSGVGA LAMLGGLGST PIQPAYFDVL GGLLIVLAVP TGVSAAQAWL RRRRLGLWHL VGVVAALLVA ILGLGSALSA LIAPAEPIAL PTANLALAGG FFARPVPTLW IEVGPGGPVG GAAVAANVTV AVTTGAEPSF LGQFGPPVTE GYRRIVPSIL DALSGHTVQL GAVLRRLGLG EVVVLDASQS PLASAVTLGV ERQVDLRQLL GTSSMTIAAT TGVPRPVAAA ATPPWFVVSE LIAAGVLVWS AASVFGIEER MVRRPRLRMP AAPTTRVEVL Q
|
| |