Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_1895 |
Symbol | |
ID | 5160442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | - |
Start bp | 2088916 |
End bp | 2091777 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 640553816 |
Product | glycosyl transferase family protein |
Protein accession | YP_001235015 |
Protein GI | 148260888 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.207282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCGACCT TGCCGCCGGG CCGGAACCCG CCGCCAGCCG TGCCGCCGGG GACTGACGCC GCCTTCGCGC GCGTGCGCGA CCGCCGCAAC GACGAAGCGG TGGACGCCTG GCGCCGCGCC CGCCTCGCCA TCGCCGCCGG CGATCCGGCG GAGGCGGCGT TCTGGCTCGA CCGCGTCTGC CGCCTCGCGC CGGATGACGG CCGCCCGCGT TACGAGCGCG CCCTCCTGCG CGCCGCGACC GGCGATCCCG AGGCCGCCGC CGCCCTCGCC GACATCGCCA CCCGCTTCGA CCATGCCGGG GCCTGGATCG CGCTCGCCCG CCTGCGCCAT GCCGCCGGCG CGCATGACGA GGCCGCCGCC GCTCTCGCCG AACTCCTCGC CCGCCACGAG GTTCCGCCGG CGCTCGACGA TGGCGGCCGG TTCGAAACCC TCGCCACCGC CATCGCCCGC GCCGCCGGGT TCGCCGGCTG GCGCGGCATC GCCGAGGGCC GGCGCGTCGG CGACGGTCCG CACGGCGCGA ATCTGCTCGG CGCGCCCGAC CTCGCCGCGA TCCGCCATCA CGACGGCATC GCCATGCCGG CGGGGCAGGG GATCGAGGGC TGGGTCGCCC ATCCCGCCGC CCCCTGGCGG GTGCCGCGCC TCACCCTGGT CGACCGCGCC GGCACCCGCC GCGCCCTCAC CCCGCGCCGC CGCCTCGGCC CGTCCGACCA GCGGCCCTTC GTCGAGCGCC ACGGCTTCCG CATCGGCCCG GCGCGCCTCG CCGGTCTCGT CCCGCCCTTC ATCCTCGAAA CCGCGTCCGG CGCCGCCCTC GCCGGCACGC CGATCGACCC CGCCGCCCTC GCCGCGCTGA AGCCGGCCCG CATCCGCCGC CGCCGCCCCG TCCTGCCGCC GCGCGCCCCG CTCTGCGCCG TCATGCCCGC CTATCGCGAC GCGGCGGCGA CGGCGGATGC CATCGCCTCC TTCCGCCGCG CCGCCCCGCG CTGCCCGCTG GTGGTGGTGG ACGATGCCTC GCCCGAGCCC GCCCTCGTCC GCCTGCTCGA TGAAGCGGCC GCGGCCGGGC GCATCGCGCT CCGCCGCCAC CGCCGCAACC GCGGCTTCCC CGCCGCCGCC AACGCCGGCA TCGCCGCCGC CCGCCGCCTC GTCCCCGGCT GCGACATCCT GCTGCTCAAC GCCGACATCC TGCTGCCGCC GCGCGCCCCC GCCCGGCTGC ACGCCGCGCT CTACCGCGCG CCCGACATCG CCAGCGTCAC CCCGCTCTCG AACGAGGCGA CGATCATGAG CTACCCGGCC CGCCAGGGCG GCAACCCCGC GCCGGACCTC GCCGCCACCA CCCGGCTCGA CGCGCTGGCC CGCCGCGTCA ACGGCGCCGC CCTGGTCGAC ATCCCGACCG CGATCGGCTT CTGCATGGCG ATCCGGCACG ACGCGCTCGA CGCCATCGGC GGGTTCGACG CCACGCTCTT CGCCCAGGGC TACGGCGAGG AGAACGACTG GTGCCGCCGC GCCGCCCTGG CCGGCTGGCG CCATGCCGGC GCGCCCGGCA TCTTCGTCGC CCATCGCGGC GGCGCCTCCT TCGGCGCCAC CACCGCCGCG CTCTGCGCCC GCAACCTCGC CGTCCTCGAA CGCCGCCATC CCGGCTACGC CGCCCTCATC GCCGCCCATC ACGAGGCCGA TCCGCTGCGC GACGCCCGCC GCCGGCTCGA CCTTGCCCGC TTCGCCGCCG GCCGCGTCCG CCACGGCGCG GTCGCCCTCG TCGCGCATGA CCACGGCGGC GGCATCGCCC GCGTCGTCGC GGCGCGGATG GCGGCGCTGC GGGAGGCCGG GCTGCGGCCG ATCCTGCTCA GCCCCGACCT GATCCGCGCG GACGGCTCGA TCGCCGGCAC CGGCCCCGCC CGCCTCGCCG ATGGCGGCGC CGAACCGTTC CCCAACCTCG TCTTCGATCC CCGCACCGGG ATGCGCGCCC TCGTCGCCCT GCTCCGGCGC GAGGGGGTGC GCCATGTCGA GTTCCACCAC ACGCTCGGCC ACGACCCGGC GATCCTCGCC CTGCCGGCCC GGCTCGGCGT CCCGGCGGAT CACGTGGTCC ACGACTACGC CCATTTCTGC AAGCGGGTGA ACCTGATCGG CCCGGCCAGG CGCTATTGCG GCGAACCCGG CATCGACGGC TGCGAAACCT GCCTCGCCGC CGCCGGCAGC GCCTTGGCCG ACCCGATCGC GCCCGCCGCC CTGCGCGCCC GCTCCGCCGC CGCCTTCGCC GCCGCCCGCC GCGTCGCCGC CCCCTCGGCC GATGCCGCCG CCCGCCTGCG CCGCCATTTC CCCGGCCTCG CCGTCGCCGT CACCCCCTGG GAGGACGATG ACGACCTGCC GCCGCCCCGC CCGCCCGCGC CCGGCGCGCC CCGCCGGATC GCGCTCGCCG GCGGCATCGG CCCCGCCAAG GGGTTCGACG TCCTGCTCGC CTGCGCGCGC GATGCCGCCG CCCGCGCCCT GCCGCTCTCC TTCGTCCTCG CCGGCACCAG CGAGGACGAT GCCGCGCTGA TCGCCACCGG CCGCGTCCGC GTCACCGGCG CCTATGAGGA GGGCGAGGCC CAGGCCCTGC TGCGCGATGC CGGCGCCGCC CTCGGCTTCC TGCCCTCGAT CTGGCCGGAA ACCTGGTGCT TCGCCCTCGG CGAGCTCTGG CGCGCCGGGC TCTACGTCCT CGGTTTCGAC ATCGGCGCCC CGGCCGCCCG CATCCGCGCG ACCCGGCGGG GCGACGTCCT GCCCCTCGGC CTGCCGCCGG CCCGCATCAA CGACATCCTG CTCGCCTGGC AACCGCCCGC GCCCGCCGCC CGTCCTCCGG CCCTGACGGC CGCATCCCTT TCCCCCGCCT GA
|
Protein sequence | MSTLPPGRNP PPAVPPGTDA AFARVRDRRN DEAVDAWRRA RLAIAAGDPA EAAFWLDRVC RLAPDDGRPR YERALLRAAT GDPEAAAALA DIATRFDHAG AWIALARLRH AAGAHDEAAA ALAELLARHE VPPALDDGGR FETLATAIAR AAGFAGWRGI AEGRRVGDGP HGANLLGAPD LAAIRHHDGI AMPAGQGIEG WVAHPAAPWR VPRLTLVDRA GTRRALTPRR RLGPSDQRPF VERHGFRIGP ARLAGLVPPF ILETASGAAL AGTPIDPAAL AALKPARIRR RRPVLPPRAP LCAVMPAYRD AAATADAIAS FRRAAPRCPL VVVDDASPEP ALVRLLDEAA AAGRIALRRH RRNRGFPAAA NAGIAAARRL VPGCDILLLN ADILLPPRAP ARLHAALYRA PDIASVTPLS NEATIMSYPA RQGGNPAPDL AATTRLDALA RRVNGAALVD IPTAIGFCMA IRHDALDAIG GFDATLFAQG YGEENDWCRR AALAGWRHAG APGIFVAHRG GASFGATTAA LCARNLAVLE RRHPGYAALI AAHHEADPLR DARRRLDLAR FAAGRVRHGA VALVAHDHGG GIARVVAARM AALREAGLRP ILLSPDLIRA DGSIAGTGPA RLADGGAEPF PNLVFDPRTG MRALVALLRR EGVRHVEFHH TLGHDPAILA LPARLGVPAD HVVHDYAHFC KRVNLIGPAR RYCGEPGIDG CETCLAAAGS ALADPIAPAA LRARSAAAFA AARRVAAPSA DAAARLRRHF PGLAVAVTPW EDDDDLPPPR PPAPGAPRRI ALAGGIGPAK GFDVLLACAR DAAARALPLS FVLAGTSEDD AALIATGRVR VTGAYEEGEA QALLRDAGAA LGFLPSIWPE TWCFALGELW RAGLYVLGFD IGAPAARIRA TRRGDVLPLG LPPARINDIL LAWQPPAPAA RPPALTAASL SPA
|
| |