Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_2183 |
Symbol | |
ID | 5161348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 2409226 |
End bp | 2412213 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640554105 |
Product | glycosyl transferase family protein |
Protein accession | YP_001235300 |
Protein GI | 148261173 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.610183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCCAGC CCGCCCATCT CCTATTGCGC CTCCCCGCCG AGCAACGACC CGCCTGGGCG CTCTTCGATC CGGTCTGGTA CGCCCTCCGC CATCCGGACA TCGCCGCCGG GCGCGACGAG GCCGCCCTGC TCGAATACTA CCTCGCCGTC GGCGGCCGGC TGGGGCACAG CCCGAGCCCG CTCTTCGACG AAGGCTTCTA CCGCGCCAGC AACCCCGACA TCGCGGCGCT GGTCGCCGAG GGCGGCTACG CCTCGGGCTT CGATCATTTC TGCCAGTACG GCCATCGCGG CCTCTCGCCG CACTGGCTGT TCGACGACAC GCTCTACGGC GACCTGCACG CCGACATGTC GCTCGAGAAT CTCGACCGCC ACCGCTGCTT CGGCCGCTAC GACCACTGGC TGAAATCCGG CCAGCGCGAG CAGCGGATCT GCCACTTCCT GTTCGACAGC GCGTTCTACC GCCGCGAGGT CGAGGCGGCG GGAGAGGGCG CGGCGCGGCT CGAGTCCATG GGCCCCTTCG TCCACTATCT CTACGCGCTC TGGGACGACA CGCCCGAACG CGCCACCTCG CCCTATTTCG ACCCCGCCTG GTACGCCGCG ATGCACGAAT CCGCGCGCGA CGACATCGCC GAGGGCCGCG CCCGCAACGC GCTGCACCAC TACATCACGC GCGGCGAGGC CGCCGGCTTC GACCCGGTGC CGGAATTCTC CGAGCGCTAC CATCTCGACA CCTATGACGA CATCGCCGCC GCCGTCGAGG CCGGCGTGTT CCGCAGCGGC TACCATCATT TCGTCCGCTA CGGCACGCTC GAACTACGCC GCCCCCGCGC CGACATCGAC CTGATCTACT ACCGCGACAC CAACCCCCGC GTCCGCAACG ATCTCGCCGC CGGCGCGGTG CGCGACGCCT TCGCCCATCT GCGCACGATC GGCCTGCGCG AGGGGCTCGA CCACTGCCCG CCCGAGCGCG TGCCGGACCT GCCCGAACGC GCCGCCCGCC AGCTCTTCGA ACTGCGCGCG CGCAACCAGT CCGCCCTGTT CGCCCGCCAG CGGATCGACT TCGCTCCGTC CGCCCCGCCC GCGCTTTCGG TCATCATGGT GCTGCACGAG AAATTCGAGC TGACCATGCA GGCGCTCGCC TCCCTGCGCG CCAACTTCCC CGGCGATCTC GACCTCATCC TGATCGACAA CGGCTCGACC GACGCCACCG CCCGGATCGA GACCTATGTC CGCGGCGCGA CCATCCTCCG CGAGCCCGGC AATCCCGGCT TCCTGCTCGC CTGCAATCGC GGCCTGCGCG AGGTCCGCGC CCCGGCGGTG CTTTACCTCA ACAACGATGT CGAGCTCGGC TTCGGCGCCA TCGCCGCCGC CCTGCGCCGC CTCGCCTCGG CACCCGATAT CGGCGCCGTC GGCGGCAAGG TCATCCGCAC CCATGGCCGC CTGCAGGAAG CCGGCTCGAT CGTCTGGGCC GACGGCTCGA CCCAGGGCTA CCTGCGCGAC GCCTCGCCGC TGGCCCCCGA GGCCAATTTC GTCCGCGACG TCGATTTCTG CTCCGGCGTC TTCCTGCTCT GCCGCACCGA TCTCGTGCGC CGCCTCGGCG GCTTCGACGA AGCCTTCCGC CCCGCCTATT ACGAGGAGGT CGATCTCTGC GTGCGGATGA TCGAGGCCGG CTTCCGCGTC ATCTACGACC CGGACGTCAC CATCCACCAC CTCGAATACG GCAGTTCGAG CAACGCCGAG GCGGCGATGC AGCAGATGCG CCGCAACCGC CGCGTCTTCG TCCGCAAGCA CAAGGCGTTC CTGCACTTCC AGCAGGCGCC GGCGCCGGGC GGCGAAATCC GCGCCCGCGC GCGCAACGGC GCCGGACGGC GGCTGCTGTT CCTCGAGGAT ACCGTGCCGC TCCGCCGCCT CGGCTCCGGC TTCGTCCGCG CCAACGACGT GGTGCACGCC ATCGCCGATG CCGGCTGGCA GGTCTCGGTC CTGCCGGTGA ACGGCGCGCG GCACGACCTG ATGAGCCAGT TCGGCGACCT GCCCGAAACC GCCGAGGTGC TGCACGACCG CTCGATCATG ACCCTGCCGG CCCTTCTCGC CGAGCGCCCC GGCTTCTACG ACGCGGTCTG GATCTCCCGC ACCCACAACC TGATGCGCAC CGCCCCGATC TTCGAGGCGG CGGGGATCGA TCCGGCCACA ACGCCCTTCG TGGTCGATAC CGAGGCCGTC GCCGCCACCC GCGAGGCCGA GGCCGCCGCC CTGCGCGGCG AGGCCGCGTT CGACCTCGCC GCCGCCGTCG CCGCCGAAAT GCGCCCGGCC ATGGTCTGCC GGCATGTCAC CGCGGTGAAC GAGGCGGAAG CCGCCCTGCT CCGCGCCGCC GGCCTGCCCG GCGTCGCCGT GCTCGGCACC ATCCGCGCGC CCGATCCCAC GCCGCGCCCC TTCGCCGCGC GCGCCGGCCT GCTCTTCGTC GCCAGCATCC ACCGCGAGGA CAGCCCGAAT CTCGACAGCC TGCGCTGGTA TCGCGACGAA ATCCTGCCCG CCCTGCGGCG GATCATGGAC GAGCCGCCGA CCCTCAGCTT CGTCGGCTAC ACCGCGCCGG ATCTCGACCT CGCCGAATTC CGCGGCATCC CGGGCATCGA GCTGCGCGGC ACGGTGGCCG AGCTGCGCCC CGCCTATGAC GAGCACCGCC TGTTCATCGC CCCCACCCGC TTCGCCGCCG GAACACCCTA CAAGGTCTAC GAAACCGCCT CGTTCGGCCT GCCCTGCGTG GCAACGCCAC TGCTCTGCCG CCAGCTCGGC TGGACCCCGG GCAAGGACAT CGCCACCCCG GCCACCGCCG ACGCCGCCGC CTTCGCCGCC GAGATCGCCT CGCTCTACCG CGACGAAACC CGCTGGACCG CGCTGCGCGA TTCCGCGCTG CGCCGCCTCG CCGCGGAAAA CGGCCGGGCC CCCTTCGAAG CCACGGTCCG CCGGATCCTC GACGCCATCA GCGCCTGA
|
Protein sequence | MPQPAHLLLR LPAEQRPAWA LFDPVWYALR HPDIAAGRDE AALLEYYLAV GGRLGHSPSP LFDEGFYRAS NPDIAALVAE GGYASGFDHF CQYGHRGLSP HWLFDDTLYG DLHADMSLEN LDRHRCFGRY DHWLKSGQRE QRICHFLFDS AFYRREVEAA GEGAARLESM GPFVHYLYAL WDDTPERATS PYFDPAWYAA MHESARDDIA EGRARNALHH YITRGEAAGF DPVPEFSERY HLDTYDDIAA AVEAGVFRSG YHHFVRYGTL ELRRPRADID LIYYRDTNPR VRNDLAAGAV RDAFAHLRTI GLREGLDHCP PERVPDLPER AARQLFELRA RNQSALFARQ RIDFAPSAPP ALSVIMVLHE KFELTMQALA SLRANFPGDL DLILIDNGST DATARIETYV RGATILREPG NPGFLLACNR GLREVRAPAV LYLNNDVELG FGAIAAALRR LASAPDIGAV GGKVIRTHGR LQEAGSIVWA DGSTQGYLRD ASPLAPEANF VRDVDFCSGV FLLCRTDLVR RLGGFDEAFR PAYYEEVDLC VRMIEAGFRV IYDPDVTIHH LEYGSSSNAE AAMQQMRRNR RVFVRKHKAF LHFQQAPAPG GEIRARARNG AGRRLLFLED TVPLRRLGSG FVRANDVVHA IADAGWQVSV LPVNGARHDL MSQFGDLPET AEVLHDRSIM TLPALLAERP GFYDAVWISR THNLMRTAPI FEAAGIDPAT TPFVVDTEAV AATREAEAAA LRGEAAFDLA AAVAAEMRPA MVCRHVTAVN EAEAALLRAA GLPGVAVLGT IRAPDPTPRP FAARAGLLFV ASIHREDSPN LDSLRWYRDE ILPALRRIMD EPPTLSFVGY TAPDLDLAEF RGIPGIELRG TVAELRPAYD EHRLFIAPTR FAAGTPYKVY ETASFGLPCV ATPLLCRQLG WTPGKDIATP ATADAAAFAA EIASLYRDET RWTALRDSAL RRLAAENGRA PFEATVRRIL DAISA
|
| |