Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_0570 |
Symbol | |
ID | 5160353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | - |
Start bp | 641181 |
End bp | 642311 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640552486 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_001233713 |
Protein GI | 148259586 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACA TGGCGAGACG CAATTTTCTC GGTCTTCTCG GCGCGGCCGC GCTCACGGGC GGCCCGGTGA TGATGGCGCG GGAGCTGTGG CAATCGTCCT GGCGCGGCTA CCGGCACGGA TTCATCGACG GCCAGGGGCG GGTCATCGAC TATTCCGCCA ACAAGGGATT CAGCACTTCG GAGGGGCAGT CCTATGGCAT GTTCCTCAGT CTCGTCGCCG GCGACCGCGC GACGTTCCGC CGCATCCTGA ACTGGACGAA CACGAACATG GCGGGCGGGC GCCTCGGGGA GGTGCTCGCG GCGTGGAAAT GGGGGCTGCA CGGCGGCAAA TGGGGGGTGA TCGGCGCCAA CTCGGCGGCG GACGCGGATG CGTGGATGGC CTATTCGCTG CTCGAGGCCG CCCGGATCTG GAAGGATCAC AATCTCGGCG CCGAGGGGCA CAAGCTGGCG ACGCGCATCG CCGATGACGA GAGCGTGGCG ATCAACGGAT TTGGCCGGGT CCTGATACCC GGCGCCTCCG ATTTTCCGGA CACGCCGCCC GTCATCGTCG ATCCGAGCTA TACGCCGCTG TTTCTGGCCC GCGGCATCGC GCGCGCGACC AACCTGCCGA AATGGCAGGC GATCGCGGCC ACGCTGCCAC GGCTGATGAC GACGATCTGC CGCAACGGAT TCGCCCCCGA CTGGGCCTGG GCGCCGCAAG CCCCCGCCTC GCCGCCGGCG GGCCTGCCGG AGACCGGCAC CGGATCGTTC GATGCGATCC GGTGCTATCT CTGGGCGGGC CTGACCGCGC CGGAAACGGA GGGGAGCGCG ACCGTTCTCG CCTCGCTGAA GGGCATGGCC CGATACTTGG CCACCCACCG CGCGCCGCCG CAGAGCGTCG ATCTCGCGAG CCAGGCGACG CACGGGACGG GCGGGATCGG GTTTTCCGCG GCGCTGCTGC CCTACCTCGC GGCACTCGGG CGCCACCGGC TGCTCCATCA GCAGCTCGGC CGCGTGCTGG CCCAGCGGGA GACCAGCGGC CTGTTCGGCC AGCCCGCCGA CTATTACTCG GAAAACCTGA TCCTGTTCGG ACTTGGCGGA CTTTCGGGAA GCATCCGCTT CGACAAGCAA GGAGGTCTGA TCACGTCATG A
|
Protein sequence | MTDMARRNFL GLLGAAALTG GPVMMARELW QSSWRGYRHG FIDGQGRVID YSANKGFSTS EGQSYGMFLS LVAGDRATFR RILNWTNTNM AGGRLGEVLA AWKWGLHGGK WGVIGANSAA DADAWMAYSL LEAARIWKDH NLGAEGHKLA TRIADDESVA INGFGRVLIP GASDFPDTPP VIVDPSYTPL FLARGIARAT NLPKWQAIAA TLPRLMTTIC RNGFAPDWAW APQAPASPPA GLPETGTGSF DAIRCYLWAG LTAPETEGSA TVLASLKGMA RYLATHRAPP QSVDLASQAT HGTGGIGFSA ALLPYLAALG RHRLLHQQLG RVLAQRETSG LFGQPADYYS ENLILFGLGG LSGSIRFDKQ GGLITS
|
| |