Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_2935 |
Symbol | |
ID | 5160293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 3202050 |
End bp | 3205382 |
Gene Length | 3333 bp |
Protein Length | 1110 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640554865 |
Product | trehalose synthase |
Protein accession | YP_001236044 |
Protein GI | 148261917 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.820681 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCAT TGACCGATCA GATCATTCGA CCCCAGGCGA ACATGTTCGA TCCGCTCTGG TACAAGGACG CGGTGATCTA TCAGGTTCAC GTCAAATCGT TCTTCGACAA GAACAACGAT GGCGTCGGCG ACTTTGCCGG CTTGACCGAA AAGCTCGATT ATATCGCCGA ACTCGGCGTG ACGGCCGTCT GGATCCTGCC GTTCTATCCC TCGCCGCGCC GCGATGACGG ATACGACATC AGCGCCTATC GCGCGGTTCA TCCCGAATAC GGCTCGCTTG GCGACCTGCG GCGCTTCATC GACGCGGCGC ACCGCCGCGG CTTGCGCGTC ATTACCGAAC TCGTCGTCAA CCACACGTCC GATCAGCACC CCTGGTTCCA GCGCGCACGG CATGCCCGGC CGGGCTCGTC GGCGCGGAAC TACTATGTCT GGTCCGACAC CGACCGGAAA TACGACGGCA CACGCATCAT CTTTCTCGAT ACCGAGAAAT CGAACTGGAC GTGGGATCCG GTGGCGGGAG CCTATTACTG GCATCGGTTC TACAGTCACC AGCCAGATCT CAATTTCGAC AATCCGCGTG TCCTGCAGGA AGTGCTGGGC ATCATGCGCT TCTGGCTCGA TCTCGGCGTG GACGGCATGC GGCTCGATGC GGTCCCCTAC CTGATCGAGC GTGACGGAAC GAACAACGAA AACCTCCCGG AAACCCACGC CATCCTGAAG CAGATCCGCG CCGCGCTCGA TGCCCATGCG CCCGGGCGGA TGCTCCTCGC CGAAGCCAAT CAATGGCCCG AGGACGCCCG CCCCTATTTC GGCGAAGGCG ATGAGTGCCA CATGGCGTTT CATTTCCCGC TGATGCCGCG CATGTATATG GCGATCGCCC AGGAGGACCG CTTCCCCATT TCCGACATCA TGCGCCAGAC ACCGGAGATT CCGGAAAACT GCCAGTGGGC GGTGTTCCTG CGCAACCACG ACGAGCTCAC GCTCGAAATG GTTACGGACA AGGAACGTGA TTACCTTTGG GAAACCTACG CAGCCGATCG CCGGGCACGG ATCAATCTCG GCATTCGCCG CCGCCTCGCC CCCCTGCTCG AACGGGACCG GCGCCGCATC GAGCTGATGA ACGGCCTGCT GCTGTCGATG CCCGGCACGC CGGTCATCTA CTATGGCGAC GAGATCGGCA TGGGCGACAA CATCCATCTC GGCGATCGGG ATGGCGTGCG AACGCCGATG CAGTGGTCGC CCGACCGCAA TGGCGGCTTC TCGCGCGCGG ACCCGGCCGC CCTCGTCCTG CCGCCGATCA TGGATCCGCT CTACGGCTAT CAGGCGCTGA ATGTCGAGGC GCAGGCGAAG GATCCGTATT CGCTGCTGAA CTGGATGCGC AGGATGCTCG CCGTCCGCCG CCGGCATCGC GCATTCGGGC GCGGCGGCTT GCGCTTTCTC TATCCCGGCA ACCGGAAGGT GCTCGCCTAT GTGCGGGAGT GGACCGATCA GGACGGCGGA GAGGAAACGA TACTCTGCGT CTACAACCTC GCCCGCACCG CCCAGGCGGT CGAACTCGAC CTCGCGGCCT TCGGTGGCCG AATTCCGCTC GACCTCATCG GCGGCGCGCC GTTCCCGCCG GTGGGCCAGC TTCCCTACAT GCTGACCCTG CCGCCGTTCG CATTCTACTG GTTCAGCCTG ACGACCGAAG CGGCGATGCC CTTCTGGCGC ATCCAGCCGT CGGAGCCGCT GCCCGATTAC ATCACCCTCG TCATGCGGCT CGGCCTCGCC GACCTCGTTG CCGTCGACAG CCGGCACAGT CTCGAGACCG AAATTCTCCC GCCCTATCTC CAGCGGCGGC GCTGGTTCGC CGCGAAGGAC AGGCATGTCC GGAGCGTGAC GATCGCCAAT GCCCATATGC TCGGCACGGC GGAGGACGAT TTCCTGCTCT GCGAAATCGA GGTCGAGTTC GCGGGCGAGG GGAGGGGGGA TGTGTATCTC CTGCCGCTCG CGGTGGTTTG GGATGACGGC CCGGTTGCAA GTATCGTCCA GCAGCTCGCC TTGGCCCGCA TTCGCCGGCA CCGCCGGGTC GGCTATCTGA CGGACGCATT TGCGGTCGAC CGCTTCTGCC ACGACATCAT CGCCAGATTG AGGACAAAAT CCTGCATCTC GCTCGCCTCG GGCCGGCTCA GCTATGAACC CACCGCGCTG ATCGACGACC TGCCGCCGCT CGACGATGCG GAAATCCGCC GCTTTTCCGC CGAACAATCC AACAGTTCCC TGATCGTCGG CGACGCCGCG GTCATGAAGA TCCTGCGGCG GACGGAGCGC GGGATTCATC CCGAAACCGA AATGAGCCGC TTTCTGACCG ACGCGTCCTT TGCGAACATC CCCGCATTGC TGGGCGAAGT CGTGCGGCTC GATCCGGATG GCGAACGGCG CACACTGATC GTCGTCCAGC AGTTTGTCCG CAACCAGGGC GATGCCTGGC AGTGGACGCT GGATGTCCTC GGGCGCGCGG TCGATGGCGC GATTCACGCC GAACTCCGCG ACCCCGGCGG GATCGATCCG CTCTCCGGCT ATCTCTCCTT CGTCTCGGTG ATCGGACGGC GCCTCGCGGA GATGCACTCG GTCCTCGCGC AGTTCGGCAC CGGACCCGAT TTCGCGCCCG AACGGGCCGG CGAGGCCGAG ATCGCGGCAT GGGCCGAGGG CGCGAAAGGC CAGCTGGACG CCGCCGTCGC CGCGGTCGAA CAGATGGCCG ACCGGGCCGG GCCGGAAACG CAAGGCCTGA TCCGGCGCCT GCGCGACGAG CGGACGGCGA TCGAAACCCG ATTGCGGCGC CTCGCCGAGG CGGGCGCCGG AACCCTGCTG ACCCGTGTGC ATGGCGACTT TCATCTCGGC CAGGTTCTGG TGGCGCAGGG CGATGCGTTC ATCATCGATT TCGAAGGCGA GCCCATCAAG CCGATTGCCG AGCGGCGGAA GAAATCCTCT CCCCTGCGTG ATGTCGCCGG TCTGTTGCGA TCCCTCGATT ACGCGGCCGC CACGGTGGAG CGCGCGGCTT TCGCGGCCAG CGAACGCGGC GAAGACCGTC AGCAGGCCAT GATCGCGCGC TTTCGCACCG ATGCCGCCGC GGCCTTCATC GAGGCCTATC GCGCGGTCGC GATGACGGCC CCGCGGCCAT GGATCACGGA AGTCGCGTGG CGCGATGTCC TGGCATTGTT CATGATCGAG AAGGCCGCGT ATGAAATCTG TTACGAGGCG GCGAACCGGC CCGGCTGGAT CGACATCCCG CTGAGCGGTC TGGTCCGGAT TCATGAGCGG CACGAGGGAG GCGGGGATGC CGGCATCGGC TGA
|
Protein sequence | MTSLTDQIIR PQANMFDPLW YKDAVIYQVH VKSFFDKNND GVGDFAGLTE KLDYIAELGV TAVWILPFYP SPRRDDGYDI SAYRAVHPEY GSLGDLRRFI DAAHRRGLRV ITELVVNHTS DQHPWFQRAR HARPGSSARN YYVWSDTDRK YDGTRIIFLD TEKSNWTWDP VAGAYYWHRF YSHQPDLNFD NPRVLQEVLG IMRFWLDLGV DGMRLDAVPY LIERDGTNNE NLPETHAILK QIRAALDAHA PGRMLLAEAN QWPEDARPYF GEGDECHMAF HFPLMPRMYM AIAQEDRFPI SDIMRQTPEI PENCQWAVFL RNHDELTLEM VTDKERDYLW ETYAADRRAR INLGIRRRLA PLLERDRRRI ELMNGLLLSM PGTPVIYYGD EIGMGDNIHL GDRDGVRTPM QWSPDRNGGF SRADPAALVL PPIMDPLYGY QALNVEAQAK DPYSLLNWMR RMLAVRRRHR AFGRGGLRFL YPGNRKVLAY VREWTDQDGG EETILCVYNL ARTAQAVELD LAAFGGRIPL DLIGGAPFPP VGQLPYMLTL PPFAFYWFSL TTEAAMPFWR IQPSEPLPDY ITLVMRLGLA DLVAVDSRHS LETEILPPYL QRRRWFAAKD RHVRSVTIAN AHMLGTAEDD FLLCEIEVEF AGEGRGDVYL LPLAVVWDDG PVASIVQQLA LARIRRHRRV GYLTDAFAVD RFCHDIIARL RTKSCISLAS GRLSYEPTAL IDDLPPLDDA EIRRFSAEQS NSSLIVGDAA VMKILRRTER GIHPETEMSR FLTDASFANI PALLGEVVRL DPDGERRTLI VVQQFVRNQG DAWQWTLDVL GRAVDGAIHA ELRDPGGIDP LSGYLSFVSV IGRRLAEMHS VLAQFGTGPD FAPERAGEAE IAAWAEGAKG QLDAAVAAVE QMADRAGPET QGLIRRLRDE RTAIETRLRR LAEAGAGTLL TRVHGDFHLG QVLVAQGDAF IIDFEGEPIK PIAERRKKSS PLRDVAGLLR SLDYAAATVE RAAFAASERG EDRQQAMIAR FRTDAAAAFI EAYRAVAMTA PRPWITEVAW RDVLALFMIE KAAYEICYEA ANRPGWIDIP LSGLVRIHER HEGGGDAGIG
|
| |