Gene Information |
Locus tag | Franean1_1051 |
Symbol | |
ID | 5669465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1233736 |
End bp | 1235454 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239980 |
Product | trehalose synthase |
Protein accession | YP_001505413 |
Protein GI | 158312905 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase |
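
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, though both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1233736, 1235454)
print(length, protein_length_aa(length))  # 1719 572
```

Both results match the Gene Length (1719 bp) and Protein Length (572 aa) fields.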
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.828904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.512672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGAGT CGGAACCGTT CGCGGAGCAG CCGCTGCGCG AGACCGCGGC CGAGGGCGGC GACACGGGGA CGTCCCAGGA CCCGCTGTGG TTCAAGCGGG CCGTGTTCTA CGAGGTCCTC GTCCGCGGCT TCGCCGACTC CAACGGCGAC GGCACCGGTG ACCTGGCCGG GCTGGTCTCG AAACTGGACT ACCTCGAGTG GCTCGGCGTG GACTGCCTGT GGCTGCTGCC CATCTACTCC TCGCCGCTGC GCGACGGCGG GTACGACATC AGCGACTACT TCCAGATCCT CCCCGAGTTC GGGGACCTGG GCGACTTCAT CAACCTCGTC GACGAGGCGC ACCGGCGCGG GCTACGGATC ATCGCCGACC TGGTGATGAA CCACACCTCC GACGAGCACC CGTGGTTCCA GGCGTCGCGC TCGGACCCGG ACGGGCCGTA CGGCGACTTC TACGTCTGGT CCGACACCGA CGAGAAGTAC CCCGACGCCC GGATCATCTT CGTCGACACC GAGAAGTCGA ACTGGACCTG GGACCCGGTG CGCGGGCAGT ACTACTGGCA CCGGTTCTTC TCCCACCAGC CCGACCTCAA CTACGACAAC CCGGACGTCC AGGAGGCGAT GCTGGAGGTC CTGCGCTTCT GGCTCGACCT CGGCCTCGAC GGGTTCCGCC TGGACGCCGT TCCCTACCTG TACGTGCGCG AGGGCACGAA CGGGGAGAAC CTGCCGGAGA CGCACGAGTA CCTGCGCCGG GTCCGCAAGG AGATCGACGC CAAGTACGCC GACAGGGTCA TGCTGGCCGA GGCGAACCAG TGGCCCTCGG ACGTCGTCCA GTACTTCGGC AATGACGACG AGTGCCACAT GGCCTTCCAC TTCCCGCTGA TGCCGCGCAT CTTCATGGCG GTGCGGCGGG AGTCGCGCTA CCCGATCTCG GAGATCCTGG CGCAGACCCC GGAGATCCCG CCGAACTGCC AGTGGGGCAT CTTCCTGCGC AACCACGACG AGCTGACCCT GGAGATGGTC ACCGACGAGG AGCGGGACTA CATGTACGCC GAGTACGCGA AGGACCCGCG TATGAAGGCG AACATCGGGA TCCGCCGACG CCTCGCCCCG CTGCTGGACA ACAGCCGCGA CCAGATGGAG CTGTTTACCG CCCTGCTGCT CTCCCTGCCC GGCAGTCCCG TGCTCTACTA CGGCGACGAG ATCGGTATGG GCGACAACAT CTATCTCGGT GACCGCGACG GCGTGCGCAC CCCGATGCAG TGGTCCCCGG ACCGCAACGC CGGGTTCTCG ACAACCGACC CGGCCCGGCT GTACCTGCCG GTGATCATGG ACCCGGTGTA CGGCTACCAG GCGCTGAACG TCGAGGCCGA GCAGCGGATG CCGACGTCGT TCCTGTCTTG GACCAGGCGG ATGATCGAGG TCCGCAAGCG GCATCCCGTC TTCGGGCTCG GCACCTACGA GGAGCTCGGC GCGTCGAATC CGTCGGTCTT CGCGTATGTC CGGGAGTTCG GTGACGACAG GGTGCTCTGC GTCGCGAACC TCTCCCGGTT CGCCCAGCCC GTCGAGCTTG ACCTGCGGAG ATTCGCCGGC CTGGTGCCGG TGGAGCTGCT CGGCCGGGTC CATTTCCCAC CGGTCGGCGA GCTTCCGTAC CTGCTGACAC TGCCCGGTCA CGGACACTAC TGGTTCGCTC TGTCCAATCC GGGGGAATTC ACTCAGTAG
|
Protein sequence | MNESEPFAEQ PLRETAAEGG DTGTSQDPLW FKRAVFYEVL VRGFADSNGD GTGDLAGLVS KLDYLEWLGV DCLWLLPIYS SPLRDGGYDI SDYFQILPEF GDLGDFINLV DEAHRRGLRI IADLVMNHTS DEHPWFQASR SDPDGPYGDF YVWSDTDEKY PDARIIFVDT EKSNWTWDPV RGQYYWHRFF SHQPDLNYDN PDVQEAMLEV LRFWLDLGLD GFRLDAVPYL YVREGTNGEN LPETHEYLRR VRKEIDAKYA DRVMLAEANQ WPSDVVQYFG NDDECHMAFH FPLMPRIFMA VRRESRYPIS EILAQTPEIP PNCQWGIFLR NHDELTLEMV TDEERDYMYA EYAKDPRMKA NIGIRRRLAP LLDNSRDQME LFTALLLSLP GSPVLYYGDE IGMGDNIYLG DRDGVRTPMQ WSPDRNAGFS TTDPARLYLP VIMDPVYGYQ ALNVEAEQRM PTSFLSWTRR MIEVRKRHPV FGLGTYEELG ASNPSVFAYV REFGDDRVLC VANLSRFAQP VELDLRRFAG LVPVELLGRV HFPPVGELPY LLTLPGHGHY WFALSNPGEF TQ
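
The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned and sanity-checked follows; the helper names are hypothetical, and the start-codon set is only the commonly used subset for translation table 11 (ATG, GTG, TTG), not the full list.

```python
COMMON_TABLE11_STARTS = {"ATG", "GTG", "TTG"}  # assumed subset, not exhaustive
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and stops as a CDS."""
    s = clean(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in COMMON_TABLE11_STARTS
        and s[-3:] in STOP_CODONS
    )


# Toy input, not the full gene: first codons of the record plus its stop codon.
print(looks_like_cds("GTGAACGAG TAG"))
```

Applied to the full Gene sequence field, `gc_fraction` would be expected to land near the 67% reported under GC content, and `looks_like_cds` reflects that the sequence begins with GTG and ends with TAG even though the gene lies on the minus strand, which suggests the sequence is shown in coding orientation.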