Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4665 |
Symbol | |
ID | 5673007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5568222 |
End bp | 5569439 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243522 |
Product | xylose isomerase |
Protein accession | YP_001508938 |
Protein GI | 158316430 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2115] Xylose isomerase |
TIGRFAM ID | [TIGR02631] xylose isomerase, Arthrobacter type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.025606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.682712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGAC AGCCCACTCC CGAGGACAAG TTCTCCTTCG GCCTGTGGAC GGTCGGCTGG ACCGGCACCG ACCCGTTCGG CCTGCCGACC CGGACGGCCC TCGACCCGTG GGAGTACGCC GACCGGCTGG CCGAGATAGG CGCCTGGGGC ATCACCCTGC ACGACAACGA CGTCTTCCCC TTCGACGCCG ATGACGCCGC CGCCGCGCGG GCGTCCCGCC GGCTCAAGGA GGCCACCGAC GCCTCCGGCC TGGTCATCGA GATGGTGACC ACGAACACCT TCACCCATCC CGTCTTCAAG GACGGCGGCC TGACCTCGAA CGACCGCGGC GTGCGCCGGT TCGGCCTGCG CAAGGTGCTG CGCGCGGTGG ATCTCGCGGC GCAGCTCGGC GCGACCACGT TCGTGATGTG GGGCGGCCGG GAGGGCAGCG AGTACGACGG GTCGAAGGAC GTCTTCGCCG CGCTGGAGCG CTACCGGGAG GGCCTGGACA CCGTCGCCGG CTACATCAAA AGCCAGGGCT ACGACCTGCG GATCGCGCTC GAACCCAAGC CGAACGAGCC GCGCGGCGAC ATCCTCCTGC CCACCGTCGG GCATGCGCTG GCGCTGATCG CCGAGCTGGA GAACGGCGAC ATCGTCGGGG TCAACCCGGA GACCGGGCAC GAGCAGATGG CCAACCTCAA CTACACCCAC GCGCTCGGCC AGGCACTGTG GAGCGGGAAG CTGTTCCACA TCGACCTCAA CGGGCAGCGG GGCCTGAAGT ACGACCAGGA CCTGGTCTTC GGGCACGGCG ATCTCGTCTC GGCGTTCTTC ACCGTCGACC TGCTCGAGAA CGGCTTCCCG GGCTACCCGG ACGGCCCCCG GTACACCGGT CCCCGCCACT TCGACTACAA GCCGTCGCGG ACGGAGGGCA TGGCCGGGGT CTGGGAGTCG GCGCGGGCGT GCATGTCGAC CTACCTGCTG CTCGCCGAGA AGGTGGCGGC GTTCCGCGCC GATCCCCTCG TCCAGGAGGC GATGGCCTAC GCCGGCGTGT TCGAGCTGGC CAAGCCCACC CTCGCCCCCG GCGAGACCGC GGCCGATCTC CTGGCCTCGG ACGACGGCTT CGACCCGGCC AAGGCGGCCG AGCGGGACTT CGGTTTCGTG CGGCTCCAGC AACTGGCGAT CGAGCATCTC GTCGGCTCGC CCGCCCCGGG CTCGGCAGCC GCCCGTTCCG CCGGCTGA
|
Protein sequence | MPRQPTPEDK FSFGLWTVGW TGTDPFGLPT RTALDPWEYA DRLAEIGAWG ITLHDNDVFP FDADDAAAAR ASRRLKEATD ASGLVIEMVT TNTFTHPVFK DGGLTSNDRG VRRFGLRKVL RAVDLAAQLG ATTFVMWGGR EGSEYDGSKD VFAALERYRE GLDTVAGYIK SQGYDLRIAL EPKPNEPRGD ILLPTVGHAL ALIAELENGD IVGVNPETGH EQMANLNYTH ALGQALWSGK LFHIDLNGQR GLKYDQDLVF GHGDLVSAFF TVDLLENGFP GYPDGPRYTG PRHFDYKPSR TEGMAGVWES ARACMSTYLL LAEKVAAFRA DPLVQEAMAY AGVFELAKPT LAPGETAADL LASDDGFDPA KAAERDFGFV RLQQLAIEHL VGSPAPGSAA ARSAG
|
| |