Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7021 |
Symbol | |
ID | 5675332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8562573 |
End bp | 8564405 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641245867 |
Product | phosphogluconate dehydratase |
Protein accession | YP_001511258 |
Protein GI | 158318750 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase [TIGR01196] 6-phosphogluconate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTGA ACACGGTCCT GCACCCGGTT GTCGCTGAGG TGACTGAACG GGTCGCCGCT CGCAGTGCGG CAACTCGCGA AGCCTATCTC TCCCGCGTCC AGGCAGCAGC CCAGGCGGGC CCGACCCGGG GCAGTCTAGG GTGCGCTAAT CTTGCGCACG GCTTCGCTGC ATGCGCTCCG GCAGACAAGA TCGAACTCCG CGGAGCGGCC AAGCCCGACA TTGCAATCGT GTCAGCATAC AACGATATGC TCTCCGCCCA TCAGCCTTTT GAGACCTATC CCGCGGTTCT CAAGCGTGCG GTGAGCGAGG CTGGTGGTGT CGCGCAGTTT GCCGGCGGTG TGCCCGCAAT GTGCGATGGC ATTACCCAGG GCCGTGCGGG TATGGAATTG TCCTTGTTCA GTCGTGATGT GATTGCGATG GCCACGGCGG TGGCGCTGGC GCACGACATG TTCGACGGCG TGCTACTACT CGGGGTGTGC GACAAGATCG TCCCCGGGCT CGTTATCGGT GCGCTGTCCT TCGGTCACCT ACCGGCCATC CTTGTCCCTG CCGGGCCGAT GACCTCCGGG TTACCCAATG CAGCCAAGAG CCTCACCCGC CAGTTGTACG CCGAAGGCAA GGCCAGTCGA CAGGAGCTCC TCGATGCGGA GGCAGCCGCC TATCACAGCG CGGGAACGTG TACGTTCTAC GGCACGGCCA ATACCAATCA GCTGCTCATG GAGATCATGG GCCTGCACCT TCCAGGTGCC AGCTTCGTCA ACCCGGATAC TGCTCTGCGT AACGCCCTCA CCGCCGCTGC CGGGCATCGG ATTACCCAGC TGACCACTCT CGGCGATTCC CACACGCCAA TCGGCGAGAT TATTGATGAG CGCGCGATTG TAAACGGCGT TGTTGGCTTG CTTGCCAGCG GCGGTTCGAC GAACCACACC ATGCATCTGG TCGCGATTGC CGCTGCCGCA GGAATACGGT TGACCTGGGA CGATTTCGGC GCCCTGTCGG CTGCGGTGCC ATTGCTTGCA CGGATCTACC CCAATGGCCC TGCCGACGTG AACCATTTCC ACGCAGCCGG AGGCACAGCG TTCCTCATCA GCGAACTGCT GGACGCAGGC ATGCTGCACG GTGACGTCCG TACCGTCGCG GGCGACGGCC TCGATCACTA CCGGCAGGAA CCGGTCCTGG TAGGTGACGA GCTCCTCTGG AGAAGCGGGG CGACAAAAAG CCTCGATGGG GACGTGCTGC GCCAGGTTTC CCACCCATTC GCGCCCGACG GCGGTTTGCG TATGCTCAGC GGCAGCCTCG GTCGGGCGGT AGTGAAGACG TCTGCGGTGC GAGCCGAACA TCTCCTCACC CAGGCACCAG CGAGGGTTTT CGACGACCAG GCAGAATTCC TTGCCGCATT CGAGGCGGGC GAGCTTAGCG GTGATCTCGT AGCAGTCATC CGTTACCAGG GCCCACGCGC CAACGGCATG CCTGAGCTTC ACAAACTCAT CCCGGCCCTC GGCGTACTGC AGGACCGCGG CCACAGGGTT GCCCTCGTGA CAGACGGAAG GATGTCCGGC GCTTCGGGAA AAATTCCTGC CGCGATCCAT GTCACTCCGG AGGCAGCTGC GGGCGGGCCA ATCGCCCGTG TACGAGACGG CGACGTCATC CGGCTGGACG CTACGACTGG TTCGCTCGAG GTGATCGGCA CTGATCTCAG TGATCGTGAG CCCACGGAGA AGTCACTTGC CTCGGACACG GGGATAGGGA CCGGACGCGA GCTCTTCGCT GCGTTCCGGA GCGTTGTCGG CCCAGCCGAC TCCGGGGCCA GCGTGCTGAC GGTAACGGCA TGA
|
Protein sequence | MSVNTVLHPV VAEVTERVAA RSAATREAYL SRVQAAAQAG PTRGSLGCAN LAHGFAACAP ADKIELRGAA KPDIAIVSAY NDMLSAHQPF ETYPAVLKRA VSEAGGVAQF AGGVPAMCDG ITQGRAGMEL SLFSRDVIAM ATAVALAHDM FDGVLLLGVC DKIVPGLVIG ALSFGHLPAI LVPAGPMTSG LPNAAKSLTR QLYAEGKASR QELLDAEAAA YHSAGTCTFY GTANTNQLLM EIMGLHLPGA SFVNPDTALR NALTAAAGHR ITQLTTLGDS HTPIGEIIDE RAIVNGVVGL LASGGSTNHT MHLVAIAAAA GIRLTWDDFG ALSAAVPLLA RIYPNGPADV NHFHAAGGTA FLISELLDAG MLHGDVRTVA GDGLDHYRQE PVLVGDELLW RSGATKSLDG DVLRQVSHPF APDGGLRMLS GSLGRAVVKT SAVRAEHLLT QAPARVFDDQ AEFLAAFEAG ELSGDLVAVI RYQGPRANGM PELHKLIPAL GVLQDRGHRV ALVTDGRMSG ASGKIPAAIH VTPEAAAGGP IARVRDGDVI RLDATTGSLE VIGTDLSDRE PTEKSLASDT GIGTGRELFA AFRSVVGPAD SGASVLTVTA
|
| |