Gene Franean1_7067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7067 
Symbol 
ID5675377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8623291 
End bp8625123 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content64% 
IMG OID641245912 
Productphosphogluconate dehydratase 
Protein accessionYP_001511303 
Protein GI158318795 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase
[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTGA ACACGGTCCT GCACCCGGTT GTCGCTGAGG TGACTGAACG GGTCGCCGCT 
CGCAGTGCGG CAACTCGCGA AGCCTATCTC TCCCGCGTCC AGGCAGCAGC CCAGGCGGGC
CCGACCCGGG GCAGTCTAGG GTGCGCTAAT CTTGCGCACG GCTTCGCTGC ATGCGCTCCG
GCAGACAAGA TCGAACTCCG CGGAGCGGCC AAGCCCGACA TTGCAATCGT GTCAGCATAC
AACGATATGC TCTCCGCCCA TCAGCCTTTT GAGACCTATC CCGCGGTTCT CAAGCGTGCG
GTGAGCGAGG CTGGTGGTGT CGCGCAGTTT GCCGGCGGTG TGCCCGCAAT GTGCGATGGC
ATTACCCAGG GCCGTGCGGG TATGGAATTG TCCTTGTTCA GTCGTGATGT GATTGCGATG
GCCACGGCGG TGGCGCTGGC GCACGACATG TTCGACGGCG TGCTACTACT CGGGGTGTGC
GACAAGATCG TCCCCGGGCT CGTTATCGGT GCGCTGTCCT TCGGTCACCT ACCGGCCATC
CTTGTCCCTG CCGGGCCGAT GACCTCCGGG TTACCCAATG CAGCCAAGAG CCTCACCCGC
CAGTTGTACG CCGAAGGCAA GGCCAGTCGA CAGGAGCTCC TCGATGCGGA GGCAGCCGCC
TATCACAGCG CGGGAACGTG TACGTTCTAC GGCACGGCCA ATACCAATCA GCTGCTCATG
GAGATCATGG GCCTGCACCT TCCAGGTGCC AGCTTCGTCA ACCCGGATAC TGCTCTGCGT
AACGCCCTCA CCGCCGCTGC CGGGCATCGG ATTACCCAGC TGACCACTCT CGGCGATTCC
CACACGCCAA TCGGCGAGAT TATTGATGAG CGCGCGATTG TAAACGGCGT TGTTGGCTTG
CTTGCCAGCG GCGGTTCGAC GAACCACACC ATGCATCTGG TCGCGATTGC CGCTGCCGCA
GGAATACGGT TGACCTGGGA CGATTTCGGC GCCCTGTCGG CTGCGGTGCC ATTGCTTGCA
CGGATCTACC CCAATGGCCC TGCCGACGTG AACCATTTCC ACGCAGCCGG AGGCACAGCG
TTCCTCATCA GCGAACTGCT GGACGCAGGC ATGCTGCACG GTGACGTCCG TACCGTCGCG
GGCGACGGCC TCGATCACTA CCGGCAGGAA CCGGTCCTGG TAGGTGACGA GCTCCTCTGG
AGAAGCGGGG CGACAAAAAG CCTCGATGGG GACGTGCTGC GCCAGGTTTC CCACCCATTC
GCGCCCGACG GCGGTTTGCG TATGCTCAGC GGCAGCCTCG GTCGGGCGGT AGTGAAGACG
TCTGCGGTGC GAGCCGAACA TCTCCTCACC CAGGCACCAG CGAGGGTTTT CGACGACCAG
GCAGAATTCC TTGCCGCATT CGAGGCGGGC GAGCTTAGCG GTGATCTCGT AGCAGTCATC
CGTTACCAGG GCCCACGCGC CAACGGCATG CCTGAGCTTC ACAAACTCAT CCCGGCCCTC
GGCGTACTGC AGGACCGCGG CCACAGGGTT GCCCTCGTGA CAGACGGAAG GATGTCCGGC
GCTTCGGGAA AAATTCCTGC CGCGATCCAT GTCACTCCGG AGGCAGCTGC GGGCGGGCCA
ATCGCCCGTG TACGAGACGG CGACGTCATC CGGCTGGACG CTACGACTGG TTCGCTCGAG
GTGATCGGCA CTGATCTCAG TGATCGTGAG CCCACGGAGA AGTCACTTGC CTCGGACACG
GGGATAGGGA CCGGACGCGA GCTCTTCGCT GCGTTCCGGA GCGTTGTCGG CCCAGCCGAC
TCCGGGGCCA GCGTGCTGAC GGTAACGGCA TGA
 
Protein sequence
MSVNTVLHPV VAEVTERVAA RSAATREAYL SRVQAAAQAG PTRGSLGCAN LAHGFAACAP 
ADKIELRGAA KPDIAIVSAY NDMLSAHQPF ETYPAVLKRA VSEAGGVAQF AGGVPAMCDG
ITQGRAGMEL SLFSRDVIAM ATAVALAHDM FDGVLLLGVC DKIVPGLVIG ALSFGHLPAI
LVPAGPMTSG LPNAAKSLTR QLYAEGKASR QELLDAEAAA YHSAGTCTFY GTANTNQLLM
EIMGLHLPGA SFVNPDTALR NALTAAAGHR ITQLTTLGDS HTPIGEIIDE RAIVNGVVGL
LASGGSTNHT MHLVAIAAAA GIRLTWDDFG ALSAAVPLLA RIYPNGPADV NHFHAAGGTA
FLISELLDAG MLHGDVRTVA GDGLDHYRQE PVLVGDELLW RSGATKSLDG DVLRQVSHPF
APDGGLRMLS GSLGRAVVKT SAVRAEHLLT QAPARVFDDQ AEFLAAFEAG ELSGDLVAVI
RYQGPRANGM PELHKLIPAL GVLQDRGHRV ALVTDGRMSG ASGKIPAAIH VTPEAAAGGP
IARVRDGDVI RLDATTGSLE VIGTDLSDRE PTEKSLASDT GIGTGRELFA AFRSVVGPAD
SGASVLTVTA