Gene Franean1_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2097 
Symbol 
ID5670497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2520902 
End bp2521945 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content69% 
IMG OID641241018 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_001506439 
Protein GI158313931 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAC GCACCGAACG ACGCACCGAA AGGCGAACGA TGACCACTCT GCTGGTGACC 
GGGGCCGCCG GTTTCATCGG GTCCAACTTC GTCCGGTACT GGCTGGGGAC GCACCCCGGC
GACCGCGTGA TCGCGCTCGA CGCGCTGACC TACGCGGGCT GCCGGGAGAA CCTCGCCGAC
CTCGAGGACG GGATCACGTT CGTCCACGGC GACATCCGTG ACCGTGAGCT CATCGAGTCC
ACGCTGCGCG AGCACCGTGT GGACGTCGTG GTCAACTTCG CCGCCGAGTC CCACAACAGC
CTGGCGATCA TCCGTCCCGG CGAGTTCTTC TCCACCAACG TCATGGGCAC GCAGACCCTG
CTGGAGGCCG CGCGGACCGT CGGGGTCGCC CGCTTCCACC AGATCTCGAC CTGCGAGGTC
TACGGCGACA TGGACCTCGA CGACCCCGGC GCCTTCACCG AGGACTCCCC CTACCTGCCG
CGCACGCCGT ACAACGCCGC CAAGGCCGGC TCGGACCACG CCGTGCGCTC CTACGGTTTC
ACCTACGGCC TGCCGGTGAC CATCACCAAC TGCTCGAACA ACTACGGGCC GTTCCAGTTC
CCGGAGAAGG TCATCCCACT GTTCGTGACC CGGGCGCTGC AGGGCCAGTC ACTGCCGCTC
TACGCCTCCA CGAAGAACCG GCGGGAGTGG CTGCACGTGG TGGACCACTG CCGGGCCATC
GAGGCGGTGC TCGAGCGCGG CACGGTGGGT GAGACCTACC ACGTCGGCTC CGGCATCGAG
GCCGACATCG AGACCATCGC CGACCTGATC CTCGGCGAGC TGGGCCTGCC CGCCTCGCTG
AAGACGATCG TGCCGGACCG TCCCTCCCAC GACCGCCGCT ACCTGCTGGA CTCCGGCAAG
CTACGCACGA CGCTCGGCTG GGAGCCGCGG ATCAGCTTCG CGGACGGGAT GAAGGCCACC
ATCGGGTGGT ACCGGGACAA CGAGGCGTGG TGGCGTCCCC TGCTCGGCCG CTCCCCCGTC
TCGGAGACCG CCTGGCAGAG CTGA
 
Protein sequence
MRRRTERRTE RRTMTTLLVT GAAGFIGSNF VRYWLGTHPG DRVIALDALT YAGCRENLAD 
LEDGITFVHG DIRDRELIES TLREHRVDVV VNFAAESHNS LAIIRPGEFF STNVMGTQTL
LEAARTVGVA RFHQISTCEV YGDMDLDDPG AFTEDSPYLP RTPYNAAKAG SDHAVRSYGF
TYGLPVTITN CSNNYGPFQF PEKVIPLFVT RALQGQSLPL YASTKNRREW LHVVDHCRAI
EAVLERGTVG ETYHVGSGIE ADIETIADLI LGELGLPASL KTIVPDRPSH DRRYLLDSGK
LRTTLGWEPR ISFADGMKAT IGWYRDNEAW WRPLLGRSPV SETAWQS