Gene Franean1_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4678 
Symbol 
ID5673020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5587369 
End bp5588532 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content72% 
IMG OID641243535 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_001508951 
Protein GI158316443 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.194789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG CCGACGTGCG GACGCACGTG TACGGGACCC CGTGGCGGGA TCTGACCTAC 
GTCCAGGTCT TCACGGACGA CGGTCTGGTC GGCGTAGGAG AGACCCGGAT GCTCGGCCAC
ACCCAGGCGC TGCTGGGCTA CCTGGCCGAG GCGACCCGCA ACCATGTGCT CGGCTCCGAT
CCGTTCGACA TCGAGTCACT GGTCGACCGG ATGAAGCGCG GCGACTACGG GCGGGCCGGC
GAGATCGTCA TGTCCGGTAT CGCGTGCGTC GAGATGGCCT GCTGGGACAT CGTCGGCAAG
GCGCTGGGGC AGCCGGTGTG GCGGCTGCTG GGCGGCAAGG TCCGCGACCG GATCAAGGCG
TACGCCAACG GCTGGTACAC CGTCGAGCGC ACCCCCGAGG AGTTCCACGC GGCGGCGCGG
GCGGTGGTGG ACCGGGGCTA CCGGGCGCTC AAGCTCGACC CGTTCGGGGC CGGCCGGTGG
GAGCTGGACC GGGCCGAACG TCGCCACTCC ATCTCCCTGG TCGAGGCGGT GCGCGACGCG
GTCGGGCCGG ACGTGGAGAT CCTCGTCGAG ATGCACGGGC GGTTCGCCCC ACACGAGGCG
ATCCGGATTG CCGCCTCACT GACCGAGTTC GAGCCGGGCT GGGTCGAGGA GCCGGTACCA
CCGGAGAACC TGCGGGCGCT GGCCAAGGCC GCCGCCGGGA TCGACGCCCC GGTGGCGACC
GGGGAGCGCA TCCACGACCG CACCGAGTTC CGGGAGCTGT TCGACCTCGG CGCGGCCGAC
ATCATCCAGC CCGACATCGG CCATCTCGGT GGCATCAGCG AGACCCGCAA GCTCGCCGCG
ACCGCAGAGA CCCACTTCAC GCTGGTCGCC CCGCACAACG TCGGCGGCGC GGTTCTCACC
GCCGCCAACC TGCACCTGGC CGCCTGCACC CCCAACTTCA TGATCCAGGA ACACTTCAAC
GACTTCGCCG ACGAGGAGGT CAAGCTCGCG GCGCCGGGCC TGCCGCCGGT CGTCGACGGC
TACTTCGCCC TGCCGACCGC ACCCGGCCTC GGCGTCGAGC TGGACGTCGA CGTCGTGGCC
GCCCACCCGT CCCGCGGCGC CCACTTCGAC CTCTACGCCG ACGGCTGGGA GCTGCGCGGC
TCCCGCCCGC CCGGCCGCGG CTGA
 
Protein sequence
MKIADVRTHV YGTPWRDLTY VQVFTDDGLV GVGETRMLGH TQALLGYLAE ATRNHVLGSD 
PFDIESLVDR MKRGDYGRAG EIVMSGIACV EMACWDIVGK ALGQPVWRLL GGKVRDRIKA
YANGWYTVER TPEEFHAAAR AVVDRGYRAL KLDPFGAGRW ELDRAERRHS ISLVEAVRDA
VGPDVEILVE MHGRFAPHEA IRIAASLTEF EPGWVEEPVP PENLRALAKA AAGIDAPVAT
GERIHDRTEF RELFDLGAAD IIQPDIGHLG GISETRKLAA TAETHFTLVA PHNVGGAVLT
AANLHLAACT PNFMIQEHFN DFADEEVKLA APGLPPVVDG YFALPTAPGL GVELDVDVVA
AHPSRGAHFD LYADGWELRG SRPPGRG