Gene Francci3_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1987 
Symbol 
ID3903695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2333656 
End bp2334747 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content71% 
IMG OID637879323 
Productmandelate racemase/muconate lactonizing enzyme 
Protein accessionYP_481090 
Protein GI86740690 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.392929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCGG TTGAGAGCGT GGAGGTCGCC GCCTACCTGG TGCCGACCGA GCAGCCCGAG 
TCGGACGGGA CGCTGGAGTG GGGTTCCGTC ACCCTGGTGG TGGTGACGGT CCGGACCGGC
GGCCAGGCCG GTCTGGGCTA CACCTACTGC CATCTCGCTG CCGCGGATGT CGTCGCGGGA
AAGTTGGCGG CGGTGGTGAC TGGCCGGGAC GCGTTGCGGG TCGGTGCCTG CTGGTCGGCG
ATGCAGGCGG CGGTGCGGAA CATCGGCCGT CCCGGCATGG CCGCAGAGGC GATCTCGGCG
GTGGACATAG CGCTGTGGGA TCTCAAGGCG CGGCTGCTCG GTGTCCCGCT GGTGGTGGCG
CTGGACGCGG TGCACGACCG GGTCCCGATC TACGGCAGTG GCGGCTTCAC GTCCTATCCG
GACAGCCAGC TCTGCGATCA GTTGTCGGGT TGGGCGGCGG CGGGTATCCC GCGGGTGAAG
ATGAAGGTCG GCCGGGACCC GGCGGAGGAC CGGAAGCGGG TGGCTGTGGC CCGGCGGGCA
GTCGGATCCG ACGTCGAGCT CTATGTGGAC GCGAACGGGG CGTACAGCCG CAAGCAGGCG
TTGATGCTGG CGGAGATCTT CGCGGAGCAG GATGTGCGCT GGTTCGAGGA GCCGGTCAGC
TCCGACGACC TGGAGGGGTT GCGGCTGCTG CGGGACCGCG GCCCGGCGGG GATGGACATC
GCGGCCGGCG AGTACGGCTA CACGTTGTCC GGTCTGGAAC GGATGCTGGC CGCGGGTGCT
GTCGACTGCC TGCAGGTGGA CGTCACCCGC TGCGGCGGCA TCAGCGGGTT CCTGCGGGCG
GCGGCGCTGT GCGACGCGCG GGGGATCGAC CTGTCCGCCC ACTGTGCGCC GCAGGTCAGC
GTGCACGCCT GCACCGCGGT GTGGCATCTG CGGCACCTCG AGTACTTCCA CGACCACGTC
CGGGTCGAAC ATCTGCTGTT CGACGGGGTC CTCGATGCGC GGCCGGACGG GACGCTGGTC
CCGGATCGGT CGCGGTGCGG CCTGGGCCTG TCGGTGCGGC AGCGGGACGC CGAACGGTTC
CGGGTCCGAT GA
 
Protein sequence
MPAVESVEVA AYLVPTEQPE SDGTLEWGSV TLVVVTVRTG GQAGLGYTYC HLAAADVVAG 
KLAAVVTGRD ALRVGACWSA MQAAVRNIGR PGMAAEAISA VDIALWDLKA RLLGVPLVVA
LDAVHDRVPI YGSGGFTSYP DSQLCDQLSG WAAAGIPRVK MKVGRDPAED RKRVAVARRA
VGSDVELYVD ANGAYSRKQA LMLAEIFAEQ DVRWFEEPVS SDDLEGLRLL RDRGPAGMDI
AAGEYGYTLS GLERMLAAGA VDCLQVDVTR CGGISGFLRA AALCDARGID LSAHCAPQVS
VHACTAVWHL RHLEYFHDHV RVEHLLFDGV LDARPDGTLV PDRSRCGLGL SVRQRDAERF
RVR