Gene Franean1_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0220 
Symbol 
ID5668645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp268359 
End bp269465 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content78% 
IMG OID641239149 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001504593 
Protein GI158312085 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.854455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.343267 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGCGGT CGGTGGTCAC CGGCGCGGCC GGATTCCTCG GCGGCGCCGT GGCGCATGAG 
CTCCGTCGGC GCGGTGACGA GGTCATCGGG CTCGACGTGC GGCGTGCGCC GGGCGTCACC
CTGGCCGACG TCACGATGTC CGGGAGCTGG GAGAAGGCGC TCGAGGGCGC TGACCTGCTC
GTGCACGCCG CCGCCGTGGG CATGGGCGGC GTCGGGGAGC TGGCGCCGGT GCGGGCCGGC
CGGGCGACCC CGCCCAGCGG GATCACCACC GCGCAGATGC GCAAGGTGCT GCTCGGCGGG
ACCGCGACGG TGCTCGACGC GGCCCAGCGC GCTGGTGTCC GCCGCGTGGT CCACCTGTCC
TGCGTGAGCG CGCTCGGCGA CGACGCACCG CACGCGGCCG ACGAGTCCGC GCCGGTCGGC
CTCACCGGCG AGCCGCGCGC CGACGCGATC GCCGCCGCCG AGCAGACCGT CAGCGCGGCC
GCCGCCCACG GGGCCCCCGT CACCGTGCTG CGGATCGCCG ACGCCTACGG CCCGCGCGCC
GGCCGCTGGA CGCTGTGGCC CGTGCTGCTG ATGCGGGCCG GCCGGTTCGT CCTCGTCGAC
GGCGGGCGCG GCATGCTGAG CCCCGTCCAC GTCGACGACG TGGTGAGCGC GGTGATGGCC
GTCGCGGCCG CCCCGGGCGA GACGGTGACC GGCCAGGTGC TGCACGTGAC CGGTCCGGGC
CCGGCGACGG TGGCCGAGTT CTTCGGGCGG TACGCCGCGA TGGCGCAGGT GCGGGCGCCC
CGGTCCGTGC CTGCGCGGCT GTGCGAGGTC GTCGACGCCG TCGACCGGCT GCCGAGCCGG
CAGCCGGCGT CCGCGGGCGG CCGTCCGGGC CTGCTGCGCG GGATCGGCGC GGCCCTCATG
GCCAACGTGG ACCCGCGTGT CCGGGTCGAC CTCGGCCCGC TGACGATCCA GGACGTCACC
AGGCGCGGCA CGGTCTCCGG GGACCGGATC GCCGCGCTCG TCGGCTGGCG GGGCGAGGTC
GACCTCGACG AGGGCATGCG GCGGACGGGT TCCTGGCTGC GCGACCGGGG CCTGCTGGGG
GTCCCGGAGC CGGCCCGCCG TGGGTGA
 
Protein sequence
MVRSVVTGAA GFLGGAVAHE LRRRGDEVIG LDVRRAPGVT LADVTMSGSW EKALEGADLL 
VHAAAVGMGG VGELAPVRAG RATPPSGITT AQMRKVLLGG TATVLDAAQR AGVRRVVHLS
CVSALGDDAP HAADESAPVG LTGEPRADAI AAAEQTVSAA AAHGAPVTVL RIADAYGPRA
GRWTLWPVLL MRAGRFVLVD GGRGMLSPVH VDDVVSAVMA VAAAPGETVT GQVLHVTGPG
PATVAEFFGR YAAMAQVRAP RSVPARLCEV VDAVDRLPSR QPASAGGRPG LLRGIGAALM
ANVDPRVRVD LGPLTIQDVT RRGTVSGDRI AALVGWRGEV DLDEGMRRTG SWLRDRGLLG
VPEPARRG