Gene Franean1_0788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0788 
Symbol 
ID5669204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp915311 
End bp916339 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content68% 
IMG OID641239716 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001505152 
Protein GI158312644 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.266201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.21169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTGC TCGTCACCGG TACGGAGGGC TACCTGGGCT GCCTGCTTGC CCCGGAGCTG 
CTGCGCGACG GCCACGACGT GGTCGGGGTG GACACCGGTT ACTACAAGTA CGGCTGGCTC
TACCGCGGCA CGGACCGTGT CCCGCACACG ATCGACAAGG ACCTGCGCGA CCTCACCGTC
GAGGATTTCG AGGGCGTCGA CGCGGTCGTG CACATGGCGG AGCTGTCGAA CGACCCGCTG
GGCGCGCTGG CACCCGACGT GACCTACAAG GTGAACCACC AGGGGTCCGT GCGGCTCGCG
AAGCTGGCGA AGCAGGCCGG CGTCCAGCGG TTCGTCTACA TGTCGTCGTG CAGCGTCTAC
GGCGTCGCGA CCGGGTCGGA CGTCACGGAG ACCTCGCCGG TCAACCCGCA GACGCCGTAC
GCCGAGTGCA AGGTCTACGT CGAGCGGGAC GTCGCGCCGC TGGCGGACGA CACCTTCTCA
CCGACGTTCC TGCGCAACGC CACCGCGTAC GGCGCCTCGC CGCGGATGCG GTTCGACATC
GTGCTGAACA ACCTGGCCGG GGTCGCCTGG ACCACGAACG AGATCGCGAT GACCTCGGAC
GGCACCCCGT GGCGCCCGCT GGTGCACGGC CTGGACATCG CCAAGGCGAT CCGGTGCGTG
CTCACCGCGC CGCGCGACGC CGTCCACAAC GAGATCTTCA ACGTGGGTGA CAGCGCGCAG
AACTACCAGG TGAAGGAGAT CGCGGACGCG GTCGCCACCG TCTTCACCGG CTGCAAGCTG
AGCTTCGGCG ACAACGGCGG GGACAACCGC AGCTACCGGG TGTCGTTCGA CAAGATCGCC
TCCCAGCTCC CGGGCTTCTC CTGCGACTGG GACGCGCACA AGGGAGCCGA GCAGCTCCAC
GAGGTGTTCA GCCGCATCCA GCTCGACACC GAGACGTTCA CCGGCCGCGG GCACACCCGG
CTCAAGCAGC TGCAGTACCT GATCGGCACC GGCCAGGTCG ACGCCGAGCT GTTCTGGACC
GCCCGGTGA
 
Protein sequence
MKVLVTGTEG YLGCLLAPEL LRDGHDVVGV DTGYYKYGWL YRGTDRVPHT IDKDLRDLTV 
EDFEGVDAVV HMAELSNDPL GALAPDVTYK VNHQGSVRLA KLAKQAGVQR FVYMSSCSVY
GVATGSDVTE TSPVNPQTPY AECKVYVERD VAPLADDTFS PTFLRNATAY GASPRMRFDI
VLNNLAGVAW TTNEIAMTSD GTPWRPLVHG LDIAKAIRCV LTAPRDAVHN EIFNVGDSAQ
NYQVKEIADA VATVFTGCKL SFGDNGGDNR SYRVSFDKIA SQLPGFSCDW DAHKGAEQLH
EVFSRIQLDT ETFTGRGHTR LKQLQYLIGT GQVDAELFWT AR