Gene Franean1_4354 details


Gene Information

Locus tag: Franean1_4354
Symbol:
ID: 5672709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5198276
End bp: 5199241
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 71%
IMG OID: 641243227
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001508644
Protein GI: 158316136
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
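
The coordinates and lengths in this record are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(5198276, 5199241)
print(length)                     # 966
print(protein_length_aa(length))  # 321
```

Both values match the Gene Length and Protein Length fields above.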


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTGC ATGTGATCAC CGGTGCCGGT GGCACCGGAG CCCCCACCGC CGAACTGTTG 
GCCCGGCAGG GTGATCGCGT CCGGCTGGTC AGCCGGCGCG GGGGCGGACC CGAGCACCCA
CTGATCGAGC GGATCGCCGC CGACGCGACC GACGCCGACG CGCTGACCCG ACTCGCCGAG
GGCGCGACGA CGCTGATCAA CACCGCGATG CCGCCGTACG ACCGGTGGCC GGACGAGTTC
CCACCGCTCG CGACGGCGCT GCTGGACGCG GCTGAACGCA CCGGCGCCGG CTACGTGATG
ATGGGCAACA CCTACGGCTA CGGCATCGTC AACGGCCGCT TCACCGAAGA TCTACCGATG
GCACCGGTAT CCGCCAAAGG TCAGGTACGG GCCCGGATGT GGAGCGATGC CCTCGAGGCG
CACCGCGCGG GTCGAGCCCG CGTGACCGAG GTCCGGGCCT CGGCGTTTCT GGGCGCCGGG
GCCGGTTCGC TGTACAACTT CACGGTGGCG CCCCTCGTCC TGCGCGGCGA GCCGGCAGCC
TTCCCCGGCG ACCTGGACGC CCCGAAAACC TGGTCCTACG TCGGGGACGC CGCCCGAACC
CTGGCCGCCG TAGCCCTCTC CGGCGACGAC CTTGCGTGGG GACGGGCGTG GCACGTGCCC
TCCACCGCGG CACTGTCCGT GCGGGAGCTG ACCACGCGGC TCGCGACCGC CGCCGGGGCG
CCCGCACCCA TCCTGACGGC GATGTCCACC GATCAGCTCG CCGCGACCGG AGCCGTGAAC
CCGATCATGC GGGAAGTCAT CGAGATGATG TACTCCCTGG AACAGCCCGA CCTGCTCGAC
TCCACCCTCA CCGAGCAGAC GTTCCGCCTC GCCCCGACCC CCCTCGAGAC CGTCCTGGCT
GAAACCGTCA GCGCCTACGG ACCTGTACCT GACCTGACGG TCAGCAGGAT TCCGTGTTGG
AGCTGA
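
The gene sequence above is printed in blocks of ten bases, so it has to be cleaned before values such as the GC content can be recomputed from it. The following is a minimal sketch, assuming the sequence contains only the letters A, C, G and T once whitespace is removed. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two blocks of the gene sequence, used only as a short demonstration.
first_blocks = clean_sequence("ATGCCCCTGC ATGTGATCAC")
print(len(first_blocks), gc_fraction(first_blocks))
```

Passing the full gene sequence through the same two functions gives a value that can be compared against the GC content field in the Gene Information section.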
 
Protein sequence
MPLHVITGAG GTGAPTAELL ARQGDRVRLV SRRGGGPEHP LIERIAADAT DADALTRLAE 
GATTLINTAM PPYDRWPDEF PPLATALLDA AERTGAGYVM MGNTYGYGIV NGRFTEDLPM
APVSAKGQVR ARMWSDALEA HRAGRARVTE VRASAFLGAG AGSLYNFTVA PLVLRGEPAA
FPGDLDAPKT WSYVGDAART LAAVALSGDD LAWGRAWHVP STAALSVREL TTRLATAAGA
PAPILTAMST DQLAATGAVN PIMREVIEMM YSLEQPDLLD STLTEQTFRL APTPLETVLA
ETVSAYGPVP DLTVSRIPCW S
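
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string used for the standard genetic code. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because this gene starts with ATG. The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCCCCTGC ATGTG"))  # MPLHV, the first five residues of the protein
print(translate("AGCTGA"))            # S, the last residue followed by the stop codon
```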