Gene Franean1_5344 details


Gene Information

Locus tag: Franean1_5344
Symbol:
ID: 5673678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6443510
End bp: 6444568
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 71%
IMG OID: 641244202
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001509608
Protein GI: 158317100
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
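
The coordinate and length fields above are consistent with one another, and the relationship can be checked with simple arithmetic. The sketch below is only an illustration: it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon, and the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(6443510, 6444568)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1059 352
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.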


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTGC ATGTGATCAC CGGTGCCGGT GGCACCGGAG CCCCCACCGC CGAACTGTTG 
GCCCGGCAGG GTGATCGCGT CCGACTGGTC AGCCGGCGCG GGGGCGGACC CGAGCACCCA
CTGATCGAGC GGATCGCCGC CGACGCGACC GACGCCGACG CGCTGACCCG ACTCGCCGAG
GGCGCGACGA CGCTGATCAA CACCGCGATG CCGCCGTACG ACCGGTGGCC GGACGAGTTC
CCACCGCTCG CGACGGCGCT GCTGGACGCG GCTGAACGCA CCGGCGCCGG CTACGTGATG
ATGGGCAACA CCTACGGCTA CGGCATCGTC AACGGCCGCT TCACCGAAGA TCTACCGATG
GCACCGGTAT CCGCCAAAGG TCAGGTACGG GCCCGGATGT GGAGCGACGC CCTCGAGGCG
CACCGCGCGG GTCGAGCCCG CGTGACCGAG GTCCGGGCCT CGGCGTTTCT GGGCGCCGGG
GCCGGTTCGC TGTACAACTT CACGGTGGCG CCCCTCGTCC TGCGCGGCGA GCCGGCAGCC
TTCCCCGGCG ACCTGGACGC CCCGAAAACC TGGTCCTACG TCGGGGACGC CGCCCGAACC
CTGGCCGCCG TGGCCCTCTC CGGCGACGAC CTTGCGTGGG GACGGGCGTG GCACGTGCCC
TCCACCGCGG CCTTGTCCGT GCGGGAGCTG ACCACGCGGC TCGCGACCGC CGCCGGGGCG
CCCGCACCCA TCCTGACGGC GATGTCCACC GATCAGCTCG CCGCGACCGG AGCCGTGAAC
CCGATCATGC GGGAAGTCAT CGAGATGATG TACTCCCTGG AACAGCCCGA CCTGCTCGAC
TCCACCCTCA CCGAGCAGAC GTTCCGCCTC GCCCCGACCC CCCTCGAGAC CGTCCTGGCT
GAAACCGTCA GCGCCTACGG ACCTGTACCT GACCAGACCC TGACCACCTG TACCAGACCA
GACCAGAACG GTCGGCAGAA TTCCGGGTCG GCACCGACCG CATCTCGCCG GCAATCAGAC
GGTCACGTCG GCGACAGTAC CGCCGTCCGG GGAACGTAA
 
Protein sequence
MPLHVITGAG GTGAPTAELL ARQGDRVRLV SRRGGGPEHP LIERIAADAT DADALTRLAE 
GATTLINTAM PPYDRWPDEF PPLATALLDA AERTGAGYVM MGNTYGYGIV NGRFTEDLPM
APVSAKGQVR ARMWSDALEA HRAGRARVTE VRASAFLGAG AGSLYNFTVA PLVLRGEPAA
FPGDLDAPKT WSYVGDAART LAAVALSGDD LAWGRAWHVP STAALSVREL TTRLATAAGA
PAPILTAMST DQLAATGAVN PIMREVIEMM YSLEQPDLLD STLTEQTFRL APTPLETVLA
ETVSAYGPVP DQTLTTCTRP DQNGRQNSGS APTASRRQSD GHVGDSTAVR GT
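
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the pipeline that produced this record. It assumes the codon assignments of NCBI translation table 11, which for the 64 codons match the standard code, and it does not handle alternative start codons. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers. Only the first 30 and last 9 bases of the gene sequence are used as examples.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCCCCTGC ATGTGATCAC CGGTGCCGGT"))  # MPLHVITGAG
print(translate("GGAACGTAA"))                         # GT
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows comparison with the protein sequence and the GC content field listed in the record.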