Gene Franean1_1514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1514 
Symbol 
ID5669918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1817765 
End bp1818814 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID641240434 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001505860 
Protein GI158313352 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000791042 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCCCAG GGAAATGGGC CCGAGGAATG CGGAAGGGTG GTCATTTCAT GGTCGGCACA 
CACCTGGTCA TGGGAGCGAG CGGTTTCCTC GGTTCGCACG TCACACGGCA GCTCGCGGAG
CGCGGCGACG ACGTCCGCGT GTGGATTCGC CGATCCAGCT CCACGCGGGC TTTCGACGAC
TTACCGGTGC AACGTTGCTA TGGCGAGCTG GTCGACGACG CGGCGATCCG CGAGGCGATG
CACGGCGTCG ACACCGTGTA CTACTGCATT GTCGACACCC GGGCCTGGCT GCGTGATCCG
GCGCCGCTGT TCGCGACGAA CGTCGACGGC CTGCGGCACG CACTGGACGC GGCGCTCGAA
GCCCAGGTGC GGCGCTTCGT GTTCTGCAGC ACCGTCGGCA CGATCGGCCT CTCGCCGGAC
GGCCGCCCGG CCGACGAGAG CGTTCCGCAC ACCTGGGAGC ACCTGGGTGG GCCGTACATC
CAGACGCGCG TCGCCGCCGA GAACCTCGTC CTGCGCTACT GCCGTGAGCA CGGGCTGCCG
GGGATCGTCA TGTGCGTGTC GACGACCTAC GGAGCGCCCG ACCACGGCTC CCCGCACGGC
CGCATGGTGT CCGACGCCGC GAAGGGCAGG CTGCCGTTCT ACTTCGGCAA TGCGGCGATG
GAGGTCGTCG GCATCTCCGA CGCCGCCCGC GCGTTCCTGC TGGCCGCGGA GAAGGGCCGC
GTCGGCGAGC GGTACATCAT CAGCGAGCGT TACATGACCT GGAAGGAACT GGTCACGACG
GCGGCCGACG CCGGCGGCGC GAAGCCGCCG CGCGTGGGGA TCCCGCTCCC CGTGATGAAG
GCCGTCGGTC GCCTCGGTGA CGTGGCGGGG CGCGTACTGC GCCGCGACGT CGTGATGAAC
AGCGTCAGCA CCCGGCTCAT GCACTTCATG CCGCCGCTCG ACCACAGTAA AGCCACCCGG
GAACTCGGCT GGGATCCGTC CCCGACACCG GATGCCGTCC GCGCGGCCGC GAAGTTCTAC
CTCGAGCAAC AGCACCAGAC CGGCCGCTGA
 
Protein sequence
MGPGKWARGM RKGGHFMVGT HLVMGASGFL GSHVTRQLAE RGDDVRVWIR RSSSTRAFDD 
LPVQRCYGEL VDDAAIREAM HGVDTVYYCI VDTRAWLRDP APLFATNVDG LRHALDAALE
AQVRRFVFCS TVGTIGLSPD GRPADESVPH TWEHLGGPYI QTRVAAENLV LRYCREHGLP
GIVMCVSTTY GAPDHGSPHG RMVSDAAKGR LPFYFGNAAM EVVGISDAAR AFLLAAEKGR
VGERYIISER YMTWKELVTT AADAGGAKPP RVGIPLPVMK AVGRLGDVAG RVLRRDVVMN
SVSTRLMHFM PPLDHSKATR ELGWDPSPTP DAVRAAAKFY LEQQHQTGR