Gene Franean1_0963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0963 
Symbol 
ID5669377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1126323 
End bp1127315 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content73% 
IMG OID641239891 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001505325 
Protein GI158312817 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.409004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.187686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTATCC TCGTCACCGG CGCGGCCGGT TTCATCGGTT CCACCGTCGT CGATCGGATG 
CTTGCTGACG GTCATTCGGT CGTGGGCATC GACGACCTGT CGTCCGGGCG CATGGAGAAT
CTCACGCAGG CGGCGACCGA TGCCCGGTTC TCGTTCGAGA AGGGCGATAT CACGTCGCCC
GATCTCGGTG ACTTCGTCGC CCGGGTCCGC CCCGACGCGG TGGCTCATCT CGCGGCGCAG
ATCGACGTCC GGATCAGCGT CGCCGACCCG CTGCTCGACG CCCGGCTGAA CGTTCTCGGC
ACGATCAACG TGCTGGAGGC GGCCCGGGCC GCCGGGGTGG TGAAGGTCAT CCACACCTCG
TCCGGCGGGT CGATCTACGG CACGCCGGCC GCGCTGCCCG TCGACGAGTC CGTGCCACCC
GCGCCCGAGT CACCGTACGC GGCCGGGAAG GCCGCCGGCG AACTGTACCT CAACGTGTAC
CGGGTGACCT ACGGTGTCGC GACGACGGCG CTGGCGCTCG GGAACGTCTA CGGGCCCCGC
CAGGACCCGC ACGGCGAGGC CGGGGTGGTC GCCATCTTCG GCACCGCCCT GCTCGAGGGG
CGCCCGACCA AGATCTTCGG TGACGGCGCG ACCAGCCGGG ACTACGTCTT CGTCGGGGAC
GTCGCCGACG CCTTCGCCCG GTGCGTGCCG GCCCAGGCGG CCAACGGCCT GCGGATCAAC
ATCGGGACCG GCGCCGAGAC CACCGTTCTC GACCTGCACA GCCGCATCGC GCGGGTGGTC
GGGGTGCCGG ACGAGCCCCA GTTCGCCCCG CCGCGCCCCG GCGAGCTGCA GCGCATCAGC
CTGGACGTCG GCCTCGCGGA GCGGGAGATC GGCTGGCGGC CGCGGATGGA CCTGGACGGC
GGGCTCACCC GGACCGTCGA CTGGATCCGG GCCCGGATCG GCGCCCGCGC CGCCGCCTCC
GGCTCGGCCG GCGCGACCGG CGCGACCGGC TGA
 
Protein sequence
MRILVTGAAG FIGSTVVDRM LADGHSVVGI DDLSSGRMEN LTQAATDARF SFEKGDITSP 
DLGDFVARVR PDAVAHLAAQ IDVRISVADP LLDARLNVLG TINVLEAARA AGVVKVIHTS
SGGSIYGTPA ALPVDESVPP APESPYAAGK AAGELYLNVY RVTYGVATTA LALGNVYGPR
QDPHGEAGVV AIFGTALLEG RPTKIFGDGA TSRDYVFVGD VADAFARCVP AQAANGLRIN
IGTGAETTVL DLHSRIARVV GVPDEPQFAP PRPGELQRIS LDVGLAEREI GWRPRMDLDG
GLTRTVDWIR ARIGARAAAS GSAGATGATG