Gene Franean1_5661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5661 
Symbol 
ID5673988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6873295 
End bp6874284 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content75% 
IMG OID641244515 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001509918 
Protein GI158317410 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.894525 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCATCC TCGTCACCGG CGCGTCCGGG TTCGTCGGCG GCGTCACCGC CGACCTGCTC 
TCGGCCGCCG GCCACCAGGT GACCGCGCTC GTCCGGGACG CGACGGCCCG GACGAGGCTG
TCGAGGGTGA TCGAGGTGGT CCAGGCCGAC CTGCTCGAAC CACGCCAGCT CGCCGCGGCG
GGCGTCAGCC GCGGGTTCGA CGGGGTGTGC CACCTCGCCG CTCTGACCAG GGTGCGGGAG
TCCCGCGAGA CGCCGCTGCG GTACTTCGCG GCGAACGTCA CGGGCACGAC CAACCTGCTG
GCGGCCCTCG ACGCGGGCAC CCGGGCCACC GGGGTGGCCC CGCGGTTCGT CTTCGGCTCG
AGCTGCGCCG TCTACGGGGA CACGGGTACC TCCCCTATCC CGGAGACACG CGCACCCGCG
CCGACGAATC CCTACGGCGC CTCGAAACTC GCCGCGGAGC AGGCGGTGGC CTACCAGGCC
GCCACCGGGC GGCTGGGCGC CGTCGTGCTG CGCTCGTTCA ACGTCGCGGG GGCGGTCGGC
TCGCACGCCG ACCGCGACAG CAGCCGGATC ATCCCGGCCG CGCTCGGCGT CGCAACGGGC
CGGCGCGACG CCTTCCGGGT GAACGGTGAC GGCGCGTCGA TCCGCGAGTA CGTCCACGTC
GTCGACATGG CGCGGGCGTA CCTGACCGCG CTGCGGGCGA CCGTGCCGGG CCGCTGCACC
GTCTACAACG TCGGCAGCGG CCTCGGCGTG AGCGTCACCG ACGTGCTGCG GACGGTGGAG
AGCGTGACGG GCCGGGACGT GCCGCGGGTG ACCCTGCCCC CGGTGCCCGA ACCCAGAGCG
CTCATCGCCG ACAGCCGCCG CATCCGGGCC GACCTGGGCT GGACCTCTCC GTCCTCGACC
ATCGAGAAGA TCGTCACGGA TGCCTGGCGC TCGACGGCGG TGCCCGAGCC GGTCGCGGCG
CGGCGCGGCG ACGTCCCGAT CGTCTCGTGA
 
Protein sequence
MRILVTGASG FVGGVTADLL SAAGHQVTAL VRDATARTRL SRVIEVVQAD LLEPRQLAAA 
GVSRGFDGVC HLAALTRVRE SRETPLRYFA ANVTGTTNLL AALDAGTRAT GVAPRFVFGS
SCAVYGDTGT SPIPETRAPA PTNPYGASKL AAEQAVAYQA ATGRLGAVVL RSFNVAGAVG
SHADRDSSRI IPAALGVATG RRDAFRVNGD GASIREYVHV VDMARAYLTA LRATVPGRCT
VYNVGSGLGV SVTDVLRTVE SVTGRDVPRV TLPPVPEPRA LIADSRRIRA DLGWTSPSST
IEKIVTDAWR STAVPEPVAA RRGDVPIVS