Gene Franean1_1270 details

Gene Information

Locus tag: Franean1_1270
Symbol:
ID: 5669683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1527878
End bp: 1528849
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 74%
IMG OID: 641240202
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001505630
Protein GI: 158313122
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID:
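
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1527878, 1528849)
print(length)                  # 972
print(protein_length(length))  # 323
```

Both results match the Gene Length (972 bp) and Protein Length (323 aa) fields.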


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.473487
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.360955
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGTCG TGGTGACCGG GGGCGCCGGG TTCATCGGTG CCCACCTGAC TCGTGCGCTG 
CTGGCCCGTG GCTGCGAGGT GGTCGTCGTC GACGACCTCA GCACCGGAGC CCGGTCGAAC
CTGATCGGCC TGCCCGCGCG CCTCGTCCTC GGCAGCGTCA CCGACCGGGA GCTGCTGGAG
GACGCCTGCA CCGGGGCGGA CAGCGTCGTC CATCTGGCGG CGCGGCCCTC CGTCGAGCGT
TCGCTGCTCG ACCCGCTGGC CACCCACCAC GTCAACGCGA CCGGCACACT GACCGTGCTC
GACGTGGCAC AGCGGGGCGA GACCCACGTG GTGGTCGTCT CCTCCGCTCT GGTATACGGG
ACGTCCGGCG GCCGCTCCCA GAGCGAGGAC GACCCGCCGC GTCCGACCAG CCCGTACGCG
GCCAGCGCGC TGGCGGCCGA GGGGTACGCC CTGGCGCACC AGGCGAGCTT CGGGCTGCCG
GTCCTGGTCG CGCGGCTGTT CAACGTCTAC GGCCCGTACC AGCCGGCCCG GCACGCCCAC
GCCGCTGTCG TGCCGTCGTT CATCGACGCC GCGCTGCGTG GACGTCCGCT GCCGGTGCAC
GGCGACGGCC GGCAGACCAG GGACTTCACC TACGTCGAGG CGGTCGCCGG GCTGCTGGCC
GACGCGGCCT GCCGTCGGCT GGCCCACCCC GGGCCGGTGA ACCTGGCGTT CGGGACGAGC
ACCGACCTGC TGGCGTTGGT CGCCCACCTC GAGGACATCC TCGGCCGCCG GCTGGACGTG
GCGCACGGCG CGCCGCGCGC CGGGGACGTC CGCGACTCGC GCGCCCGCCC GGACGCCATG
CACGCGCTGT TCGGCTCCGT CGAGCGCTCT GACCTGCGGG CGACCCTGGA GGAGACCGTG
TACTGGTACC AGGAACGCCT CGGCGAGGCC CCCGGCGCCG CGCCCAGATC CGCGGACACC
ACCCACCACT GA
 
Protein sequence
MRVVVTGGAG FIGAHLTRAL LARGCEVVVV DDLSTGARSN LIGLPARLVL GSVTDRELLE 
DACTGADSVV HLAARPSVER SLLDPLATHH VNATGTLTVL DVAQRGETHV VVVSSALVYG
TSGGRSQSED DPPRPTSPYA ASALAAEGYA LAHQASFGLP VLVARLFNVY GPYQPARHAH
AAVVPSFIDA ALRGRPLPVH GDGRQTRDFT YVEAVAGLLA DAACRRLAHP GPVNLAFGTS
TDLLALVAHL EDILGRRLDV AHGAPRAGDV RDSRARPDAM HALFGSVERS DLRATLEETV
YWYQERLGEA PGAAPRSADT THH
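
The relationship between the two sequence blocks above can be spot-checked with a short script. This is a minimal sketch, not the method used to produce the record. It assumes that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code, and it does not handle alternative start codons. All names in it (`clean`, `gc_content`, `translate`) are hypothetical helpers. For brevity it is run only on the first and last rows of the gene sequence.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 assigns amino acids as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = "ATGAGGGTCG TGGTGACCGG GGGCGCCGGG TTCATCGGTG CCCACCTGAC TCGTGCGCTG"
last_row = "ACCCACCACT GA"  # in frame: 960 bases precede it

print(translate(first_row))            # MRVVVTGGAGFIGAHLTRAL
print(translate(last_row))             # THH
print(f"{gc_content(first_row):.1%}")  # 70.0%
```

The translated fragments agree with the start and end of the protein sequence. The 70.0% figure covers only the first 60 bases, so it is not directly comparable to the GC content reported for the whole gene.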