Gene Franean1_4528 details


Gene Information

Locus tag: Franean1_4528
Symbol:
ID: 5672877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5401735
End bp: 5402856
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 74%
IMG OID: 641243393
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001508809
Protein GI: 158316301
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
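
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5401735
end_bp = 5402856

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1122
print(protein_length)   # 373
```

Under these assumptions the result agrees with the recorded gene length of 1122 bp and protein length of 373 aa.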


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCGTT CCGCGCTTGT CATCGGTGGC ACCGGGCCGA CCGGGCCGGG AGTGGTGCGC 
GGACTGCTCG ACCGCGGCTT CGACGTGACG ATCCTGCACG GCGGCCAGCA CGAGGTCGAG
CTGCCCGCCG AGGTCCGCCA CATCCACACC GACCCGCACT GGCCGGAAAC GCTCGGCGCC
GGCCTGGGCC GGGCGGAGTT CGACCTCGTG GTGGCCCAGT ACGGGCGGCT CGCGGTCACC
GCGCAGGTCC TCGCCGGCCG CACCGACCGG GTGGTTGCCG TCGGGGGTGC CCACGGCTCC
CTCGCGCATG CCGCCGATCC GCGCTGGGGA GCGCTGGGTC GGCCCGCCCT GCCCCCCGAA
GAGGGCGAGC ACCTGGAACA GGACCCCGCA CACGGCACGC TCGCGGTCAG GATGGCCGCC
GCGGAACGGG CCCTGTTCGA CGCCCACGCG GCAGGCGCGT TCGCGGCGAC CCTGCTGGCC
TATCCCGTGG TCTACGGCCC ACGGCAGATC GCGCCGCACG AGTGGTGCAT CGTGCGGCGG
ATCCTCGACG GCAGGCGCCG CATCGTCGTC GCCGACGGGG CCATCCGCAT GGAGACCAGG
CTCTACACCG AGCATGCGGT GCATGCCGTG CTGCTGGCGG TGGACCACCG CGCGGCCGCG
TCCGGGCGCA AGTTCGTGGT CGCCGACGAC AACGTGTTCA CGATGCGCCA GCGCATCGAG
TTCATCGCCG CCCGGCTCGG CGTGGACGTG GAACTGGTCG ACATGCCCTA CCCGCTGGCG
ACCCCGTGCC ACCCGTACTG GCGCCACGGG CCCGACCACC GGCTGCGCGG CAACGCCCGC
ATCCGGGCGG AGCTGGGCTA CGCCGACACC ACCCCCGCGG CGGACGCCCT CGGCGCGACG
GTCGGCTGGC TGCTCGAACA CCCGCCGGTG CCCGGCGGTC CGGAGGAGAA GCGGCTCGGT
GACCGGTTCG ACTACGCCCA TGAGGACGAT CTGATGGAAC GCTGGGCCGC GGTGCGCCGC
GACTGGTCCG ATGACGCGGA CCCGTTCCGG CCGACCCATG CCTACCGCCA CCCGCGCCGT
CCCGACGAGA CGTGGTACCC AGGCGGTACG GCCACGCCGT AG
 
Protein sequence
MSRSALVIGG TGPTGPGVVR GLLDRGFDVT ILHGGQHEVE LPAEVRHIHT DPHWPETLGA 
GLGRAEFDLV VAQYGRLAVT AQVLAGRTDR VVAVGGAHGS LAHAADPRWG ALGRPALPPE
EGEHLEQDPA HGTLAVRMAA AERALFDAHA AGAFAATLLA YPVVYGPRQI APHEWCIVRR
ILDGRRRIVV ADGAIRMETR LYTEHAVHAV LLAVDHRAAA SGRKFVVADD NVFTMRQRIE
FIAARLGVDV ELVDMPYPLA TPCHPYWRHG PDHRLRGNAR IRAELGYADT TPAADALGAT
VGWLLEHPPV PGGPEEKRLG DRFDYAHEDD LMERWAAVRR DWSDDADPFR PTHAYRHPRR
PDETWYPGGT ATP
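
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 (the bacterial, archaeal and plant plastid code), which uses the same codon-to-amino-acid assignments as the standard code but accepts alternative start codons such as GTG, read as methionine at the first position. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first 60 bases of the gene sequence are used to keep the example short, so the GC value applies to that fragment and not to the whole gene.

```python
# Codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is translated as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from the record (60 bases).
fragment = "GTGTCGCGTT CCGCGCTTGT CATCGGTGGC ACCGGGCCGA CCGGGCCGGG AGTGGTGCGC"

print(translate(fragment))    # MSRSALVIGGTGPTGPGVVR
print(gc_fraction(fragment))  # 0.75
```

The translated fragment matches the first 20 residues of the protein sequence shown above. Running the same functions on the full gene sequence should reproduce the complete protein, with translation stopping at the final TAG codon.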