Gene Franean1_6561 details


Gene Information

Locus tag: Franean1_6561
Symbol:
ID: 5674876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7980567
End bp: 7981616
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 66%
IMG OID: 641245410
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001510804
Protein GI: 158318296
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
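
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 7980567
end_bp = 7981616

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1050
print(protein_length)  # 349
```

Both results agree with the Gene Length (1050 bp) and Protein Length (349 aa) reported in the table.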


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATTC TTGTGACCGG ACACGACGGA TATATAGGAA CCCGCCTCGT CCCGTTCCTC 
CGGCAAGCCG GCCATGACGT CGCCGGCCTG GACAGCATGC TGTTCTCCGA CTGCACGCTC
GGGACCGAAC CGGACTCCGT GCCGGCGCTC GCCCTCGACA TCCGCGACGT CCGCCCCTCC
CATCTGGAGG GGTTCGACGC CGTGATTCAT CTGGCCGGAA TCTCCAACGA CCCACTGGGA
GATCTCAATC CCCGCACCAC CTACGACATC AATGCACGCG GGACGTTGAT GATCGGCAGC
GCGGCCAGGC AGGCCGGCGT GCCGCGATTC GTCTTCTCGT CCTCCTGTAG CCTTTATGGC
GCCCATGGGG ACGCCCCCAT CGACGAATCC GCCGAGTTCC ATCCGGTGAC GCCGTACGGG
GAGTCGAAGG TGATCGCCGA ACGCGAGCTC ACCGCGCTCG CCGACGATGG TTTCAGTCCG
GTCTTCCTCC GCAACGCGAC CGCCTACGGG GTGTCACCCA GACTGCGCGG CGACCTGGTG
GTCAACAACC TGACGGGATA TGCGGTCACG ACCGGCAAGG TGTACCTCAA GAGCGACGGG
ACGCCATGGC GTCCGCTGGT CCACATCGAG GACATCGCCC GGGCGATGCT CGCGGTCTGC
GAGGCACCGC GGGAGGCGAT CCATTGCAAG GCGTTCAACG TCGGCCGGTC GGGCGAGAAC
TACCGGATAC GTGAGGTCGC CGAGATCGTC GAGGATGTCG TACCTGGCAG CCGGGTTGTC
TTCGCCGACG AGGCCGGACC GGACAAGCGG AACTATCGGG TCGATTGCGA CCGCATCGCA
CGGGAGATAC CCGGATTCCA GCCGGTGTGG ACGGTGCGCA AGGGCGTGGA GGAGCTGCAC
GCCGCCTACC TGGCAGCCGA GCTGGCCAAG GAGGACCTGA TCGGGGCGCG CTTCCAGCGG
ATCCGGCGCA TCCAGGAGCT CATGGCGGAA GGTCTACTCG ACAACTCCCT GCGGCCGATC
AGAAGGGAGC GGGTGCCATG CGCGACCTGA
 
Protein sequence
MRILVTGHDG YIGTRLVPFL RQAGHDVAGL DSMLFSDCTL GTEPDSVPAL ALDIRDVRPS 
HLEGFDAVIH LAGISNDPLG DLNPRTTYDI NARGTLMIGS AARQAGVPRF VFSSSCSLYG
AHGDAPIDES AEFHPVTPYG ESKVIAEREL TALADDGFSP VFLRNATAYG VSPRLRGDLV
VNNLTGYAVT TGKVYLKSDG TPWRPLVHIE DIARAMLAVC EAPREAIHCK AFNVGRSGEN
YRIREVAEIV EDVVPGSRVV FADEAGPDKR NYRVDCDRIA REIPGFQPVW TVRKGVEELH
AAYLAAELAK EDLIGARFQR IRRIQELMAE GLLDNSLRPI RRERVPCAT
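
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is not the database's own tooling: it is a minimal example that assumes the standard codon assignments (which translation table 11 shares with the standard code for non-start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same codon assignments as the standard
# code; it differs mainly in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCGCATTC TTGTGACCGG ACACGACGGA TATATAGGAA CCCGCCTCGT CCCGTTCCTC"
last_row = "AGAAGGGAGC GGGTGCCATG CGCGACCTGA"

print(translate(first_row))  # MRILVTGHDGYIGTRLVPFL
print(translate(last_row))   # RRERVPCAT*
```

The two translated fragments match the start and end of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon. Applying `gc_fraction` to the full gene sequence would be the way to check the GC content field.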