Gene Franean1_2074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2074 
Symbol 
ID5670475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2498496 
End bp2499605 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content71% 
IMG OID641240996 
Producttransaldolase 
Protein accessionYP_001506417 
Protein GI158313909 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0176] Transaldolase 
TIGRFAM ID[TIGR00876] transaldolase, mycobacterial type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.195671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0887759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC CCCTGTCCGA CCTGTCAGCC GCCGGCGTGG CGGTGTGGCT GGACGACATC 
AGCCGCGAGC GGATCCGGAC CGGCAACCTC GCCGAGCTCG CCCGCACCCG CAGCGTCGTC
GGCGTCACCA GCAACCCGAC GATCTTCCAG AAGGCCATCG GCGGCGGTGA GACGTACAAC
GAGCAGCTCC GCGACCTGGC CGTTCGCGGG GTCGACGTGG GCGAGGCCGT CCGCGCGATC
ACCGCGGCGG ACATCCGCGA CGCCTGCGAC ATCCTGCGGC CCGCCTACGA CGCCAGCGCC
GGCGTCGACG GCCGGGTCTC CCTCGAGGTC GACCCGCGGC TCGCACACGA GACCGAGCGC
ACGGTCGCCG AGGCCCGTGC CCTGTGGTGG TCGGTCGACC GGCCGAACCT GTTCATCAAG
ATCCCGGCGA CGAAGTCCGG CCTGCCGGCC ATCACCGCGA CGCTGGCGCA GGGCATCAGC
GTGAACGTGA CGCTGATCTT CGCGCTGGAC CGCTACGAGG CCGTCATGGA CGCGTTCATG
ACCGGTCTGG AGCAGGCCCT CGCCGCCGGT CGGGACATCT CCGACGTGGC CTCTGTCGCG
TCCTTCTTTG TCAGCCGCGT CGACAGCGAG GTGGACGGCC GGCTCGCGAA GATCGGCACG
CCGAAGGCGG AGGCCCTGCG CTCGAAGGCC GCGATCGCCA ACGCCCGGCT CGCCTACGAG
CTGTACGAGA AGATCTTCAG CACGCCGCGC TGGGAGCGGC TCGCCGCCGC CGGCGCGAAG
CCCCAGCGCC CGCTGTGGGC CTCAACGTCG ACGAAGGACC CGGGGCTGCC GGACACCCTC
TACGTGACGG AGCTGATCGC ACCGGGCACC GTCAACACGA TGCCGGAGGC GACGCTCGAG
GCGTTCGCCG ACCACGGGGT CGTGCCCGGC GACACCATCA CGCCCAACTA CGAGGACGCC
CGCGCCGTCC TGGCCGAGCT CACCGAGCTC GGAGTGGACA TGGCCGACGT CGTCGAGGTG
CTGGAGGTCG AGGGCGTCCG CAAGTTCGAG GACTCCTGGA ACCAGCTCCT CGACACCATC
CGCGAGCAGC TCGGCTCCGC CGCGTCCTGA
 
Protein sequence
MSKPLSDLSA AGVAVWLDDI SRERIRTGNL AELARTRSVV GVTSNPTIFQ KAIGGGETYN 
EQLRDLAVRG VDVGEAVRAI TAADIRDACD ILRPAYDASA GVDGRVSLEV DPRLAHETER
TVAEARALWW SVDRPNLFIK IPATKSGLPA ITATLAQGIS VNVTLIFALD RYEAVMDAFM
TGLEQALAAG RDISDVASVA SFFVSRVDSE VDGRLAKIGT PKAEALRSKA AIANARLAYE
LYEKIFSTPR WERLAAAGAK PQRPLWASTS TKDPGLPDTL YVTELIAPGT VNTMPEATLE
AFADHGVVPG DTITPNYEDA RAVLAELTEL GVDMADVVEV LEVEGVRKFE DSWNQLLDTI
REQLGSAAS