Gene Franean1_7054 details

Gene Information

Locus tag: Franean1_7054
Symbol:
ID: 5675365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8608574
End bp: 8609617
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 69%
IMG OID: 641245900
Product: glycosyl hydrolase family 32 protein
Protein accession: YP_001511291
Protein GI: 158318783
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:
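
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 8608574
end_bp = 8609617
gene_length_bp = 1044
protein_length_aa = 347

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
residues = codons - 1

print(span, codons, residues)
assert span == gene_length_bp
assert remainder == 0
assert residues == protein_length_aa
```

Under these assumptions the reported gene length and protein length are consistent with the reported coordinates.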


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCGAC TCCCGGACGT CTGGACATGG GACTTCTGGT TCGCCGACGA CGGCGTGCAC 
TACCATATGT TCTTCCTCAA GGCATCCCGC GCTCTCGGCG ATCCCGACCA GCGCCACTGG
AAGGCCGTCG TCGGGCATGC CGTCTCGTCC GACCTCACCA GGTGGGACGA GGTCGCGGAC
GCGGTCGCGC CCAGCGACCC GGTGGCCTTC GACGACATCG CCACCTGGAC CGGATCGGTC
GTCCGGGACG ACGACGGAAC CTGGATGATG TTCTACACCG GTGCCAGTTC CGGCGAGCGG
GGGCTCAAAC AGCGGATCGG CCTCGCCACC TCCACAGACC TCCACACCTG GGAGAAACAT
CCGGCAGCGC CGGTGCTGGA GAGTGATCCG CACTGGTACG AGCAGCTGGT CGACGCACAG
TGGCCCGACG AGGCATGGCG CGACCCGTGG GTGTTGCGTG ACCCGGCCGG CGATGGCTGG
CACATGCTGG TCACCGCGCG CGCCGCGACC GGGCCCGGTG ACCAACGGGG CGTGATCGGA
CACGCCCGTT CCCACGACCT GGTGCACTGG TCGGCCCAGC CACCCCTGAG CCTGCCGCAG
ACGGGTTTCG GACATCTCGA GGTCCCCCAG GTCGAGGTCG TCGACGGTCG TCCCGTGTTG
GTGTTCTCGT GCCTGCGGGG TGAGCTCTCC CGGGAACGCC GTGAGCGCGG CGTACACGGC
GGAACGTGGT GCGTTCCCGT CGAGACGCTT CTGGGTCCCT ACGATGTCAC GCGCGCGGTG
CAGGTGACCG ACGAGTCCCT CTACAGCGGA CGCCTGGTCC GGGACCGGTC CGGGCGCTGG
GTGATGCTCG CCTTCCACAA CGTGGTGGAC GGCGGTCGGT TCGTCGGGGA GATCAGTGAT
CCGATGTACG TCTCGTGGGC GCAGGACGGC ACCAGGCTGG TGCTCAGCGG GACGGCGCGA
GACGACCCGG GGCGCGCCTC GGACGACGCC ACCGCCAGCC ACGACGACCC GGGGCTCAAC
CCGGTCTCTC GGCCGGGGGT GTGA
 
Protein sequence
MLRLPDVWTW DFWFADDGVH YHMFFLKASR ALGDPDQRHW KAVVGHAVSS DLTRWDEVAD 
AVAPSDPVAF DDIATWTGSV VRDDDGTWMM FYTGASSGER GLKQRIGLAT STDLHTWEKH
PAAPVLESDP HWYEQLVDAQ WPDEAWRDPW VLRDPAGDGW HMLVTARAAT GPGDQRGVIG
HARSHDLVHW SAQPPLSLPQ TGFGHLEVPQ VEVVDGRPVL VFSCLRGELS RERRERGVHG
GTWCVPVETL LGPYDVTRAV QVTDESLYSG RLVRDRSGRW VMLAFHNVVD GGRFVGEISD
PMYVSWAQDG TRLVLSGTAR DDPGRASDDA TASHDDPGLN PVSRPGV
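
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The sketch below is a hedged illustration, not the database's own pipeline: it builds the standard codon-to-amino-acid mapping (translation table 11 uses the same mapping as the standard code and differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The `translate` helper and the variable names are hypothetical. Only the first and last rows of the gene sequence are used.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGCTGCGAC TCCCGGACGT CTGGACATGG GACTTCTGGT TCGCCGACGA CGGCGTGCAC"
last_row = "CCGGTCTCTC GGCCGGGGGT GTGA"

print(translate(first_row))  # MLRLPDVWTWDFWFADDGVH
print(translate(last_row))   # PVSRPGV
```

Both fragments match the start and the end of the listed protein sequence. The last row is in frame because the 1020 bases before it are a multiple of three. Passing the full gene sequence to `translate` would allow the same comparison against the complete protein.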