Gene Franean1_1546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1546 
Symbol 
ID5669949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1846979 
End bp1848034 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content71% 
IMG OID641240465 
Productputative esterase 
Protein accessionYP_001505891 
Protein GI158313383 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2382] Enterochelin esterase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.828639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.764024 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTTCGG CACTGGGTTC GCTGGTCATC GTCGCGACGT GGTTCCTGTT GGTCCTCGGG 
GCGCTGTCGG CGATCTGCTG GGCGGGCTGC GTGTGGATGA CCCGCCGACG GCGGGCCATG
GCCGTCGGGC TGGGCTTCCT GGCCGCGCTG CTCACGCTGG CGACGGCGGC GGACACCGCC
AACGCCCACT ACGGCTATCT GCCACGGGCC GCTGACGTCC TCGGGCTGAC CTCCTGGCCG
ACCGCGTCGG TGCGCGAGGT CGTCGGCCCG GCGCCGCGGC CGCATCCCGA CGGCGCGGTC
GTCCACCTGC CGATCGCCGG CGTTCACAGC GGGTTCGGTA CCCACAGCGC ACTGGTGTAC
GTCCCGCCGC AGTATTTCAC CGACCCGGGC GCCCGATTCC CGGTCGTCTA TCTTTTCCAC
GGCTCCCCGG GAATTCCGCT CGACTGGTAC CGGGCGGGGC AGGCGGCGAA GACCGGCGCG
GCCCTGGCAC GCGCGGGCCG GCCCGCGATC CTCGTCGCCC CGCCGCTGGG TCATGGCTGG
CTCGATGACA GTGAGTGCGT CGACCGTCCC GGGGAACGGA TCGAGACCTA CCTCGTCGAC
GATGTTCTCC CGACCGTCGA CAATCTCCTG CGCGCCATTC CCGACCGGGC GGACCGCGTC
TTCGCCGGGA TATCCGCGGG CGGTTTCTGC GCGCTGAACC TCGGGCTGCG CCACCGCGAT
CTCGTCGGGA CGATCGTGGA CATCTCCGGG TTGGCGAGGC CGACCCATTC CGGCGGAATG
ACCGGCCTTT TCGGGAATCG TCCGGACCTC GCCGCCGTCA CCGCGGCCAA CACCCCGGAA
AGCTATTCCG CGACGCTGCC GCCGAATCCA CCGACCCGGG TCTGGCTGAG CTGTGGACTC
ATGGACTTCG GGCCGCTCGG CGACATCAGG AAAATGGCGC TGGCCCTGTA CGGACGGCCC
GGATTCACCA CCGTGCTGCG CCCGCGGCCC GGCGGCCACG ACTTCGGCGT CTGGCGGCCC
GCACTGCGCG ACGGCCTGCG CTGGGCGTTC CCCTGA
 
Protein sequence
MGSALGSLVI VATWFLLVLG ALSAICWAGC VWMTRRRRAM AVGLGFLAAL LTLATAADTA 
NAHYGYLPRA ADVLGLTSWP TASVREVVGP APRPHPDGAV VHLPIAGVHS GFGTHSALVY
VPPQYFTDPG ARFPVVYLFH GSPGIPLDWY RAGQAAKTGA ALARAGRPAI LVAPPLGHGW
LDDSECVDRP GERIETYLVD DVLPTVDNLL RAIPDRADRV FAGISAGGFC ALNLGLRHRD
LVGTIVDISG LARPTHSGGM TGLFGNRPDL AAVTAANTPE SYSATLPPNP PTRVWLSCGL
MDFGPLGDIR KMALALYGRP GFTTVLRPRP GGHDFGVWRP ALRDGLRWAF P