Gene Franean1_0544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0544 
Symbol 
ID5668961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp630384 
End bp631904 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content73% 
IMG OID641239471 
Productglycosidase PH1107-related 
Protein accessionYP_001504909 
Protein GI158312401 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2152] Predicted glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.134151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCGGCA GCACCCGGAC CACAGCTGCC GGGTCGGCCC AGAACGACCT GGTCACGCGG 
CATCCGCTGG GGCTCTCCCC GAACAGCGAC CGGGTCATCG CGAAGCTCTT CCTGCCCGGT
GAGGAGGGCA GCCAGCCGCA CTCCCGGGCC GCCGGCATCG TGGCCCGGGC TCTCGCCCTG
CCCGAGCGGG AGGTCGACAG CCTGGTCGCC GACCTCCTCG AACGGTTCGA GCCGCGCCAC
CGGGACTACC GGGGCATCCT CGCGCGGCAT GCCGCGGTGG TGACTCCGCG GGTGCCGGTG
CCCGCGGAGC TCTCCCCAGC GCGCTGTCTG CTGCTCGGCG CGTGTTTCAC CGCCGAGTAC
GCGGTGGAGG GCGCGGCCGT CACCAACCCC TCGGCGGTGC CGCACCCCGA CCAGACCGGG
CTGCCGTCCG GCGCGCTGCG ACTGGCGTTG AGCCTGCGCG CTGTCGGGGA GGGCCATCTG
TCCTCGATCG GCTTCGCCGT CGCGGTGATC GGTCCGGGCC CGGCGGTGCG CCTGGAGTCG
CGGACCGGCC CACTCACCAC CGGCGTCGCC GTACCCGTCG AGTGGGAGGC CGCCCGGCTG
CGCGCCGTGC TCACCGAACA CGGGCTCGAC GACGAGGTCA CCCGTGCCGT CCTCCAGAGC
CTGGACCCAC GGCGGCCCGG CAGTGAACCC GACCCCGATC TCTCCCGTGC CTTCGCCGCG
GTTCCCCCCG ACCTGCTCCG TCGCCCCCAG GCACCCGGGA TCCTCGCCGG GATCCGCTCG
ATCGCCGGAT CATTGCGGCG GGTCGAGTTC CCACCCGACA GCGCCCTGCC GCAACGGGTG
CTGTGGCCCA CCGTCACCAG CGAGAGCAAC GGCATGGAGG ACGCGCGGTT CGTCCGGTTC
ACCGCGCCGG ACGGGACCGC GGACTACCGG GCCACCTACA CCGCCTTCGA CGGCACGGAC
ATCTCCCCGC GCCTGCTCAC CAGCCCCGAC CTGCGGGTCT TCACCACCGC ACCGCTCACC
GGCCCGGCCG CCCGTAACAA GGGCATGGCG CTGTTCCCCC GGCTGGTCGA CGGTCGCCAT
CTCGCGCTCT GTCGCTCCGA CGGCGAGTCC ACCGGTCTGA CCGCATCGGA TGACGGCCAG
GTATGGGGGC CGGCGCGTCC GCTCCACGGG CCGCGGGTCG CCTTCGAACT GCTCCAGGTG
GGCAACTGCG GGTCCCCGGT CGAGACGTCC GCGGGCTGGC TCACGCTCAC CCACGGCGTC
GGGCCGATGC GGACGTACAC CATCGGCGCG ATCCTGCTTG ATCTCGACGA TCCGGGAAGG
GTGGTCGCGG CGCTGCCCGA ACCACTGCTC GCTCCGACCG GGCAGGAGAG CACGGGCTAT
GTCCCCAACG TCGTCTACTC CTGCGGCAGC CTGATCCACC ACGGTCTGCT GTGGCTGCCG
TACGGGATCG GCGACACCCG GATCGGGATG GCCAGCGTGC CCGTCGACCG GCTCCTCGCA
CGAATGGTTC CCGTCGGGTG A
 
Protein sequence
MIGSTRTTAA GSAQNDLVTR HPLGLSPNSD RVIAKLFLPG EEGSQPHSRA AGIVARALAL 
PEREVDSLVA DLLERFEPRH RDYRGILARH AAVVTPRVPV PAELSPARCL LLGACFTAEY
AVEGAAVTNP SAVPHPDQTG LPSGALRLAL SLRAVGEGHL SSIGFAVAVI GPGPAVRLES
RTGPLTTGVA VPVEWEAARL RAVLTEHGLD DEVTRAVLQS LDPRRPGSEP DPDLSRAFAA
VPPDLLRRPQ APGILAGIRS IAGSLRRVEF PPDSALPQRV LWPTVTSESN GMEDARFVRF
TAPDGTADYR ATYTAFDGTD ISPRLLTSPD LRVFTTAPLT GPAARNKGMA LFPRLVDGRH
LALCRSDGES TGLTASDDGQ VWGPARPLHG PRVAFELLQV GNCGSPVETS AGWLTLTHGV
GPMRTYTIGA ILLDLDDPGR VVAALPEPLL APTGQESTGY VPNVVYSCGS LIHHGLLWLP
YGIGDTRIGM ASVPVDRLLA RMVPVG