Gene Franean1_4229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4229 
Symbol 
ID5672584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5035949 
End bp5037175 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content70% 
IMG OID641243102 
Productbeta-lactamase domain-containing protein 
Protein accessionYP_001508519 
Protein GI158316011 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACAGC CCCTGAACGC CCTCATCGTC AAGGAGGGGG AGGGCCAGCA GGACGCCGTC 
CCCGTCAACG ACCACATCTT CACCTCGAAG GGCATCTCCA ACAGCTACCT GGTCACCACC
CCCGACGGCG ACGTGCTGAT CAACACCGGC ATGTACACCG AGGCGGAGCA GATCAAGGCC
CGCTTCGGCC GGGTGAGTTC CGGCCCGCTG CGGGTCATCG TCTTCACCCA GGGCCACCCC
GACCATGTGG GCGGCTGGTC CCAGATCGCC GCGCCGGGCG TCGAGACGAT CGCCCAGGCC
AACCACGCGG ACGTCCGCGA GTACTGGCGA CGGCTGCAGC CGTTCTACTC CAGCCGCAGC
ACCCACCTGT GGAAGCGCGA TGTCACGGGC GTCGACCGCA CCTACCAGCC GCCCGAGGCC
GTGGTCACCA CCACCTTCCT GGACAACCAC GCCTTCACCC TGGGCGGGCG CCGCTTCGAG
CTCTACTCGA CTCCGGGCGG CGAGACGACG GACTCCCTGG TCGTGTGGCT CCCCGACGAG
CGCACCGTGT TCACCGGCAA CCTTACCGGA CCTCTGTTCG GCCACGTCCC CAACCTGTAC
ACGATCCGCG GCGACAAGAT CCGCGGCTCG CTGTCGTACA TCCACTCCGT CGACCGGGTC
ATCGGGCTCG AACCCGAGGT CCTCATCACC GGGCACGGCG AGCCGGTGCG CGGGGCCGAG
GAGATCCGCC GCCGCCTCAC CCAGCTGCGC GACGCCACCG AGTACCTGCG CGACCGCACC
ATCGAGGGCA TGAACGCGGG CGTCGACCTG TGGACCCTGA TGGGCCAGAT CACGCTGCCG
CCCGAGCTGG CCATCCCGCA GGGGCACGGC AAGGTGCCCT GGATCGTCCG GGCGATCTGG
GAGGAGCACA CCGGCTGGTT CCGCTACGAG TCGACCACAG AGCTCTACGA CGTGCCCGCC
TCCGCCGTCT GGGCCGACCT GCTGGACATG GCCGGTGGGA CCGGCCCGCT GGTCGACCGG
GCCCGGGCCC ACCTGGACGC CGGGCGGCCG GTCGAGGCAC TGCACCTGAT CGACATGGTG
CTCTCCCGGG AACCGAAGGA TCCCGAGGCC CTGCGGATCA GGCTCGGGGC CCACGAACTG
CTGCTCGAGC GCAGCGGCCG GGAGAACTTC AGCGAGGTCC GCTGGCTCGA AGCCGAGATC
CGTGACCTCC GGGACATGCT GCCATGA
 
Protein sequence
MKQPLNALIV KEGEGQQDAV PVNDHIFTSK GISNSYLVTT PDGDVLINTG MYTEAEQIKA 
RFGRVSSGPL RVIVFTQGHP DHVGGWSQIA APGVETIAQA NHADVREYWR RLQPFYSSRS
THLWKRDVTG VDRTYQPPEA VVTTTFLDNH AFTLGGRRFE LYSTPGGETT DSLVVWLPDE
RTVFTGNLTG PLFGHVPNLY TIRGDKIRGS LSYIHSVDRV IGLEPEVLIT GHGEPVRGAE
EIRRRLTQLR DATEYLRDRT IEGMNAGVDL WTLMGQITLP PELAIPQGHG KVPWIVRAIW
EEHTGWFRYE STTELYDVPA SAVWADLLDM AGGTGPLVDR ARAHLDAGRP VEALHLIDMV
LSREPKDPEA LRIRLGAHEL LLERSGRENF SEVRWLEAEI RDLRDMLP