Gene Franean1_3255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3255 
Symbol 
ID5671629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3851938 
End bp3853734 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content69% 
IMG OID641242147 
Productbeta-D-glucuronidase 
Protein accessionYP_001507567 
Protein GI158315059 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGTC CCCGGGACAC TGCCACGCGT GAGCGCAGGA ACCTGAGCGG GTTGTGGGCG 
TTCCGCCTCG ACCCGGCCGG GGTCGGGCGC GAGGCGGGCT GGTGGCAGGG CCGCCTGCCC
GAGGCACGCG AGATGCCCGT GCCGGCCAGC TACAACGACG TCCTCGTCGA CGCGGGGGTA
CGGGACCACA TCGGTGACGC CTGGTATCAG ACGACGGTGC GTGTCCCGCG GGGCTGGGCG
GGACAGCGGG TCGTGCTGCG TTTCGACTCG GCGACCCACC GCGCGGTGGT CTGGGTGGAC
GACGTGGAGG TCGCCAGCCA CGAGGGTGGC TACACCCCGT TCGAGGCGGA CATCACCGCG
TACGCGCGGC CGGGCCGGGA GATCCGGGTG ACAGCGGTGG TGAACAACGA GCTGACCTGG
CAGTCGGTAC CGCCGGGAGT GGTCGAGGAC ACCCCCTCCG GCCGCCGGCA GAACTACTTC
CACGACTTCT TCAACTACGC AGGCCTGCAC CGGCCGGTCT GGCTCTACTG CACCTCGCCG
GACCACATTG ACGACATCAC CGTCGTGACC GGACTCGACG GAGCCGTCGG CACTGTCGAC
TACCGCGTCG AGGTGGCTGG TGCCGGCGAG GCCGCCGGCG ACGCGGTCCG GCAGATTCGC
GCGGTGCTCG AGGACGCGGA GGGAGCCGAG GTGGCGCGGT CCACCGGCGC CTCGGGTGTC
CTGACGGTCG AGGGCGTGCG GCCGTGGCAG CCGGGTGCGG GATACCTCTA CACCCTGCGG
GTGGAACTAG CCGACGGCGG CGAGCTGGTC GACGACTACA CCCTGGCGGT CGGCATCCGC
ACGGTGGAGA TCGACGGCAC GAATTTCCTG ATCAACGGCG AGCCGTTCTA TTTCACCGGG
TTCGGCAAGC ACGAGGATCT GCCGATCCGG GGCAAGGGCC ATGACGACGT TTTCCTGGTC
CACGACTTCG CCCTGATGGA GTGGATCGGC GCGAACTCGT TCCGGACGTC GCACTATCCG
TACGCCGAGG AGGTGCTCGA CCACGCCGAT CGCCACGGGA TCGTCGTGAT CGACGAGACG
GCGGCCGTCG GCCAGAACAC CGGGGTCGGC GGCGGCATCG TCGGGTCGCG CCCGTATCCG
ACCTTCTCGC CGGAGACGAT CAACGATGCC AGCCGCGACG TCCATGCCCA GGCCATCCGG
GAACTCGTCG CCCGGGACAA GAACCATCCG TGCGTCGTGC TCTGGAGCGT CGCCAACGAG
CCGGAGTCGA CCACCGAGGC GTCCCGGGAC TACTTCGAGC CGTTGTTCGC GCTCACCCGC
GAACTCGACC CGACCCGGCC GGTGGGCTTC GTGAACATGT TGCTGGCGCG GCCCGGGATG
GACCTGGTCA TGCAGTTCTC CGATGTGATC ATGCTCAACC GGTACTACGG CTGGTACCTG
AACACCGGTG ACCTGGCAGG TGCGGAGACG GCCTGGCAGG CCGAACTCGA GGGCTGGGCG
GCCGAGGGCA AGCCGATCAT CATCACCGAG TACGGGGCCG ACACCATCAC CGGCCTCCAC
CAGGTCACGC CGCAGCCGTG GAGCGAGGAG TACCAGGCCG AGTACCTGGA GATGAACCAC
CGCGTCTTCG ACCGGATCGA CGCCGTGATC GGCGAGCATG TGTGGAACTT CGCCGACTTC
GCCACGAAGT CGGCGATCTT CCGGGTGGAC GGGAACAAGA AGGGAGTCTT CACCCGGGAC
CGCCACCCGA AGTCCGCCGC CCACACACTG CGCCGCCGCT GGCGCGGCAA GAACTGA
 
Protein sequence
MLRPRDTATR ERRNLSGLWA FRLDPAGVGR EAGWWQGRLP EAREMPVPAS YNDVLVDAGV 
RDHIGDAWYQ TTVRVPRGWA GQRVVLRFDS ATHRAVVWVD DVEVASHEGG YTPFEADITA
YARPGREIRV TAVVNNELTW QSVPPGVVED TPSGRRQNYF HDFFNYAGLH RPVWLYCTSP
DHIDDITVVT GLDGAVGTVD YRVEVAGAGE AAGDAVRQIR AVLEDAEGAE VARSTGASGV
LTVEGVRPWQ PGAGYLYTLR VELADGGELV DDYTLAVGIR TVEIDGTNFL INGEPFYFTG
FGKHEDLPIR GKGHDDVFLV HDFALMEWIG ANSFRTSHYP YAEEVLDHAD RHGIVVIDET
AAVGQNTGVG GGIVGSRPYP TFSPETINDA SRDVHAQAIR ELVARDKNHP CVVLWSVANE
PESTTEASRD YFEPLFALTR ELDPTRPVGF VNMLLARPGM DLVMQFSDVI MLNRYYGWYL
NTGDLAGAET AWQAELEGWA AEGKPIIITE YGADTITGLH QVTPQPWSEE YQAEYLEMNH
RVFDRIDAVI GEHVWNFADF ATKSAIFRVD GNKKGVFTRD RHPKSAAHTL RRRWRGKN