Gene Franean1_3187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3187 
Symbol 
ID5671563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3755431 
End bp3758499 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content71% 
IMG OID641242081 
Productglycoside hydrolase family protein 
Protein accessionYP_001507501 
Protein GI158314993 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCA ACGGAACCGG GACGTCCGCC ATCCAGCCGT GGGACGCGCC TGAGCTCACT 
TCCGCGAACC GCGTCCCGAT GCACGCCGTT CCGCACGCTG ATCGTCTCGT CCTGGACGGT
GTCTGGGATT TCCAGCTCCT TCCCGGCCCG CTCGCGGAGC GCGGCGAGCA GTGGCGCACC
GTCGAGGTGC CGGGTGTGTG GACGATGCAG GACAGCGGGG ACCTTCCGCA GTACACGAAC
GTCGTCATGC CGTTCGACTC CCCGTTCCCG CACCCGCCCG AGGCGAACCC GACCGGGGTC
TACCGGCGCG GCTTCACCGC CGCGGCGGAC TGGACGGGGC GACGGGTCGT CCTGCACGTC
GGCGCCGCGG AAAGCGTCCT GCTGGTGCGG GTGAACGGAC GCGACGTCGG CTTCAGCAAG
GACTCCCACC TCGCGGCCGA GTTCGACGTG ACCAGGTTCG TCCGGCCCGG GACGAACGAA
CTCGAGCTGA CGGTCGTGAA GTGGTCGGAC GCCTCGTTCG TCGAGGATCA GGACCACTGG
TGGCACGGCG GCATCACCCG CTCGGTCTAC CTCTACACCA CCGCTCCCGT TTACCTGGCC
GACATCCAGG CGATGGCCGA CCTGCATGTG ACGGCAGCAG CAACCGGCAG CCTGCGGCTG
GACGTGAAGA TCGGCGGCGC CGGGACGCGC ACCGCCGGCT GGACGGTGCG CTCCCGGATC
GCGGGACTGT ACGAACCGGA CCCGCAGCCC GTCCTGGAAG CGGGTGCCGC GACGGGTATG
CCCCAGGCCG GGGCGGGTTC GGGCTACCCG GATGAGCCGC CGCCCTCGCT GATTCCCGAC
GGTCTGCTGG ACCTGCTGTC CCTCGGTGCT TCGGGGGCGC CGCTCCCGCC CGAGCTGAAG
GCCCGGGCCC AGGCGATGCA GGAACGGGCG ATGCCGACCC GTGTCGGCCG TACGCGGTTC
GAGGGCGAGG GCCTCGCCGT CGCGCCGTGG TCGGCGGAGA ACCCGCGTCT GTACCCGTTG
GAGGTCGAAC TCGTCGCGCC GGACGGTGCG GTCGTCGAAC ACGCCACGAT CCGGGTCGGT
TTCCGCCGGG TCGAGATCCG CGGTCGGAAC CTGCTGGTCA ACGGCGGCCG GGTGTGGATC
CAGGGCGTGA ACCGGCACGA CTTCAACGCG CGGACCGGCC GGGTCATCAC CGCCGGGCAG
CTGCGCGCCG AACTCGCGCT GCTCAAGCGG TTCAACGTCA ACGCGGTTCG CACTTCCCAC
TACCCCAACG ACCCGCTGTT CCTCGATCTG TGCGACGAGT ACGGCCTGTA TGTCGTGGAC
GAGGCGAACA TCGAGGCGCA CGCCCATGCC GGAACGGTCT GCGGGGACCC GCGCTACCTC
GGCGCGTTCG TGGACCGGGT GTCGCGCATG GTGCTGCGCG ACAAGAACCA TCCCTGCGTG
ATCTTCTGGT CGCTCGGCAA CGAGAGCGGT TACGGCCCCA ACCACGACGC CGCGGCTGGG
TGGGCCCGCG CCTACGACCC CAGCCGGCCC CTGCACTACG AAGGGGCGAT CAGCGCCGAC
TGGCACGGCG GCCACCGGGC CACCGACGTC GTCTGCCCCA TGTACCCCGC CTTCGACGCG
CTGCGCGCCT ACGCGGCGCA CCCCGACGCC GACCGCCCCG TGATCCTCTG CGAGTACGCC
TACTCCCAGG GCAACTCGAC CGGCGGGCTC GGCACCTACT GGGACCTGTT CGAGTCCACT
CCCGGCCTGC AGGGCGGGTT CATCTGGGAG CTCTACGACC ACGGCCTCGA CCCCGACGGC
GACGGCCGGT TCCGCTATGG CGGGGACTTC GGCGACCAGC CCAACGACGG CGTCGTCTGC
ATCAACGGCA TCCTGTTCTC CGACGGCGCC CCCAAGCCCG CCTTCCACGA GGCGCGCCAC
CTCTTCGCGC CCGTCCGGGT CCTGTCCGGT GCCACCGAGG CGCGTCTCGG CCGGGTGCGG
CTGAGGAACC GGCAGACCTT CGAGGACCTC TCCGGTCTCC GCCTCGCGCT CCACGTCGAA
CGGACCGACG GGCCGGGCGA TCCGACGCTG GTCGCGGCGC CGGCCATCCC CGCCGGCGGC
GAAGGCGTCC TCGACCTCCC CGAGACAGTC ACCGCGCAGC TCACCGGGCC GGATGCCGTC
GCCCTGGCCC TCGTCGTCCA GCTCGCCGAC GCGACGCCGT GGGCCGAGCA GGGCACCGAA
CTCGCCCGGC TCCAGGTCCT GCTCGACGTC GATGTGCCTG ACCTGATCGA AACCCGTCCC
GCCACCGGCG TGCTGCGCCT GGACGGCGAC GGGCTGCTCC AGCACCCCGT CCTGAGCGCG
GCACCGGTGT TGTCCTTCTG GCGCGCGCCG ACCGACAACG ACACCTCCAT CGGCCTCGAC
TCCCGGTTCG TGCGCACCGG TCTGTTCCGC GTCACCCGCA CGCTGGTCGA CCAGAAGATC
ACCGGCTCCA CGGCGACGAT CGTCAGCCGG TACACCGCCG CCTACGGCGC GGAGATCGAA
CACCGGCAGC GAATCACCGC CCTGTCCGAC ACCAGCTTCC GTTTCGACGA ACACGTCACC
CTGCCGGAGG AGCTCGACGA CATCCCCCGC CTGGGCGTCA CCTTCGCCAC CAACCCCGGC
TTCGAACACC TCACGTGGTT CGGCCTCGGG CCCCACGAGA CCTACCCCGA CCGGAAGAAG
TCAGGACTTC TCGGCCGCTG GACCTCCCAG GTCGACGACC TGTTCGTCCC CTACCTGCTG
CCGCAGGAGA ACGGCGGCCG CGCCGACGTC CAAGAACTCA CCCTCACCGG CCCCGACGGG
TACGCAATCA CCATCAGCAC CGACCGACCC GTGCAGATGA ACGTGTCGCA CTACCAGGTA
GCCGACCTGG AACCCGCCCG GCACACCTGG GAGCTCAGGC CCCGGGCGGA GACGTACGTT
CACCTCGACC TCGCGCACCG GGGACTCGGC ACCGGAGCCC TCGGCCCGGA CACCCTGGCG
TGGTACCGGG TACGCGGCGG CAGGTACGAG TGGTCCTGGC AGCTTGATCT CACCGCCCCG
CAGCGCTGA
 
Protein sequence
MTVNGTGTSA IQPWDAPELT SANRVPMHAV PHADRLVLDG VWDFQLLPGP LAERGEQWRT 
VEVPGVWTMQ DSGDLPQYTN VVMPFDSPFP HPPEANPTGV YRRGFTAAAD WTGRRVVLHV
GAAESVLLVR VNGRDVGFSK DSHLAAEFDV TRFVRPGTNE LELTVVKWSD ASFVEDQDHW
WHGGITRSVY LYTTAPVYLA DIQAMADLHV TAAATGSLRL DVKIGGAGTR TAGWTVRSRI
AGLYEPDPQP VLEAGAATGM PQAGAGSGYP DEPPPSLIPD GLLDLLSLGA SGAPLPPELK
ARAQAMQERA MPTRVGRTRF EGEGLAVAPW SAENPRLYPL EVELVAPDGA VVEHATIRVG
FRRVEIRGRN LLVNGGRVWI QGVNRHDFNA RTGRVITAGQ LRAELALLKR FNVNAVRTSH
YPNDPLFLDL CDEYGLYVVD EANIEAHAHA GTVCGDPRYL GAFVDRVSRM VLRDKNHPCV
IFWSLGNESG YGPNHDAAAG WARAYDPSRP LHYEGAISAD WHGGHRATDV VCPMYPAFDA
LRAYAAHPDA DRPVILCEYA YSQGNSTGGL GTYWDLFEST PGLQGGFIWE LYDHGLDPDG
DGRFRYGGDF GDQPNDGVVC INGILFSDGA PKPAFHEARH LFAPVRVLSG ATEARLGRVR
LRNRQTFEDL SGLRLALHVE RTDGPGDPTL VAAPAIPAGG EGVLDLPETV TAQLTGPDAV
ALALVVQLAD ATPWAEQGTE LARLQVLLDV DVPDLIETRP ATGVLRLDGD GLLQHPVLSA
APVLSFWRAP TDNDTSIGLD SRFVRTGLFR VTRTLVDQKI TGSTATIVSR YTAAYGAEIE
HRQRITALSD TSFRFDEHVT LPEELDDIPR LGVTFATNPG FEHLTWFGLG PHETYPDRKK
SGLLGRWTSQ VDDLFVPYLL PQENGGRADV QELTLTGPDG YAITISTDRP VQMNVSHYQV
ADLEPARHTW ELRPRAETYV HLDLAHRGLG TGALGPDTLA WYRVRGGRYE WSWQLDLTAP
QR