Gene Franean1_0430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0430 
Symbol 
ID5668853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp506431 
End bp508965 
Gene Length2535 bp 
Protein Length844 aa 
Translation table11 
GC content73% 
IMG OID641239362 
Productalpha-L-rhamnosidase 
Protein accessionYP_001504801 
Protein GI158312293 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCAGC CCTGCGCACC TCGTTTCGAA CACCGCACCG AGCCCGGCCC GGTGCTCGGC 
CTCGGCGCAC CAGCCCCCCG GCTCTCCTGG ACCGTCACGC GCGCGGAGGA GGGCTGGCGC
CAGACGGCCT ACGAGGTCCA GGTCGCCGGC AAGGTCTTCA CCGTGCGCAG CGCCGAGCAG
GTGCTGGTCC CATGGCCGGC CCGGCCGTTG CGCTCACGCG AGCGGGCCGA GGTCAAGGTC
CGGGTGGCGT ACGGCGATCA CTGGAGCCCG TGGAGCGAAG CGGCCACCGT CGAGGCGGGC
CTGCTGGACG CGGCCGACTG GACAGCGCGC TTCATCAGCC CGGTCGACCC GGGCGAACGG
GCGCCCGTCG TCGTCGGCCA CGTCGAGGTG CCCGGGCCGG TCCGCAGCGC TCGGCTGTAC
GCCACCGCCC ACGGCATCTA CGTACCGTCG ATCAACGGCC GCCGCGTCGA CGACACCGTC
CTCGCGCCGG GTTGGACATC GTACGAGCAC CGGCTGCGCT ACCACGTCTA CGACGTCACG
ACCCTGATCC GGCCGGGCGA GAACGTCCTG GAGTTCGTGC TCGGCAACGG CTGGTACCGC
GGACGGCTCG GCTGGACCAA CCAGCGCGCG ATCTACGGCG ACCGCCTCGC CCTGCTCGCC
CAGCTGGAGA TCACCACCTT CGACGGCGCC GTCCGCGTAC TGGCGACCGA CGGCACCTGG
CATGCCCGGC CGAGCGAGGT GCTGGCCGAC GACCTGTACG ACGGCCAGAC GACCGACCTG
CGCCCGCACG ACGCACCGGC GACCGGCGTC GACATCGTCG AGGGCGATCT GACGCGGCTC
GTCGCCCCGG ACGGCCCTCC GATCCGGCCG ACCAGCGTGC TGCCCGCCCG CAAGGTGTGG
ACCTCCCCGG CCGGCAAGAC GCTCGTCGAC TTCGGCCAGA ACGCCGTCGG ATGGATCCGC
CTGCGCGTCC GGGACCTGCC CGCGGGAACG ACGGTCGTGG TCCGCCACGC CGAGGTGCTG
GAAGACAAGG AACTGGGCAC CCGGCCGCTG CGATCGGCCC GCGCCACCGA CACCTGGGTG
CTGTCGGGCC ACGACGAGGT GCTCGAGCCC TCGCTGACGT TGCACGGCTT CCGCTACGCC
GAGGTCACCG GAGTGCCGGA CCTGCGCGAG CAGGACATCG CGCTGGTCGT CGTCGGCTCG
GACCTGCGCC GCACCGGCTG GTTCTCCTCC TCGCACGAAC AACTGAACCA GCTGCACGAG
AACGTCGTCT GGAGCACCCG CGGCAACTTC GTCGACCTGC CCACCGACTG CCCGCAGCGC
GACGAGCGCC TCGGCTGGAC CGGCGACATC CAGATCTTCG GGCCGACCGC GACCTTCCTG
TACGACACCG CCGGCCTGCT GCGCTCCTGG CTGGCCGACC TGGCCGCCGA GCAGCGCCCC
GACGGCTCGG TGCCGCACGT CGTCCCCGAA ATCGATCGCG CCCCGGAGTT CCGCACGCCC
GCGGCGGCCT GGGGCGACGC CGCCACCGTC GTGCCCTGGA CGCTCTACCA GCGCACCGGC
GACCTGCAGA TGCTCGAACG GCAACTGCCG AGCATGAGCG CCTGGGTCGA CAAGGTCGCC
GCCCTGGCCG GCCCGGACCG GCTCTGGCGC GGCGGCTTCC AGTTCGGTGA CTGGCTCGAC
CCGACAGCTC CGCCGGAAAA CCCCGGCCGG GCCAAGGCCG ACCCCGACGT GGTGGCCACC
GCCCACTTCG CCCGCTCGTC GTGGATCGTG GCCGAATCGG CCCGCCTGCT CGGCCGGGCC
GCCGACGCGG AGAAATACCG CACGCTGACC GACGAGGTAC GGGCCGCCTT CGTCCGGGCC
TTCGTGACGC CGCAGGGCCG GATCCACTCC GACGCGCAGA CCCTGTACGC GCTGGCCATC
GAGTGGGATC TGCTGCCGGA GGCGGAGCAG CGCGTGGCGG CCGGGCACCG GCTGGCCGAG
CTCGTGCGCG ACGGCGGGTT CCACATCGCC ACCGGCTTCG TCGGCACGCC ACTGGTCTGC
GACGCCCTCA CCAGCACGGG CCACCTCGAC GTCGCGTACC GCCTGTTGTT GCAAACGCAG
TGCCCGTCCT GGCTCTACCC GGTCACCATG GGCGCCACCA CGGTGTGGGA ACGCTGGGAC
AGCATGCGCC CGGACGGCAC CATCAACCCG GGGGAGATGA CCTCCTTCAA CCACTACGCC
CTGGGCGCCG TGGCCGACTG GCTGCACCGC ACGGTGGCCG GGCTCGCACC CGCCGCTCCC
GGATATCGGC GCCTGCTCGT GCACCCCCGG CTGACGCCGG AGCTCACCGG CGCTGCGGCC
ACCCATCTGA CCCCGTACGG GAAAGCCTCG GTGTCCTGGC TTCGCTCGGC CGGCCACCTG
CGGCTCGACG TGCACGTGCC GGTCGGCGCG TCGGCCGAGG TGCACGTGCC GGGCGCGGAG
CAGCCCGTGA CGGTCGGCCA CGGCGACCAC CGCTGGCTCG TGGCTGACCC GGTGTCGCGC
TCCGAGGGGC GGTGA
 
Protein sequence
MVQPCAPRFE HRTEPGPVLG LGAPAPRLSW TVTRAEEGWR QTAYEVQVAG KVFTVRSAEQ 
VLVPWPARPL RSRERAEVKV RVAYGDHWSP WSEAATVEAG LLDAADWTAR FISPVDPGER
APVVVGHVEV PGPVRSARLY ATAHGIYVPS INGRRVDDTV LAPGWTSYEH RLRYHVYDVT
TLIRPGENVL EFVLGNGWYR GRLGWTNQRA IYGDRLALLA QLEITTFDGA VRVLATDGTW
HARPSEVLAD DLYDGQTTDL RPHDAPATGV DIVEGDLTRL VAPDGPPIRP TSVLPARKVW
TSPAGKTLVD FGQNAVGWIR LRVRDLPAGT TVVVRHAEVL EDKELGTRPL RSARATDTWV
LSGHDEVLEP SLTLHGFRYA EVTGVPDLRE QDIALVVVGS DLRRTGWFSS SHEQLNQLHE
NVVWSTRGNF VDLPTDCPQR DERLGWTGDI QIFGPTATFL YDTAGLLRSW LADLAAEQRP
DGSVPHVVPE IDRAPEFRTP AAAWGDAATV VPWTLYQRTG DLQMLERQLP SMSAWVDKVA
ALAGPDRLWR GGFQFGDWLD PTAPPENPGR AKADPDVVAT AHFARSSWIV AESARLLGRA
ADAEKYRTLT DEVRAAFVRA FVTPQGRIHS DAQTLYALAI EWDLLPEAEQ RVAAGHRLAE
LVRDGGFHIA TGFVGTPLVC DALTSTGHLD VAYRLLLQTQ CPSWLYPVTM GATTVWERWD
SMRPDGTINP GEMTSFNHYA LGAVADWLHR TVAGLAPAAP GYRRLLVHPR LTPELTGAAA
THLTPYGKAS VSWLRSAGHL RLDVHVPVGA SAEVHVPGAE QPVTVGHGDH RWLVADPVSR
SEGR