Gene Franean1_4669 details

Gene Information

Locus tag: Franean1_4669
Symbol:
ID: 5673011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5574344
End bp: 5576833
Gene Length: 2490 bp
Protein Length: 829 aa
Translation table: 11
GC content: 73%
IMG OID: 641243526
Product: alpha-L-rhamnosidase
Protein accession: YP_001508942
Protein GI: 158316434
COG category:
COG ID:
TIGRFAM ID:
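
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon (the record lists the gene as unspliced and not a pseudogene). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5574344
END_BP = 5576833
GENE_LENGTH_BP = 2490
PROTEIN_LENGTH_AA = 829


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one for the stop codon.

    Assumes a complete, unspliced CDS whose last codon is a stop.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    computed_bp = span_length(START_BP, END_BP)
    computed_aa = expected_protein_length(computed_bp)
    # Compare the computed values with the values listed in the record.
    print("gene length matches:", computed_bp == GENE_LENGTH_BP)
    print("protein length matches:", computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.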


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.170324
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCTC GCTGGAACGC CGCGTTCATC GCCGCCCCGT CCGATCCGGC CGGGCGCGGT 
CCCGCCCCGG CGCCGTACCT GCGCCGCGAG TTCACCGTGG GCAATGGCCT GCGCTCCGCC
ACCCTGCACG TCACCGCGGT GGGCCTGATC GAGGCTCACC TGAACGGGGC CCCGGTGGGC
GACGAGGTCC TCGCCCCGGG CTGGACGTCC TACCGGCATC GCCTGGTCGT GAGCAGTCAC
GACGTCACCG GCCTGCTGGT GGAGGGCGCC AACGCCCTCG GCGCCGTCCT CGGTGAGGGC
TGGGCGGTGG GCAGGCTCAC CTGGGAGAAG GACAAGCGGG CCGTCTGGGC GGACCACCCG
GCCGGCTTCC TGCAGCTCGA CCTCGACTAC GGCGACCGGG TGGACGTGAT CAGAAGCGGT
GCCGACTGGC GGGCGGGCAC CGGGGCGACC CTGACGGACA GCATCTACGA CGGCGAGACC
CATGACGCGC GCCTGGAGCC GGCGGGCTGG GCAGAGCCCG GCTTCGACGA CGAGCACTGG
TCGCCAGTGG AGGTCGTCGC ACGCGATCTG ACCACGCTGA TCGCGCCGAG CGCGCCGCCG
ATCCGACGGG TGCAGGAGCT GCCCGCCGTC GACATCCTGA CGACGCCGGC GGGCCGGACG
GTCGTCGACT TCGGGCAGAA CCTGTCGGGC TGGGTCCGCC TGACCGTCCG CGGGGAGGCC
GGCACCACGA TCGTCCTGCG GCACACCGAG ACGTTGATCG ACGGCGAGGC GGACTTCCGG
CCCAACCGGA CGGCGCTGGC GACCGACTGC TACGTCCTGC GGGGCGGTGA TCCGGAGACG
TGGGAGCCTC GGTTCACCTT TCACGGTTTC CGTTACGTCG AGATGGAGGG GTGGCCCGGC
AGCCTCGACG CCGACGCGAT GACCGCGGTG GTGGTGCACA GCGACATGCG CCGGACCGGG
TGGTTCGAGA CGTCCGACGA ACTGCTCAAC CAGCTGCATC GCAACGTCGT CTGGTCGATG
CGCGGCAACT TCGTCGGCGT GCCCACCGAC TGCCCGCAGC GCGACGAGCG GCTCGGCTGG
ACCGGCGACA TCAACGCCTT CGGCCCCACC GCCGCCTTCC TCTACGACGT GCGCGGCGTG
CTGGGCTCGT GGCTCACCGA CCTCGCCGTC GAGCAGCGCG CGCAGGGGCA CGTGCCGCTG
GTCGTCCCGG ACGTGGGGGG CATGCCGATC ACGGCGCCCA CCGCGCTGTG GGGAGACGTC
GCGGTCAGCC TGCCCTGGAC GCTCTACCAG GAGTACGGCG ACCGGGAGCT GCTCGCCGAC
CAGTACGAGT CCATGACGGC CTTCATCGAC AGCGTGGAGG GCCTGCTGGA CGAGCGGGGG
CTGTGGAACT CCGGTTTCCA GTTCGGTGAC TGGCTCGATC CGGACGCGCC GCCGAAGAAC
CCGGCCGGCG GCAGGACGGA CGCCTACCTG GTCGCCAGCG CCTTCTTCTG CCACACGACC
CGCCAGCTGG CGCAGGCCGC CGAGGTCCTG GGCCACACCG GCGACGCCGC CCGGTACACG
GCCCTGCACC AGCGTGTCCG CGCCGCCTTC CGCGACGAGT GGGTCACCCC GTCCGGCCTG
GTCGCGAACG ACACGGCGAC CGCCTACGCC CTGGCCATCT GCTTCGACAT CCTCGACCCG
GCCCAGCAGG CGCGCGCCGG GCGCCGGCTC GCCGACCTGG TCAGCAAGGC CGACCACAGG
ATCAGCACGG GCTTCGCCGG CACGCCGCTG GTCGCACACG CGCTGAGCCG CACCGACCAG
CTCGACACCG CCTACCGGTT GTTGCTGCAG ACCGAGTGCC CGTCGTTCCT CTACCCGGTG
ACGCGGGGCG CGACGACGAT CTGGGAGCGG TGGGACGCGA TCCGGCCCGA CGGATCACTG
CACGACACCG GCATGACCTC GCTCAACCAC TACGCCCTGG GCGCCATCGC GGACTGGCTG
CACCGCGTCG TCGGCGGCCT CGAACCCGTC GAGCCCGGCT ACCGGCGGAT GCGGATCGCG
CCGCGGCCCG GCGGCGGCCT CACCCACGCC ACGCTCACCC ACGACACCCC GCACGGGAGG
GTCCGTGTCG CCTGGCGCCG GCAGCCCGAC AGCCGGATCA CCGTCGAGGT CGACGTCCCG
CCGGGCACGG CCGCCGACGT CGTCCTGCCG GGCCACCCCG ACAGGCTGAG CGTCCCCGTC
GGGCCGGGCA GCCACCGCTG GGAGTACGAC GTCCCGACGC CGGACCGGCC CGATTACAGC
CTCGACACCC CACTCAGGCA GGTCTTTCGA GATTCGGCGC TGTGGGCCGA GCTCCAGTCC
GTCCTTCGCC GGCACCTGCC CCAGTTCGCG GACGCCGACA GCGGGACGGA ACCATCCCTG
CCGAGCCTGC GCGCCCTGCT GGGGTACTTC CCGGCGCAGG CCCCGGCTCT GGAGGCCGAC
CTGGTCGCCG TGCTTGGGAC GCGAACCTAG
 
Protein sequence
MPPRWNAAFI AAPSDPAGRG PAPAPYLRRE FTVGNGLRSA TLHVTAVGLI EAHLNGAPVG 
DEVLAPGWTS YRHRLVVSSH DVTGLLVEGA NALGAVLGEG WAVGRLTWEK DKRAVWADHP
AGFLQLDLDY GDRVDVIRSG ADWRAGTGAT LTDSIYDGET HDARLEPAGW AEPGFDDEHW
SPVEVVARDL TTLIAPSAPP IRRVQELPAV DILTTPAGRT VVDFGQNLSG WVRLTVRGEA
GTTIVLRHTE TLIDGEADFR PNRTALATDC YVLRGGDPET WEPRFTFHGF RYVEMEGWPG
SLDADAMTAV VVHSDMRRTG WFETSDELLN QLHRNVVWSM RGNFVGVPTD CPQRDERLGW
TGDINAFGPT AAFLYDVRGV LGSWLTDLAV EQRAQGHVPL VVPDVGGMPI TAPTALWGDV
AVSLPWTLYQ EYGDRELLAD QYESMTAFID SVEGLLDERG LWNSGFQFGD WLDPDAPPKN
PAGGRTDAYL VASAFFCHTT RQLAQAAEVL GHTGDAARYT ALHQRVRAAF RDEWVTPSGL
VANDTATAYA LAICFDILDP AQQARAGRRL ADLVSKADHR ISTGFAGTPL VAHALSRTDQ
LDTAYRLLLQ TECPSFLYPV TRGATTIWER WDAIRPDGSL HDTGMTSLNH YALGAIADWL
HRVVGGLEPV EPGYRRMRIA PRPGGGLTHA TLTHDTPHGR VRVAWRRQPD SRITVEVDVP
PGTAADVVLP GHPDRLSVPV GPGSHRWEYD VPTPDRPDYS LDTPLRQVFR DSALWAELQS
VLRRHLPQFA DADSGTEPSL PSLRALLGYF PAQAPALEAD LVAVLGTRT