Gene Franean1_6487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6487 
Symbol 
ID5674802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7885458 
End bp7887800 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content75% 
IMG OID641245335 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001510730 
Protein GI158318222 
COG category[O] Posttranslational modification, protein turnover, chaperones
[R] General function prediction only 
COG ID[COG0443] Molecular chaperone
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.177218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGAGC CGATCCTCGC GATCGACTAC GGCACCTACA GCGCGTCCGC GGCGCTGGTC 
GTCGACGGCA AGGTGGCCAC GGTCCGGGAA CCCGCGAGCG GCGCCGCCGG CTGGCCGGCC
TCCGTGTTCG CCGATGGTGA GGTGCTGGTG GTCGGCACAC TCGCCGAGCG CCGCAAGCGG
ATCTGGCCCG CGGCCTACCG CGGCGGGGTC AAACGTGACC TGACCAGGAA CGCCGCCGTC
ACCCTGGACG GCCGCGCCCA CCAGCCGGCC GCGCTGGTCA CCGCGGTGCT GGCGGCGCTG
CGCGCCGAGG CCGAGGTCCA GCTCGCCGGG GAGGCGCTGC GCCGGGCGGT CGTGACCGTG
CCGGCCAGCT ACGACCCGGC CGGGCCGCTG CGGCGGACGA TGATCCGGGC CGCCGAGGAG
GCCGGCTTCG TCGACGTCGA CCTGCTCGCC GAACCGGTCG CCGCAGCCTG GGCGCCGATG
GGCCCCGCCG GCCCGCAGCC CGGCAACCTC GTCCTCGTCT ACGACCTCGG CGGCGGGACG
TTCGAGGCGG CCCTGGTGGT GGTCGGCGAG GACGGCCGCC ACGAGCTGCT CGGGCACGCC
AGCCTGCACG ACTGCGGCGG GCGCGACTTC GACCAGCTGC TTTTCGACGA GATCCTCGGC
CACTACGGGC CGGGCCTCGA GCCGCTACTC AACCCGACCG GCGTCGGCGA GGTGTTCCAG
ACCGCCGCCA GCCGGACCCG CCACGAGCTG CTCGACTTCG CCCGCGCGGT GAAGCACCAG
CTCACGGACG TCGCCTTCGT CGAGGACGTC TTCAGCCCCG CCGGCCTCCT CGTCAGCCTG
GAGCGGGACC GGTTCGACAA GCTCGCCTCC CCGACCCTGG CCCGGACGAT GGCCTGCTGC
CAGCACCTGC TGGAGAGCAC CGGCATCAGC CCGGAGCAGC TCGACGGCGT GCTGCCGGTC
GGCGGCTCGT CGCGGATGCC GGTGGTCACC GAGATCCTCG CGGGCCGGCT CGGCGTGGGC
ACCGACTGGC CGCACCAGCA CGTGTTCACC GCGCCGGAGC CGGACCAGGC CGTCGTGCTC
GGCGCGGCCT CCTGGGCGGC GACGGTCTCG ACCCGGCGCA CCGTCGCCGC GCGGCCCGAC
CCGGTGGTGC GGCCGGCCCG CTGGGACATT CCCGGCGGAC GGGCCACGAT GGTCCGCTGG
CTGGTCGAAC CCGGGGCCCG CTTCGACGCC GGGGCGGTGC TCGCCCGCGT CCGGCTGACC
GACGGCGCGC TGTTCGATCT GGCCTCCGAG GGTCCGGGCA GCCTCGTGGA GACCCACGCC
GAGCCGGGTT CGGCCGTGCG CGACGTCGAC TGGCTGGCCA CGACGCTGCC GTCCGGCCCC
GCGCCGGCCG GCTCGGCCCG GCCCGCCGAC CTGCGCGGAT TCGAGCGCAG CAACCGACCG
TCCCGCAACG TCGAGGTGAC GACGCGGGTG GCGCTGACCG GCCACGAACG CGACGTCACC
TCCGCCGCGT TCTCCCCCGA CGGCCGGCTG CTCGCCACCA CCAGCAAGGA CGGCACCCGG
CTGTGGGACA CGACCACCGG CCGCACCGTC GGACGGCTGA GCGGGCGCAA GATCTCCGCC
GTGCACGGCT GTGCGTTCTC CCCGGACGGC GACCTGCTCG CGACCACCGG CAGCGACAAG
ACCGCGCGGA TCTGGGAGAT CGCGACCGAG CGGCTGGCCC TCACCCTGGC CGGCCACAAG
GGCCCCGTCT ACGGCTGCGC GTTCTCCCCG GATGGCCGCC TGCTCGCGAC GGTCAGCACG
GACCGCACGG TCAAGCTGTG GGGGGTCTCG ACCGGAACCA ACATCGCCAC GCTGACCGGG
CACCGCGGCT CGGTGTACGG GTGCGCGTTC TCCCCGGACG GCCGGCTCCT CGTCACCGCG
GGCGCCGAGT CGACCCTGCT CTGGGACGTC ACGATCGGCG AGACGATCAC GAGCCTGGCC
GGGCACACCA ACTTCGCGAA CGGCTGCTCG TTCTCCCCCG ACGGCCTGCT GCTGGCCACG
ACCAGCAACG ACGGCACCCG CCTCACCGAC ACCCCGACCG GGACGACCAC ACTCACCCTG
CCCGGCTCGG CGCAGAGCTG CGCGTTCTCC CCGGACGGCG TCCTGCTGGC GACGGCGAGC
ACCGACGACA CCGCCCGGCT GTGGGACGTC GCCACCGGGA CGGCGGTGGC GACGCTGACC
GGTCACAGCA GCACCGTCAT GGCCTGCGCG TTCGCGCCTT ACGGCCTGCT GCTGGCCACC
ACCAGCACGG ACAAGACCGC CCGGCTGTGG GACATCACCT ACACGCCGGA CTCGGCGCGC
TGA
 
Protein sequence
MVEPILAIDY GTYSASAALV VDGKVATVRE PASGAAGWPA SVFADGEVLV VGTLAERRKR 
IWPAAYRGGV KRDLTRNAAV TLDGRAHQPA ALVTAVLAAL RAEAEVQLAG EALRRAVVTV
PASYDPAGPL RRTMIRAAEE AGFVDVDLLA EPVAAAWAPM GPAGPQPGNL VLVYDLGGGT
FEAALVVVGE DGRHELLGHA SLHDCGGRDF DQLLFDEILG HYGPGLEPLL NPTGVGEVFQ
TAASRTRHEL LDFARAVKHQ LTDVAFVEDV FSPAGLLVSL ERDRFDKLAS PTLARTMACC
QHLLESTGIS PEQLDGVLPV GGSSRMPVVT EILAGRLGVG TDWPHQHVFT APEPDQAVVL
GAASWAATVS TRRTVAARPD PVVRPARWDI PGGRATMVRW LVEPGARFDA GAVLARVRLT
DGALFDLASE GPGSLVETHA EPGSAVRDVD WLATTLPSGP APAGSARPAD LRGFERSNRP
SRNVEVTTRV ALTGHERDVT SAAFSPDGRL LATTSKDGTR LWDTTTGRTV GRLSGRKISA
VHGCAFSPDG DLLATTGSDK TARIWEIATE RLALTLAGHK GPVYGCAFSP DGRLLATVST
DRTVKLWGVS TGTNIATLTG HRGSVYGCAF SPDGRLLVTA GAESTLLWDV TIGETITSLA
GHTNFANGCS FSPDGLLLAT TSNDGTRLTD TPTGTTTLTL PGSAQSCAFS PDGVLLATAS
TDDTARLWDV ATGTAVATLT GHSSTVMACA FAPYGLLLAT TSTDKTARLW DITYTPDSAR