Gene Franean1_0523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0523 
Symbol 
ID5668942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp609387 
End bp610772 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content65% 
IMG OID641239452 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001504890 
Protein GI158312382 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCGCT ACTGCACAGG CCAGTCCGTG CCAGTAGAGT TCGGCACCAT CGAACGCGTC 
GCGCGAGTAT GCGGCGCCGA CGGGGACGAA ATCGCACGCC TGTTCAGACT CTGGGAAACT
GCGGCGTCGA TCTCGACGAG CGCCGGCATC GAACTTCCCA CCGCAGGCGG CGACCATGAG
CACACCAATC CTGTTGAAGG CGTCGAGTAC TCAACCTCTC CGGTCTCTCT GGTGCCACCA
GCACTCGCAG CCGCGGACGA CTCCGCACCA AGACCAGCCA CACAGAATGA GCACAGCCGG
CTTCGCGGCG CACGCAGAGC CGGCTGGAGA TGGAACCGCC GGAGTATATC CGCCACGGCC
ATCATCGTGG TGTTGTTCTC CAGCGCGGTC GGTACCGTAC TCGTTCTCGC CATGATTCTC
GCGCGTGAAC CTGGCGTCCG GCCGCTCGGG CGCCCGTTGG CCGACCAGGC CGGCTGGGCG
TTGTCCACCG CGTTCTCCCC CGACGGGAAA GTAATGGCTA GCAGTAGCAG GAAAGGCGGA
GTGTGGTTGT GGAACATGGC CGATCCCGCC ACGCCCGTCC GAATCGATCC TGCGCTGACC
GGCCCACGCG ACGGGGTGAC ATCACTGGCG TTCTCGCCAG ATGGGAGTCT TTTAGCCGGT
GGCAGCTGGG ACGGGTCCAT ATGGTTGTGG GACATAACCG ACAGCGGGGC TTCCAAGCCG
GCCGGCCGTG CGCTGACCGA CGACTCGGGA CCGATATGGT CGGTAGCATT CTCCGCGGAT
GGCCGCACGC TCGCATCCGG CAGCGACGAT ACGACGGTGC GACTTTGGGA CATGACCAAC
CGCGCCAGGC CGTGGCAATT CGTGCGGCTG AGCAGCGATA TGGAGTTCGT GACATCGGTC
GCGTTCTCCG CGGACAACCG CCTCCTAGTC GCCGCCGGCT TCAGTAGGAC CATCGCGATC
TGGGATATGG CCGACCGTGG GGCCCCTAAA CGGCTGGCCC AATCCCTGTC AACGCCCGCC
ACTACGTACG TGGCCGCCTT CTCCCCCAAT GGACGGCTCC TTGCCACCGG GAGCACCGAT
GGCTTGGTGC GACTTTGGGA CCTCGCCGTT CCAGAGGACC CCCATCCGAT CGGGAGACCG
CTCACCGGGC ATACCAACCG CGTCTGGTCA CTCGCATTCT CTCCGGATGG CGGCACCCTC
GCCAGCAGCG GGTTCGACAA CTCCGTGAGA CTGTGGGACG TGACCGACCT GTCCAACCCG
GAGCCCATCG GCGCGCCACT CACCGGCTAC CAGGGCTGGG TTCTCTCGGT GCGCTTCTCC
CCGAACGGCC GCGTGCTGGC AAGCACCAGC AGCGACAGCA CCATCCGCCT ATGGTCGCTA
CCCTGA
 
Protein sequence
MQRYCTGQSV PVEFGTIERV ARVCGADGDE IARLFRLWET AASISTSAGI ELPTAGGDHE 
HTNPVEGVEY STSPVSLVPP ALAAADDSAP RPATQNEHSR LRGARRAGWR WNRRSISATA
IIVVLFSSAV GTVLVLAMIL AREPGVRPLG RPLADQAGWA LSTAFSPDGK VMASSSRKGG
VWLWNMADPA TPVRIDPALT GPRDGVTSLA FSPDGSLLAG GSWDGSIWLW DITDSGASKP
AGRALTDDSG PIWSVAFSAD GRTLASGSDD TTVRLWDMTN RARPWQFVRL SSDMEFVTSV
AFSADNRLLV AAGFSRTIAI WDMADRGAPK RLAQSLSTPA TTYVAAFSPN GRLLATGSTD
GLVRLWDLAV PEDPHPIGRP LTGHTNRVWS LAFSPDGGTL ASSGFDNSVR LWDVTDLSNP
EPIGAPLTGY QGWVLSVRFS PNGRVLASTS SDSTIRLWSL P