Gene Franean1_6751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6751 
Symbol 
ID5675064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8210850 
End bp8212472 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content71% 
IMG OID641245600 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001510991 
Protein GI158318483 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.136114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGTGG CGCGTGGCGA GGCTGGCTGG GATTTCTTCA TCTCTTACAC CGCCGTGGAC 
ACAGCCTGGG CGGAGTGGAT CGCCTGGCAG TTAGAAGACG CTGGCTACCG GGTGCTGATC
CAGGCGTGGG ACTCCGTGCC CGGGTCGAAC TGGGCGGTCC GCATACAACA GGGGACGACC
GAGTCTGACC GCACCATCGC TGTGCTCTCG GCCTCCTACC TGCGGTCGGT CTACGGGCAA
AACGAGTGGC ATGCCGCCCA CGCCGCCGAC CCCGGCGGCT TCGCCCGCAC GTTGCTCCCC
ATCCGGGTGG AGGACTGCCC CCGCCCGGGA CTGCTCGGAC AGATCGTGTC GATCGATCTG
TTCGGCCATC CCGCCGACGT CGCCCGCCAG CACCTCCTCG ACGCGATCAG CACAGCACGG
GCGGGGCGCG CGAAACCCAC CGCCGCACCC GCCTTCCCCC CACGCCCGGC CCTACCCCCA
CAGCAGCCCT CAGCAAGAAC GGCACCCCCC TTCCCCGGCC CGGACCCGAC GGCCTCCCTC
GACCAGCCCG CCCTCCGCAC GCACGCCCGA TCCGATCGCC TCCTCCATGG GCCGCAGCGC
CGCGTTTCTC TCGCGGTGGT GCTGCTCGTC ATCACCGGCA CCGTCTTCCT TGCCAGTTCC
GCGCGGGACA GGAATCCGAG CGCGACCAGC GCCCACTCCG CTGCGCCACC AACGTCGGCT
CCCACACCCA GCCTGTCGGG CTCACCCTTA CGCGACCACA CCGACTCGGT GCGGTCGGTG
GCGTTCTCCC GGGACGGACG CACGCTAGCC AGCGCCAGCC AGGACGGCAC GGCGCGGCTG
TGGGACATCG CCGAGCGGAC CTCCCAACCG TTGACCGGCC GCATCGCAGT GTGGTCGGTG
GCGTTCTCCC CAGACAAGCA CACGCTGGCC AGCGCCAACG GCGACAGCAC GGTGCAGTTG
TGGGACGTGG CCGAGGGGAC CCTCCCCCAC CCGGTGGCTT CCCTGCCCGG CCACAGCGAC
GCGGTGGGAT CGGTGGCGTT CTCCCCGGAC GGACGCACGC TGGCCAGCGC CAGCGACGAC
CACACAGTGC GACTGTGGGA CGTGGCCACG GGGACCACCA CCCACACGTT GACCGACCAC
ACCGGCCCCG TGAACTCGGT GGCGTTCTCC CGGGACGGGC GCACGCTGGC CAGCGCCAGC
GACGACCACA CGGTGCGACT GTGGGATGTG GCCGAGGGGA CCCTCCTCCG CACCTTGCCC
GGCCACACCG AGCCAGTGAT GTCGGTGGCG TTCTCCCCGG ACAGACGCAC GCTGGCCAGC
GCCAGCCAGG ACAACACCGT GCGGTTGTGG GATGTGGCCG CGCGGACCGC CCCCCGCCTG
GTGGGCTCTC TGTCCGACCA CACCCACTGG GTGATGTCGG TGGCGTTCTC TCCCGACGGG
CGCATCCTGG CCAGCGCCAG CCAGGACCGC ACAGTGCGGC TGTGGGACGT GGCCGCGCGG
ACCACCACCC ACACGTTGAC CGGCCACACC GGCCCCGTGT TCTCGGTGGC GTTCTCCCTG
GACGGGCGCA CTCTGGCCAG CGCCAGCGAC GACAACACGG TGCGACTGTG GGACATGAGC
TGA
 
Protein sequence
MGVARGEAGW DFFISYTAVD TAWAEWIAWQ LEDAGYRVLI QAWDSVPGSN WAVRIQQGTT 
ESDRTIAVLS ASYLRSVYGQ NEWHAAHAAD PGGFARTLLP IRVEDCPRPG LLGQIVSIDL
FGHPADVARQ HLLDAISTAR AGRAKPTAAP AFPPRPALPP QQPSARTAPP FPGPDPTASL
DQPALRTHAR SDRLLHGPQR RVSLAVVLLV ITGTVFLASS ARDRNPSATS AHSAAPPTSA
PTPSLSGSPL RDHTDSVRSV AFSRDGRTLA SASQDGTARL WDIAERTSQP LTGRIAVWSV
AFSPDKHTLA SANGDSTVQL WDVAEGTLPH PVASLPGHSD AVGSVAFSPD GRTLASASDD
HTVRLWDVAT GTTTHTLTDH TGPVNSVAFS RDGRTLASAS DDHTVRLWDV AEGTLLRTLP
GHTEPVMSVA FSPDRRTLAS ASQDNTVRLW DVAARTAPRL VGSLSDHTHW VMSVAFSPDG
RILASASQDR TVRLWDVAAR TTTHTLTGHT GPVFSVAFSL DGRTLASASD DNTVRLWDMS