Gene Franean1_5086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5086 
Symbol 
ID5673421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6086610 
End bp6087551 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content75% 
IMG OID641243937 
ProductRluA family pseudouridine synthase 
Protein accessionYP_001509351 
Protein GI158316843 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00722052 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000548014 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGCACGG CCGACACCGA CCGGCGCTCG CTGCCGGTCC CCGACGGCCT CGACGGCATC 
CGCCTGGACG CCGCCATCGC CCGCATGTTC GGCCTGAGCC GCACGGCCGC CGCCACGCTC
GTTGACGACG GCCAGGTGAG CGTCGACGGC CAGGTCCGCG GCCGCTCCGA CCGGGTCAGC
GGCGGCGCCT GGCTGGACGT AGCCCTCCCC GCCCCGCCCC GTCCGCTCAC GGTGGAGCCC
ACCCCGGTCG GTGGCCTGCG CATCCTGCAC GACGACGACG ACATCGTGGT CGTCGACAAG
CCGGTCGGCG TGGCGGCCCA CCCGGCCCCG GGGTTCACCG GCCCGACGGT CATCGGCGCG
CTCGCCGCCG CCGGCTACCG GGTCTCCACC TCGGGCTCGG CCGAGCGCCA GGGCGTGGTG
CACCGCCTGG ACGTCGGCAC GACCGGGGCG ATGGCTGTCG CGAAGAGTGA ACGCGCCTAC
ACCCTTCTCA AGCGGGCCTT CCGGGAACGC GAGGTGGAGA AGGAGTACCG GGCGGTCGTC
CAGGGGCACC CGGACCCGCT GCGCGGGACG GTCGACGCCC CCATCGACCG CCATCCCCGC
CGGCCCGGCC TGTTCGCCGT CGTCGCGGAC GGCAAGCCCA GCGTGACCCA CTACGACACC
GAGGAGGCGT TCCGCGCCGC CTCGCTGCTG TCGGTGCGGC TGGAGACGGG CCGCACCCAC
CAGATCCGGG TGCACATGGC GGCGCTGCGG CATCCCTGCG TGGGCGACCT CGCCTACGGG
GCGGATCCGA CGCTGGCCCA GCGCCTCGGG CTGACCAGGC AGTGGCTGCA CGCTGCCCGG
CTCGCCTTCG CCCACCCGGC GGACGGCACC TGGGTCGAGT TCACCAGCCC CGATCCCGAT
GACCTGGCCG AGGCGGTGAA GCGGCTGCGC GACCAGACGT GA
 
Protein sequence
MSTADTDRRS LPVPDGLDGI RLDAAIARMF GLSRTAAATL VDDGQVSVDG QVRGRSDRVS 
GGAWLDVALP APPRPLTVEP TPVGGLRILH DDDDIVVVDK PVGVAAHPAP GFTGPTVIGA
LAAAGYRVST SGSAERQGVV HRLDVGTTGA MAVAKSERAY TLLKRAFRER EVEKEYRAVV
QGHPDPLRGT VDAPIDRHPR RPGLFAVVAD GKPSVTHYDT EEAFRAASLL SVRLETGRTH
QIRVHMAALR HPCVGDLAYG ADPTLAQRLG LTRQWLHAAR LAFAHPADGT WVEFTSPDPD
DLAEAVKRLR DQT