Gene Francci3_1426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1426 
Symbol 
ID3903157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1719111 
End bp1720052 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content72% 
IMG OID637878763 
Productribosomal large subunit pseudouridine synthase D 
Protein accessionYP_480532 
Protein GI86740132 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.128873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0393122 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATCA CGGGCGGTGA CCTGCGGTCC CTGCCGGTCC CGGACGGGCT CGACGGCGTC 
CGTCTCGACG CGGCGATCGC GAGGATGTTC GGGCTGTCGC GGACCGTGGC CGCCGCGCTC
GTCGACGACG GCCAGGCGAG CCTCGACGGG AAGGTCCGGG GCCGGTCCGA CCGGGTCAGC
GGCGGTGCCT GGCTGGAGGT CCGGCTGCCC GCTCCGCCGC GTCCGGTGGC GGTGGAACCC
ACGCCGGTCG AGGCTCTCGG CATTCTCTAC GACGACGACG ACATCATCGT GGTGGACAAG
CCGGTCGGGG TCGCCGTCCA TCCGGCGCCC GGCTTCACCG GACCGACCGT GATCGGGGCG
TTGGCCGCCG CGGGATACCG CATTTCCACC TCGGGCGCGG CCGAGCGTCA GGGGGTGGTG
CACCGTCTCG ACGTCGGTAC CACCGGGGTG ATGGTGGTCG CCAAGAGCGA GCGCGCATAT
ACCCTGCTGA AACGGGCGTT TCGTGACCGT ACGGTGGACA AGCGCTACCG GGCCGTGGTG
CAGGGCCATC CCGATCCGCT GCGGGGCACC GTGGACGCCC CGATCGACCG GCATCCGCGC
CGGCCGGGGC TGTTCGCCGT CGTCGCGGAC GGCAAGCCGA GTATCACCCA CTACGACCTC
CAGGAGGCGT TCCGGGCCGC CTCCCTGCTG TCCGTGCGAT TGGAGACCGG GCGCACCCAC
CAGATCCGGG TGCACATGTC CGCCCTGCGG CACCCGTGTG TCGGGGATCT CGCCTACGGG
GCCGATCCCA CGCTCGCCGA GCGGCTCGGC CTGACCCGCC AGTGGCTGCA CGCGGCGCGG
CTGTCCTTCG ATCATCCCGG TCACGGCGGA CGGGTCGAGT TCACCAGTCC GGACCCGGCT
GACCTGGCCG AGGCGGTGGA ACGGCTGCGG GACCAGCCAT GA
 
Protein sequence
MTITGGDLRS LPVPDGLDGV RLDAAIARMF GLSRTVAAAL VDDGQASLDG KVRGRSDRVS 
GGAWLEVRLP APPRPVAVEP TPVEALGILY DDDDIIVVDK PVGVAVHPAP GFTGPTVIGA
LAAAGYRIST SGAAERQGVV HRLDVGTTGV MVVAKSERAY TLLKRAFRDR TVDKRYRAVV
QGHPDPLRGT VDAPIDRHPR RPGLFAVVAD GKPSITHYDL QEAFRAASLL SVRLETGRTH
QIRVHMSALR HPCVGDLAYG ADPTLAERLG LTRQWLHAAR LSFDHPGHGG RVEFTSPDPA
DLAEAVERLR DQP