Gene Francci3_3189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3189 
Symbol 
ID3903915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3778339 
End bp3779814 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content77% 
IMG OID637880513 
Productputative RNA-binding Sun protein 
Protein accessionYP_482275 
Protein GI86741875 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000532749 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.349925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGATCGG CCGGTTCGCC GGGCACCCGT GGGCCCCGGC CAGCGTCAAG CGTCGACCGG 
CCGCGGTTGC TGGCCTGGGA GGTGCTGCGG GCGGTGGACG AGCGCGGCTC CTACGCCAAC
CTGCTGCTGC CGTCCCTGTT GGCGGGCAGC GGGCTGTCCG CCCGGGACCG CGGGTTCGTC
ACGGAGCTCG CCTACGGCTC CCTGCGTGCG CAGGGCACCC TCGACGGGGT GCTCGACACG
GCGACGAGTC GACCCGTCCA CACCATCGAC CCGCCGGTGC GCGACGCGCT GCGCCTGGGC
GCCTACCAGC TGCTGCGGAC CAGGGTCCCG GCCCACGCGG CCGTGGCCAG CACCGTCGAG
CTGGTCCGCA CGACGAGTGG CGAGCGCCCG GTCCGCTTCG CGAACGCCGT GCTGCGTCGG
GTGGCCGCCC GGGTGGCCGA GACCGGCGGC GATCTCGCCA CGATGCTGTC CGCACCGCGG
TTCGACGTCG ACCCTGTCGG TCACCTGGCG GTCGTGACGA CGCATCCCCG CTGGATCGTC
GAGGTCGTCG CGGAGGCCCT GGCCGGCGAC CTCACGGCGA CCCGCGCCGC GCTGGAGGCA
GACGACGTCC GACCCGCGGT ACACCTGGTC GCCCGTCCGG GCCGGGTCGA CCGTGACCGG
CTGCTCGCCG AGGCCGCGCA GGCAGGTCTG ACCGCCCGAG TCGGCCCCTA CTCGCCGTAT
GCGGTACGCC TCGACGGCGG GGACCCGGCG GGGTTGCCCG CCGTGGCCGC GGGCGCGGCC
GCCGTGCAGG ACGAGGGCAG CCAGCTCGTC ACCCTGGCCC TGGCCCGCGC GGCGACGGTG
GGCCGTGACC TCGGGCTGAC CGTCGACCTG TGCGCGGGCC CCGGCGGGAA GGCGGCCCTG
CTCGCCGCGC TGCTCGGCGG CTCCGCCCCA TCGGACGGGC CGGGTCTGCC GGACAGGCCG
GGTCCACCGG ACAGGCCGGC CCTGATCGCG GTCGAGCCCC GGGCGACCCG GGCGGCCATG
GTGGCCCGGT CCCTGGGCGA CGCGGCGCGG GCCTGGACGG TGCGCGCCGA CGGCCGGGCG
GTGCCGCTGC GGCCCGATGG AGCCGACCGG GTGCTGGTCG ACGTCCCCTG CACCGGCCTC
GGAGCACTGC GGCGCCGGCC GGAGGCCCGG TGGCGGCGGA CCTCGGCCGA TGTGGCCGCG
CTCGTCCCGC TCCAGCGTGC GCTGCTCGTC GCCGCGCTCG ACCTGGTCCG CCCGGGCGGG
GTGGTGGCGT ACGCGACCTG TTCCCCGCAC CCGGCCGAGA CCGTCGAGGT GGTGCGTGGC
GTGGCCGGAC AACGCGCCGA CACCTCCATC CTCGATGCCC GCCTGACCCT GCCGGAGGTC
GACCGGCTCG GTGACGGCCC GTTCGTCCAG CTCTGGCCGC ATCTCCATGG CACGGATGCG
ATGTTCGTCG CCCTGCTGCG CCGGGTCGAC AGCTGA
 
Protein sequence
MRSAGSPGTR GPRPASSVDR PRLLAWEVLR AVDERGSYAN LLLPSLLAGS GLSARDRGFV 
TELAYGSLRA QGTLDGVLDT ATSRPVHTID PPVRDALRLG AYQLLRTRVP AHAAVASTVE
LVRTTSGERP VRFANAVLRR VAARVAETGG DLATMLSAPR FDVDPVGHLA VVTTHPRWIV
EVVAEALAGD LTATRAALEA DDVRPAVHLV ARPGRVDRDR LLAEAAQAGL TARVGPYSPY
AVRLDGGDPA GLPAVAAGAA AVQDEGSQLV TLALARAATV GRDLGLTVDL CAGPGGKAAL
LAALLGGSAP SDGPGLPDRP GPPDRPALIA VEPRATRAAM VARSLGDAAR AWTVRADGRA
VPLRPDGADR VLVDVPCTGL GALRRRPEAR WRRTSADVAA LVPLQRALLV AALDLVRPGG
VVAYATCSPH PAETVEVVRG VAGQRADTSI LDARLTLPEV DRLGDGPFVQ LWPHLHGTDA
MFVALLRRVD S