Gene Francci3_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1057 
SymbolrpsA 
ID3905303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1259062 
End bp1260540 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content65% 
IMG OID637878391 
Product30S ribosomal protein S1 
Protein accessionYP_480168 
Protein GI86739768 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.960366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGAGCA CCACCGACGT ACGACCCGTG GAGACCGTGA CCATCTCGCC CAGCCCGACC 
ACACCACAGG TAGCGGTCAA CGACATCGGC TCGGCCGAAG ACTTCCTTGC GGCCGTCGAC
AAGACGATCA AGTTCTTCAA CGATGGCGAC ATCGTTGACG GCATCATCGT CAAGGTCGAC
CGCGACGAGG TGCTGCTCGA CATCGGCTAC AAGACCGAGG GCGTGATCCC GTCCCGGGAA
CTGTCGATCA AGCACGACGT CGACCCGCAC GAGGTCGTCA GTGTGGGCGA CCACGTCGAG
GCCCTTGTCC TCCAGAAGGA GGACAAGGAA GGCCGCCTGA TCCTGTCCAA GAAGCGTGCG
CAGTACGAGC GCGCCTGGGG CACGATCGAG AAGCTCAAGG ACGAGGACGG TGTCGTCACC
GGCACCGTGA TCGAGGTCGT CAAGGGTGGT CTCATCCTCG ACATCGGCCT GCGTGGCTTC
CTGCCGGCTT CGCTTGTGGA GATGCGCCGG GTCCGCGATC TGCAGCCCTA CGTGGGCCGC
GAGCTCGAAG CCAAGATTAT CGAGCTGGAC AAGAACCGCA ACAACGTTGT GCTCTCGCGG
CGGGCCTGGC TCGAACAGAC CCAGAGCGAA GTCCGTTCCG AGTTCCTCGC CCAGCTGGCC
AAGGGCCAGA TCCGCAAGGG CGTGGTCAGC TCCATCGTCA ACTTCGGCGC CTTCGTGGAC
CTCGGTGGTG TGGACGGCCT CGTGCACGTC TCCGAGCTGT CCTGGAAGCA CATCGACCAC
CCGTCCGAGG TGGTCGAGGT CGGCCAGGAG GTTACCGTCG AGGTCCTCGA TGTCGACTTG
GACCGCGAGC GGGTCTCGCT GTCGCTGAAG GCGACGCAGG AGGACCCGTG GCGTCAGTTC
GCCCGGACCC ACGCGATCGG TCAGGTCGTT CCAGGCCGGG TCACGAAGCT GGTGCCGTTC
GGTGCGTTCG TCCGGGTGGA CGAGGGCATC GAGGGTCTGG TCCACATCTC CGAGCTGGCC
GAGCGGCACG TGGAGATCCC CGAGCAGGTC GTGAACGTCG GTGACGAGAT CCTGGTGAAG
GTCATCGACA TCGACCTCGA CCGCCGCCGC ATCAGCCTGT CGCTCAAGCA GGCGAACGAG
GCGACAGGGC TGGCTGTCGA CGGCGAGGCG TTCGACCCGA GCCAGTACGG CATGGAAGCC
AAGTACGACG AGCAGGGTAA CTACGTCTAC CCCGAAGGCT TCGACCCCGA GACCGGGGAA
TGGCTCGAAG GTTACGAGGA GCAGCAGGCG GAATGGGAGC GGCAGTACGC CGAGGCCCAG
GCCCGCTTCG AGGCGCACCA GGTCCAGATC CGGGCCGCGC AGGAGGCCGA TGCCGCCGCG
GCGGCCCCCT CCTCCTACAC CTCCCAGTCC GAGCAGCCTT CCTCGGCGAT CGATGAAGAG
GCGCTGCGTC GGCTGCGTGA GCAGTTCGGT CGGGAGTAG
 
Protein sequence
MTSTTDVRPV ETVTISPSPT TPQVAVNDIG SAEDFLAAVD KTIKFFNDGD IVDGIIVKVD 
RDEVLLDIGY KTEGVIPSRE LSIKHDVDPH EVVSVGDHVE ALVLQKEDKE GRLILSKKRA
QYERAWGTIE KLKDEDGVVT GTVIEVVKGG LILDIGLRGF LPASLVEMRR VRDLQPYVGR
ELEAKIIELD KNRNNVVLSR RAWLEQTQSE VRSEFLAQLA KGQIRKGVVS SIVNFGAFVD
LGGVDGLVHV SELSWKHIDH PSEVVEVGQE VTVEVLDVDL DRERVSLSLK ATQEDPWRQF
ARTHAIGQVV PGRVTKLVPF GAFVRVDEGI EGLVHISELA ERHVEIPEQV VNVGDEILVK
VIDIDLDRRR ISLSLKQANE ATGLAVDGEA FDPSQYGMEA KYDEQGNYVY PEGFDPETGE
WLEGYEEQQA EWERQYAEAQ ARFEAHQVQI RAAQEADAAA AAPSSYTSQS EQPSSAIDEE
ALRRLREQFG RE