Gene Franean1_5494 details


Gene Information

Locus tag: Franean1_5494
Symbol: rpsA
ID: 5673825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6648388
End bp: 6649869
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 67%
IMG OID: 641244349
Product: 30S ribosomal protein S1
Protein accession: YP_001509755
Protein GI: 158317247
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
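
The coordinate and length fields above are consistent with each other, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function name cds_lengths is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(6648388, 6649869))  # (1482, 493)
```

The result matches the Gene Length (1482 bp) and Protein Length (493 aa) fields.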


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.117184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.159849
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACGAGCA CCACCGACGT ACGACCCGTG GAGACCGTGA CCACCTCGCA CAGCCCGACC 
ACACCACAGG TTGCGGTCAA CGACATCGGC TCGGCGGAGG ATTTCCTCGC CGCGGTCGAC
AAGACGATCA AGTTCTTCAA CGACGGCGAC ATCGTCGACG GCATCATCGT CAAGGTGGAC
CGCGACGAGG TGCTGCTCGA CATCGGTTAC AAGACCGAGG GTGTCATCCC GTCGCGGGAG
CTCTCGATCA AGCATGATGT CGATCCGCAC GAGGTCGTCA CGGTCGGCGA CCACGTCGAG
GCCCTCGTCC TCCAGAAGGA GGACAAGGAA GGCCGTCTGA TCCTGTCCAA GAAGCGTGCG
CAGTACGAGC GCGCCTGGGG CACGATCGAG AAGCTCAAGG AGGAGGACGG CGTCGTCACC
GGCACCGTCA TCGAGGTCGT CAAGGGTGGT CTCATCCTCG ACATCGGCCT GCGCGGCTTC
CTTCCCGCCT CCCTCGTGGA GATGCGCCGG GTCCGCGACC TCCAGCCCTA CGTGGGCCGT
GAGCTCGAAG CCAAGATCAT TGAGCTGGAC AAGAACCGCA ACAACGTGGT TCTGTCCCGC
CGCGCCTGGC TGGAGCAGAC CCAGAGCGAG GTGCGGTCGG AGTTCCTCGC CCAGCTCGCC
AAGGGTCAGA TCCGCAAGGG CGTGGTCAGC TCCATCGTCA ACTTCGGTGC CTTCGTGGAC
CTGGGTGGCG TCGACGGCCT CGTGCACGTG TCCGAGCTGT CCTGGAAGCA CATCGACCAC
CCGTCCGAGG TGGTCGAGGT CGGCCAGGAG GTCACCGTCG AGGTTCTCGA CGTCGACCTG
GACCGCGAGC GGGTCTCCCT GTCGCTGAAG GCGACGCAGG AGGACCCGTG GCGCCAGTTC
GCCCGGACGC ACGCGATCGG CCAGGTCGTA CCCGGTCGGG TCACCAAGCT CGTGCCGTTC
GGCGCGTTCG TCCGCGTGGA CGAGGGCATC GAGGGCCTGG TGCACATCTC GGAGCTGGCC
GAGCGCCACG TCGAGATCCC CGAGCAGGTC GTCAATGTCG GCGACGAGAT CCTGGTCAAG
GTCATCGACA TCGACCTCGA CCGGCGCCGG ATCAGCCTCT CGCTGAAGCA GGCGAACGAG
GCGTCCTCGC TGGTCGCTGA GGGCGAGTCG TTCGACCCGA GCCAGTACGG CATGGAAGCC
AAGTACGACG AGCAGGGCAA CTACGTGTAC CCCGAGGGCT TCGACCCCGA GACCGGCGAG
TGGCTGCCGG GCTTCGAGGA GCAGCAGGCC GAGTGGGAGC GTCAGTACGC GGAGGCCCAG
ACCCGGTTCG AGGCCCACCA GGCCCAGATC CGGGCCGCGC AGGAGGCCGA CGCCGCCGCG
GCGGCACCGT CCTCCTACAC CTCGTCGCAG TCGGAGCAGC CCGCTTCGGC GATCGACGAG
GAGGCGCTGC GACGCCTGCG GGAGCAGTTC GGCCGCGAGT AG
 
Protein sequence
MTSTTDVRPV ETVTTSHSPT TPQVAVNDIG SAEDFLAAVD KTIKFFNDGD IVDGIIVKVD 
RDEVLLDIGY KTEGVIPSRE LSIKHDVDPH EVVTVGDHVE ALVLQKEDKE GRLILSKKRA
QYERAWGTIE KLKEEDGVVT GTVIEVVKGG LILDIGLRGF LPASLVEMRR VRDLQPYVGR
ELEAKIIELD KNRNNVVLSR RAWLEQTQSE VRSEFLAQLA KGQIRKGVVS SIVNFGAFVD
LGGVDGLVHV SELSWKHIDH PSEVVEVGQE VTVEVLDVDL DRERVSLSLK ATQEDPWRQF
ARTHAIGQVV PGRVTKLVPF GAFVRVDEGI EGLVHISELA ERHVEIPEQV VNVGDEILVK
VIDIDLDRRR ISLSLKQANE ASSLVAEGES FDPSQYGMEA KYDEQGNYVY PEGFDPETGE
WLPGFEEQQA EWERQYAEAQ TRFEAHQAQI RAAQEADAAA AAPSSYTSSQ SEQPASAIDE
EALRRLREQF GRE
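
The gene sequence starts with TTG, which normally encodes leucine, while the protein sequence starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that rule, not the database's own pipeline; the names CODON_TABLE, START_CODONS_TABLE_11 and translate_cds are hypothetical. It is applied here only to the first and last lines of the gene sequence.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M.

    Whitespace between 10-base blocks is ignored. Translation stops at
    the first stop codon, which is not included in the result.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "TTGACGAGCA CCACCGACGT ACGACCCGTG GAGACCGTGA CCACCTCGCA CAGCCCGACC"
print(translate_cds(first_line))  # MTSTTDVRPVETVTTSHSPT

# Last line of the gene sequence, ending in the TAG stop codon.
last_line = "GAGGCGCTGC GACGCCTGCG GGAGCAGTTC GGCCGCGAGT AG"
print(translate_cds(last_line))  # EALRRLREQFGRE
```

Both results agree with the corresponding ends of the protein sequence listed above.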