Gene Information |
Locus tag | Franean1_1764 |
Symbol | thrS |
ID | 5670166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2114770 |
End bp | 2116746 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240685 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001506108 |
Protein GI | 158313600 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
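The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon; the function and constant names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table; names are illustrative.
START_BP = 2114770
END_BP = 2116746


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)  # prints: 1977 658
```

Under these assumptions the computed values agree with the Gene Length (1977 bp) and Protein Length (658 aa) fields.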
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.279145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00103927 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCGACG TCCGCGTAAC CGTCCAGCGA GGTCAGCAGG CCGAGGAGCG GGTGGTGCGG ACGGGGACTA CGGCCGCCGA GCTCTTCGAC GGGGAGCGGT CGGTGATCGC GGCCCGCGTG GGTGGTCTCC AGCGTGACCT CTCCCATCCG CTCGCCGCCG GTGACGTCGT CGAGCCGATC ACCGTCGACT CGCCGGACGG GCGGGCCATC GTCCGCCACT CCTGCGCCCA CGTGGTGGCA CAGGCGGTGC AGGACCTGTT CCCCAAGGCC CGGCTCGGCA TCGGCCCGCC GATCCAGGAC GGCTTCTACT ACGACTTCGA CGTCGAGCGC CCGTTCACCC CGGAGGACCT CAAGGCCGTC GAGAAGCGCG CCCAGGAGAT CATCAAGGCC GGCCAGCGCT TCGCCCGGCG GGTCGTCTCC GAGGACGAGG CGCGCGCCGA GCTCGCCGAC GAGCCGTACA AGATCGAGCT GATCGGGCTG AAGTCCAGCG CGGGCGACGA CGCCGAGGCC GGGGTCGAGG TGGGCGAGGG CGAGCTCACC ATCTACGACA ACCTCGACCC GAAGTCCGGC GAGCTGTGCT GGAAGGACCT GTGTCGCGGG CCGCACGTGC CCACCACCCG GCACATCCCC GCGTTCGCCG TCCAGCGCTC GGCGGCGGCG TACTGGCGGG GCAGCGAGCG CAACCCGCAG CTGCAGCGCA TCTACGGCAC CGCCTGGGAG TCCCGCGACG CGCTCAAGGC CTACCAGCAC CGGCTGGCCG AGGCGGAGAA GCGCGACCAC CGCCGGCTGG GCGCCGAGCT CGACCTGTTC TCCTTCCCGA CCGAGATCGG GCCCGGCCTG GCCGTGTTCC ATCCCAAGGG CGGCGCGATC CGCACCGTGA TGGAGGACTA CTCGCGGCGC CGGCACATCG AGGCCGGGTA CGAGTTCGTC AACACCCCGC ACATCACCAA GTCGGATCTC TACGAGATCT CCGGGCACCT CGACTGGTTC GCCGACGGCA TGTACCCGCC CATGCAGCTC GACGGCGGGG CGGACTACTA CCTCAAGCCG ATGAACTGCC CGATGCACAT CCTGATCTTC CGGTCGCGCG GCCGGTCGTA CCGGGAGCTG CCGCTGCGGC TGTTCGAGTT CGGCACCGTC TACCGGTACG AGAAGTCCGG GGTCGTGCAC GGCCTGACCC GGGTCCGCGG CCTCACCCAG GACGACGCGC ACCTGTTCTG CGCCCGCGAG CAGCTGCCCA CCGAGCTCGA CACCGTCCTG AAGTTCGTGC TCGGCCTGCT GCGCGACTAC GGCCTCGAGG ACTTCTACCT CGAGCTGTCG ACCCGTCCGC CGGGCAAGGC GATCGGCAGC GACAAGGAGT GGGAGGAGGC GACCGAGCTG CTGCGCGAGG CCGCCTCCAA GCAGGACCTC GAGCTCGTCA TGGACGAGGG CGGCGGCGCG TTCTACGGGC CGAAGATCTC CGTGCAGGCC CGGGACGCGA TCGGGCGCAC CTGGCAGCTG TCCACCATCC AGGTCGACTT CCAGCTCCCG CAGCGCTTCG ACATGACCTA CCAGGCGGCG GACGGCACCC GCCAGCGCCC GTTCATGATC CACCGGGCGC TGTTCGGGAC GATCGAGCGG TTCTTCGCGA TCCTGCTGGA GCACTACGCC GGCGCGCTGC CGCCGTGGCT GGCTCCCGTG CAGGTGGTGG GCATCCCGAT CACCGACGAG CACGTGCCGT ATCTGACCGA CGTGGCGGCG AAGCTGCGGC AGCGGGGCAT CCGCGTCGAG GTCGACTCCT CCGACGACCG GATGCAGAAG 
AAGATCCGCA CCGCCCAGAA GCAGAAGGTG CCCTTCATGC TGCTCGCCGG TGACGAGGAC GTCGCCAAGG GCGCGGTGTC CTTCCGCTTC CGGGACGGCA CGCAGCGCAA CGGCGTGCCG GTGGACGAGG CCGTCGCGGA GATCCTCGAC GCCGTCGAGC GTCGTATCCA GGTCTGA
|
Protein sequence | MSDVRVTVQR GQQAEERVVR TGTTAAELFD GERSVIAARV GGLQRDLSHP LAAGDVVEPI TVDSPDGRAI VRHSCAHVVA QAVQDLFPKA RLGIGPPIQD GFYYDFDVER PFTPEDLKAV EKRAQEIIKA GQRFARRVVS EDEARAELAD EPYKIELIGL KSSAGDDAEA GVEVGEGELT IYDNLDPKSG ELCWKDLCRG PHVPTTRHIP AFAVQRSAAA YWRGSERNPQ LQRIYGTAWE SRDALKAYQH RLAEAEKRDH RRLGAELDLF SFPTEIGPGL AVFHPKGGAI RTVMEDYSRR RHIEAGYEFV NTPHITKSDL YEISGHLDWF ADGMYPPMQL DGGADYYLKP MNCPMHILIF RSRGRSYREL PLRLFEFGTV YRYEKSGVVH GLTRVRGLTQ DDAHLFCARE QLPTELDTVL KFVLGLLRDY GLEDFYLELS TRPPGKAIGS DKEWEEATEL LREAASKQDL ELVMDEGGGA FYGPKISVQA RDAIGRTWQL STIQVDFQLP QRFDMTYQAA DGTRQRPFMI HRALFGTIER FFAILLEHYA GALPPWLAPV QVVGIPITDE HVPYLTDVAA KLRQRGIRVE VDSSDDRMQK KIRTAQKQKV PFMLLAGDED VAKGAVSFRF RDGTQRNGVP VDEAVAEILD AVERRIQV
|
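The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and run basic sanity checks on a coding sequence (length, reading frame, first and last codon, GC fraction). The helper names are hypothetical, and the short input at the end is a made-up fragment for illustration, not the record's sequence. Note that in translation table 11, GTG is an accepted alternative start codon, which is consistent with this gene beginning with GTG while the protein begins with M.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def cds_summary(raw: str, stop_codons=("TAA", "TAG", "TGA")) -> dict:
    """Basic consistency checks for a CDS given as block-formatted text."""
    seq = clean_sequence(raw)
    return {
        "length_bp": len(seq),
        "in_frame": len(seq) % 3 == 0,
        "start_codon": seq[:3],
        "stop_codon": seq[-3:],
        "ends_with_stop": seq[-3:] in stop_codons,
        "gc_fraction": gc_fraction(seq),
    }


# Made-up 15-base input, formatted in blocks like the record above.
example = cds_summary("GTGTCCGACG TCTGA")
print(example["length_bp"], example["start_codon"], example["stop_codon"])
```

To check the record itself, pass the full Gene sequence text to `cds_summary` and compare `length_bp` with the Gene Length field and `gc_fraction` with the GC content field.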