Gene Franean1_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1764 
SymbolthrS 
ID5670166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2114770 
End bp2116746 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content70% 
IMG OID641240685 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001506108 
Protein GI158313600 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.279145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00103927 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCCGACG TCCGCGTAAC CGTCCAGCGA GGTCAGCAGG CCGAGGAGCG GGTGGTGCGG 
ACGGGGACTA CGGCCGCCGA GCTCTTCGAC GGGGAGCGGT CGGTGATCGC GGCCCGCGTG
GGTGGTCTCC AGCGTGACCT CTCCCATCCG CTCGCCGCCG GTGACGTCGT CGAGCCGATC
ACCGTCGACT CGCCGGACGG GCGGGCCATC GTCCGCCACT CCTGCGCCCA CGTGGTGGCA
CAGGCGGTGC AGGACCTGTT CCCCAAGGCC CGGCTCGGCA TCGGCCCGCC GATCCAGGAC
GGCTTCTACT ACGACTTCGA CGTCGAGCGC CCGTTCACCC CGGAGGACCT CAAGGCCGTC
GAGAAGCGCG CCCAGGAGAT CATCAAGGCC GGCCAGCGCT TCGCCCGGCG GGTCGTCTCC
GAGGACGAGG CGCGCGCCGA GCTCGCCGAC GAGCCGTACA AGATCGAGCT GATCGGGCTG
AAGTCCAGCG CGGGCGACGA CGCCGAGGCC GGGGTCGAGG TGGGCGAGGG CGAGCTCACC
ATCTACGACA ACCTCGACCC GAAGTCCGGC GAGCTGTGCT GGAAGGACCT GTGTCGCGGG
CCGCACGTGC CCACCACCCG GCACATCCCC GCGTTCGCCG TCCAGCGCTC GGCGGCGGCG
TACTGGCGGG GCAGCGAGCG CAACCCGCAG CTGCAGCGCA TCTACGGCAC CGCCTGGGAG
TCCCGCGACG CGCTCAAGGC CTACCAGCAC CGGCTGGCCG AGGCGGAGAA GCGCGACCAC
CGCCGGCTGG GCGCCGAGCT CGACCTGTTC TCCTTCCCGA CCGAGATCGG GCCCGGCCTG
GCCGTGTTCC ATCCCAAGGG CGGCGCGATC CGCACCGTGA TGGAGGACTA CTCGCGGCGC
CGGCACATCG AGGCCGGGTA CGAGTTCGTC AACACCCCGC ACATCACCAA GTCGGATCTC
TACGAGATCT CCGGGCACCT CGACTGGTTC GCCGACGGCA TGTACCCGCC CATGCAGCTC
GACGGCGGGG CGGACTACTA CCTCAAGCCG ATGAACTGCC CGATGCACAT CCTGATCTTC
CGGTCGCGCG GCCGGTCGTA CCGGGAGCTG CCGCTGCGGC TGTTCGAGTT CGGCACCGTC
TACCGGTACG AGAAGTCCGG GGTCGTGCAC GGCCTGACCC GGGTCCGCGG CCTCACCCAG
GACGACGCGC ACCTGTTCTG CGCCCGCGAG CAGCTGCCCA CCGAGCTCGA CACCGTCCTG
AAGTTCGTGC TCGGCCTGCT GCGCGACTAC GGCCTCGAGG ACTTCTACCT CGAGCTGTCG
ACCCGTCCGC CGGGCAAGGC GATCGGCAGC GACAAGGAGT GGGAGGAGGC GACCGAGCTG
CTGCGCGAGG CCGCCTCCAA GCAGGACCTC GAGCTCGTCA TGGACGAGGG CGGCGGCGCG
TTCTACGGGC CGAAGATCTC CGTGCAGGCC CGGGACGCGA TCGGGCGCAC CTGGCAGCTG
TCCACCATCC AGGTCGACTT CCAGCTCCCG CAGCGCTTCG ACATGACCTA CCAGGCGGCG
GACGGCACCC GCCAGCGCCC GTTCATGATC CACCGGGCGC TGTTCGGGAC GATCGAGCGG
TTCTTCGCGA TCCTGCTGGA GCACTACGCC GGCGCGCTGC CGCCGTGGCT GGCTCCCGTG
CAGGTGGTGG GCATCCCGAT CACCGACGAG CACGTGCCGT ATCTGACCGA CGTGGCGGCG
AAGCTGCGGC AGCGGGGCAT CCGCGTCGAG GTCGACTCCT CCGACGACCG GATGCAGAAG
AAGATCCGCA CCGCCCAGAA GCAGAAGGTG CCCTTCATGC TGCTCGCCGG TGACGAGGAC
GTCGCCAAGG GCGCGGTGTC CTTCCGCTTC CGGGACGGCA CGCAGCGCAA CGGCGTGCCG
GTGGACGAGG CCGTCGCGGA GATCCTCGAC GCCGTCGAGC GTCGTATCCA GGTCTGA
 
Protein sequence
MSDVRVTVQR GQQAEERVVR TGTTAAELFD GERSVIAARV GGLQRDLSHP LAAGDVVEPI 
TVDSPDGRAI VRHSCAHVVA QAVQDLFPKA RLGIGPPIQD GFYYDFDVER PFTPEDLKAV
EKRAQEIIKA GQRFARRVVS EDEARAELAD EPYKIELIGL KSSAGDDAEA GVEVGEGELT
IYDNLDPKSG ELCWKDLCRG PHVPTTRHIP AFAVQRSAAA YWRGSERNPQ LQRIYGTAWE
SRDALKAYQH RLAEAEKRDH RRLGAELDLF SFPTEIGPGL AVFHPKGGAI RTVMEDYSRR
RHIEAGYEFV NTPHITKSDL YEISGHLDWF ADGMYPPMQL DGGADYYLKP MNCPMHILIF
RSRGRSYREL PLRLFEFGTV YRYEKSGVVH GLTRVRGLTQ DDAHLFCARE QLPTELDTVL
KFVLGLLRDY GLEDFYLELS TRPPGKAIGS DKEWEEATEL LREAASKQDL ELVMDEGGGA
FYGPKISVQA RDAIGRTWQL STIQVDFQLP QRFDMTYQAA DGTRQRPFMI HRALFGTIER
FFAILLEHYA GALPPWLAPV QVVGIPITDE HVPYLTDVAA KLRQRGIRVE VDSSDDRMQK
KIRTAQKQKV PFMLLAGDED VAKGAVSFRF RDGTQRNGVP VDEAVAEILD AVERRIQV