Gene Franean1_1194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1194 
SymbolhisS 
ID5669607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1422912 
End bp1424291 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content70% 
IMG OID641240126 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001505554 
Protein GI158313046 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0221904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0754692 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCTCG CCCAGGCGCG CGCACGCAAC AGTGTCGAGA ACGAGGAGTT TGTCCCGGCC 
ATGGCCGACG CACCGATAGT CCGTCCCACC CCGGTCAGCG GGTTCCCCGA GTGGCTTCCC
GACGTCCGGA TGGTCGAGCT CCGCTGGCTC GACACCATCC GCGCCACCTT CGAGCGGTAC
GGGTTCTGCT CCGTGGAGAC CCCGTCGGTG GAGGCCCTCG AGGCCCTGAT GGCCAAGGGC
GAGACGTCGC AGGAGGTCTA CACCCTGCGC CGCCTGCAGG CAGAGGACGA CGACGACAGC
GCGCGCCTCG GCCTGCACTT CGACCTGACC GTGCCGTTCG CCCGGTACGT CGCCGCCCAC
TTCAACGACC TGGTCTTCCC GTTCAAGCGC TACCAGATCC AGCGGGTGTG GCGGGGCGAG
CGGCCGCAGG AGGGCCGGTT CCGGGAGTTC ACTCAGTGTG ACATCGACGT CATCAACGTC
GACAACGTCC CGCTGCACTT CGACGCCGAG CTGCCCCGGA TCGTGCACGA GGTCCTCACG
ACGCTCGCCG TCCCGTCCTG GACGTTGAAC ATCAACAACC GCAAGATCCT GCAGGGGTTC
TACGAGGGGC TCGGTATCGG TGACCCGCTC GCCGTCATCC GCGCCGCGGA CAAGATCGAC
AAGATCGGCG CGCGCGGGGT CGAGGCGCTG CTGACGGGGC CGGTCGGGCT CACGGCCGAG
CAGGCGCGCG CCTGCCTCGA CCTGGCCCAG GTCCGGGGCT CCGACGCCGG TGTCATCGAC
GAGATCGGCC GGCTCGGGGT CAAGTCCGAG CTCCTGACGG AAGGGCTCGA CGAGCTGGCC
CGGGTGCTGG ACGACCTGGC CGACCTGCCG GCGGGCGCCG TCGTGGCCGA CCTGTCGATC
GCCCGCGGCC TGGACTACTA CACCGGCACC GTCTACGAGG CCAAATTCGT CGATGCGCCC
GGGTACGGCA GCATCTGCTC CGGTGGCCGG TACGACGACC TCGCGGGCAC CTTCATCCGG
CGCAACCTCC CCGGTGTCGG GATCTCGATC GGTCTCACCC GCATCTTCGC CAAGCTTGTC
GCGGACGGCC TCATCGGCGC GGGCCCGTCC AGCCCGGCCC AGGTGCTCAT GGTGATCCCG
AGCGACGAGC GCCGCGCCGA GGCGCTCGCG ACCGCCCGGC TGCTGCGCGG CCGCGGCCTC
AACGTGGAGA CCTACCACCA GGCCGACAAG CTGGCGAAGC AGATCAGGTA CGCCTCACGC
AAGGGCATCG GGCACGTCTG GTTCCCGCCG TTCGCCGACG ACCGCCCGCA CGAGGTGAAG
GACATGGCCA CCGGCGACCA GGGCCCGGCC GACCCGACGG CGTGGACACC CGCGGGCTGA
 
Protein sequence
MRLAQARARN SVENEEFVPA MADAPIVRPT PVSGFPEWLP DVRMVELRWL DTIRATFERY 
GFCSVETPSV EALEALMAKG ETSQEVYTLR RLQAEDDDDS ARLGLHFDLT VPFARYVAAH
FNDLVFPFKR YQIQRVWRGE RPQEGRFREF TQCDIDVINV DNVPLHFDAE LPRIVHEVLT
TLAVPSWTLN INNRKILQGF YEGLGIGDPL AVIRAADKID KIGARGVEAL LTGPVGLTAE
QARACLDLAQ VRGSDAGVID EIGRLGVKSE LLTEGLDELA RVLDDLADLP AGAVVADLSI
ARGLDYYTGT VYEAKFVDAP GYGSICSGGR YDDLAGTFIR RNLPGVGISI GLTRIFAKLV
ADGLIGAGPS SPAQVLMVIP SDERRAEALA TARLLRGRGL NVETYHQADK LAKQIRYASR
KGIGHVWFPP FADDRPHEVK DMATGDQGPA DPTAWTPAG