Gene Francci3_3548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3548 
SymbolhisS 
ID3904487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4242295 
End bp4243617 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content70% 
IMG OID637880869 
Producthistidyl-tRNA synthetase 
Protein accessionYP_482629 
Protein GI86742229 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.458882 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.233684 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACG CGCCGATCGT CCGTCCCCTG CCCGTGAGCG GATTTCCCGA GTGGCTGCCC 
GAGGTCCGTC TCGTCGAGCA GCGCTGGCTC GACACCATCC GAGCGACGTT CGAGCGGTAC
GGCTTCTGCT CGGTGGAGAC CCCCTCGGTC GAGGCGCTCG AGGTGCTGAC CGCGAAGGGG
GAGACCTCCC AGGAGGTCTA CATGCTACGC CGCCTGCAGG CCGACGCCGA CGACGACAGC
GCCCGCCTCG GCCTGCACTT CGACCTCACA GTGCCTTTCG CGCGGTACGT GGCCGCCCAC
TTCAACGACC TCGTGTTCCC GTTCAAGCGC TACCAGATCC AGCGGGTGTG GCGGGGGGAG
CGTCCTCAGG AGGGCCGCTT CCGCGAGTTC ACCCAGTGCG ACATCGACGT GATCAACGTT
GATCAGGTGC CCCTGCACTT CGACGCGGAA CTTCCCCGCA TCGTGCACGA GGTCCTGGGC
ACCCTCGGCG TTCCGCCCTG GACCCTCAAC ATCAACAATC GCAAGGTGCT CCAGGGCTTC
TACGAGGGTC TGGGCATCGG CGATCCGCTG GCCGTCATCC GGGTCGCCGA CAAGCTCGAC
AAGATCGGCC TCGCGGGGGT GGAGGGGCTG CTGACCACCG CGGTCGGGCT CGACCCGGAC
CAGGTGCGCG CCTGCCTGGA GCTCACGGGC ATTCGGGGCT GCGATCCCGG CGTCGTCGAG
GAGGTACGCC GGCTCGGGGT GAAATCGGAC CTGCTCTCCG AGGGGCTCGA CGAGCTCGCC
GCGGTTCTCG GCGATCTCGC CGACCTGCCC GCCGGCGACG TGGTCGCGGA CCTCTCGATC
GCCCGCGGTC TCGACTACTA CACCGGGACC GTCTACGAGG CGAAGTTCGT CGACTGGCCG
GACTTCGGCA GCATCTGCTC GGGGGGGCGG TACGACAACC TTGCCGGCTC CTTCATCCGC
CGCAACCTCC CCGGCGTCGG GATCTCGATC GGCCTCACCC GCATCTTCGC CAAGCTCCTG
GCCGAGGGCC TGCTCACCAC CGGGCCGTCC AGCCCCGCCG ACGTGCTGGT CGTGATCCCC
GCCGCGCCGC GCCGCGCCGC CGCCCTCGCG ACGGCCGCCG AGCTGCGCAC CCGGGGGCTG
CGGGTGGAGA CCTACCACCA GCCGGACAAG GTGGCCAGGC AGGTCCGCTA CGCCTCCCGC
AAGGGCATCG GATTCGTCTG GTTCCCGCCC TTCGACGATG GCCGGGCGCA CGAGGTGAAG
AACATGGCCA CCGGGGACCA GTCCGCGGCG GACCCGGCGA CCTGGACCCC GTCCGCGGGC
TGA
 
Protein sequence
MSDAPIVRPL PVSGFPEWLP EVRLVEQRWL DTIRATFERY GFCSVETPSV EALEVLTAKG 
ETSQEVYMLR RLQADADDDS ARLGLHFDLT VPFARYVAAH FNDLVFPFKR YQIQRVWRGE
RPQEGRFREF TQCDIDVINV DQVPLHFDAE LPRIVHEVLG TLGVPPWTLN INNRKVLQGF
YEGLGIGDPL AVIRVADKLD KIGLAGVEGL LTTAVGLDPD QVRACLELTG IRGCDPGVVE
EVRRLGVKSD LLSEGLDELA AVLGDLADLP AGDVVADLSI ARGLDYYTGT VYEAKFVDWP
DFGSICSGGR YDNLAGSFIR RNLPGVGISI GLTRIFAKLL AEGLLTTGPS SPADVLVVIP
AAPRRAAALA TAAELRTRGL RVETYHQPDK VARQVRYASR KGIGFVWFPP FDDGRAHEVK
NMATGDQSAA DPATWTPSAG