Gene Francci3_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3946 
Symbol 
ID3906905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4724119 
End bp4725405 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content69% 
IMG OID637881273 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_483025 
Protein GI86742625 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.769187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACACG CGCTCCTGGA CGAGCTGTCC TGGCGTGGAC TCCTCCACGA CAGCACCGAT 
CCCGCCGAGC TGCGGGAGCA CCTCGACAGC GGACGGCGGC GCTGCTACAT CGGCTTCGAC
CCGACGGCCC CCTCGCTGAC GATCGGCAAC CTGCTGCCGA TGACCCTGCT GATCCGGGCC
GCACGGGCGG GGATCGGCGC GGTCACCCTG TTCGGCGGTG GCACGGGTCT GATCGGCGAC
CCGTCGGGCA AGTCGGTGGA ACGTAGCCTG CTCACCGCCC AGGAGGTACG GACGAACATG
CTCGGACAAC AGGAGATCAT GGAGTCCGTC TTCGCCCGGG CCCTGGACTC CGGCCAGCTA
CCACAGTTCG TCGACAACGC CGACTGGCTC GACGGACTCG GAATGATCGA GTTTTTGCGG
GACGTCGGCA AGCACTTCCC CGTCACCGAG ATGGTCCGCC GTGACTCGGT GCGCCGGCGG
CTGGACGATC CCGACGTCGG CCTGACCTAC ACCGAGTTCA GCTACTCCCT GCTCCAGGCG
TACGACTTCC GCCGCCTGTG CGAGGACCAC GGGGTCACCC TTCAGATGGG CGCCTCCGAC
CAGTGGGGCA ACATCGTCGC CGGGATCGAC TACGTGCGAC GGGTGCTGCG CACCCAGGTA
CACGGGCTGA CCTGCCCGCT GCTGCTGCGT TCGGACGGCA CGAAGTTCGG CAAGTCGGAG
AAGGGTGCGG TCTGGCTCTC GGCCGACCGT ACCTCGCCCT ACACCCTGTA CCAGTTCGTG
ATCAATCTGT CCGACGACGA GGCACGCCGG TTCGCCCTGT TCTTCTCGCT CATCGACCGT
GAGCCGCTGG AGCGGCTGTT CGCCGAGCAC GCCGAGGCTC CGGGTAAGCG GGCGTTGCAG
CGTCACCTGG CGCGGGAGAT CACCGCACTG GTGCACGGTC AGGCCGCGGT GGACGCGGCC
GAGGCGGCGT CGGCGGCGTT GTTCAGCGGG GACGTGAAGG CGATCGGCGC GGATCTGCTC
TCCGACGTCT TCGCCGACGT GCCCACGGTC GAGGAGCCGG CCGCCCGGCT CGAGGATGAC
GGCTGGCCGG TTGTCGATCT GCTGATCGCC ACCGGCCTCG CCAGCAGCAA GCGGGATGCG
CGCGAGCACC TGGGCAACCA TGCGGTCCTG GTGAACGGGG AACGGGTCGG CGTGGAAGCC
ACCGTCGGGA CCAAGGATCT GCTGCACGGC TCGGTCATCC TGGTCCGCCG CGGTCGCCGG
GAATGGCGGG TGGCCCGCTT CACCTGA
 
Protein sequence
MTHALLDELS WRGLLHDSTD PAELREHLDS GRRRCYIGFD PTAPSLTIGN LLPMTLLIRA 
ARAGIGAVTL FGGGTGLIGD PSGKSVERSL LTAQEVRTNM LGQQEIMESV FARALDSGQL
PQFVDNADWL DGLGMIEFLR DVGKHFPVTE MVRRDSVRRR LDDPDVGLTY TEFSYSLLQA
YDFRRLCEDH GVTLQMGASD QWGNIVAGID YVRRVLRTQV HGLTCPLLLR SDGTKFGKSE
KGAVWLSADR TSPYTLYQFV INLSDDEARR FALFFSLIDR EPLERLFAEH AEAPGKRALQ
RHLAREITAL VHGQAAVDAA EAASAALFSG DVKAIGADLL SDVFADVPTV EEPAARLEDD
GWPVVDLLIA TGLASSKRDA REHLGNHAVL VNGERVGVEA TVGTKDLLHG SVILVRRGRR
EWRVARFT