Gene Francci3_1867 details

Gene Information

Locus tag: Francci3_1867
Symbol:
ID: 3906142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2201881
End bp: 2202993
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 65%
IMG OID: 637879205
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_480972
Protein GI: 86740572
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
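
The reported gene and protein lengths can be cross-checked from the Start bp and End bp values. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2201881
END_BP = 2202993


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1113 370
```

Under these assumptions the result agrees with the listed Gene Length (1113 bp) and Protein Length (370 aa).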


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.705084
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.228313
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACACTG ATTCCGCGTC CATAGGCCTA GATACCACAG ACTCGGACCC CGCTACCTCG 
GCGCTCAGAA CCTCCGCCGC CCAGGCGCGC AGCGCGGAAT TGGAAGAACT GATTCTCAGC
AACCCCGAGC GGTTTCGGGT ACTGACGGGT GACCGTCCTA CCGGGCGCCT ACATCTCGGG
CACTACTTCG GCACGTTGCA CAATCGGGTT CGCCTTCAGG ATCTCGGGAC GGAGATCTTC
CTGATTATTG CTGACTACCA GGTTCTGACC GATCGCGACG TAGCGGACAA CCTGACCGCC
CACGTGGAGG AACTGGTCCT GGACTACCTG GCCATCGGCA TAGACCCGGC ACGCAGCACG
ATCTTCACAC ACAGTGCCGT CCCCGCCCTC AACCAGCTGA TGCTGCCCTT TCTAAGCCTT
GTCTCCGTTG CCGAGCTGAA CCGCAATCCC ACCGTCAAGG AGGAGATCGC GCATTCCCGG
CAGTCGGCCG TCAGTGGCCT GATGTACACC TACCCCGTCC ACCAGGCCGC CGACATTCTC
TTCTGCAAGG GAAACCTGGT CCCAGTGGGC CAGGACCAGC TTCCCCACCT CGAACTCGCC
CGCACGATCG CCCGCCGCTT CAACGACCGC TACGGCGACG GCACCAGACT GTTTCCAGAG
CCCGAGGCGC TCCTGTCGAG CGCGCCCCTT CTCCTCGGCA CGGATGGCTC CAAGATGAGC
AAGAGCCGGC GTAACGCTGT GGCCCTGGCT GCGACCGCCG ACGAGACCGC CCGGCTGCTC
AAGGGAGCGA AGACCGACTC CGAGCGCCAC ATCACCTACG ATCCCGCGAA CCGTCCCGAG
GTGTCCTCCC TCCTCCTGCT CGCTTCGCTC TGCCAGAACC GGCACCCTCA TCAGGTCGCC
GACGACATCG GCTCCGCCGG GGCCGCCGCA CTTAAGAAGA TCGTGATCGA AGCGGTCAAC
GACTACCTGG CACCGATCCG GGCTCGCCGA GCCGACTACG CCGAGGACCG CTCCCATCTG
CGCCGTGTGC TCCGCGAGGG CAACGAACGA GCGGGAGCCG TCGCCGACGC AACCCTCGCC
GAGGTGCGTA CCGCCATGAA CAGCCACTAC TGA
 
Protein sequence
MNTDSASIGL DTTDSDPATS ALRTSAAQAR SAELEELILS NPERFRVLTG DRPTGRLHLG 
HYFGTLHNRV RLQDLGTEIF LIIADYQVLT DRDVADNLTA HVEELVLDYL AIGIDPARST
IFTHSAVPAL NQLMLPFLSL VSVAELNRNP TVKEEIAHSR QSAVSGLMYT YPVHQAADIL
FCKGNLVPVG QDQLPHLELA RTIARRFNDR YGDGTRLFPE PEALLSSAPL LLGTDGSKMS
KSRRNAVALA ATADETARLL KGAKTDSERH ITYDPANRPE VSSLLLLASL CQNRHPHQVA
DDIGSAGAAA LKKIVIEAVN DYLAPIRARR ADYAEDRSHL RRVLREGNER AGAVADATLA
EVRTAMNSHY
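
The gene and protein sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the nucleotide sequence. It assumes the codon assignments of translation table 11 match the standard genetic code and that the alternative start codons listed in `START_CODONS` (an assumption based on the NCBI description of table 11) are read as methionine at the first position. All function names are hypothetical helpers written for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in standard TCAG codon order; assumed to apply to table 11 as well.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for translation table 11 (read as M at position 1 only).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons initiate with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence.
first_blocks = "GTGAACACTG ATTCCGCGTC CATAGGCCTA"
print(translate(first_blocks))  # MNTDSASIGL
```

The gene begins with GTG, which encodes valine internally but is read as methionine at the start, consistent with the protein sequence beginning MNTDSASIGL. Passing the full gene sequence to `gc_percent` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.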