Gene Francci3_3963 details


Gene Information

Locus tag: Francci3_3963
Symbol:
ID: 3906923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4743191
End bp: 4744216
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 66%
IMG OID: 637881291
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_483042
Protein GI: 86742642
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
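
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4743191, 4744216)
print(length, protein_length_aa(length))  # 1026 341
```

Under these assumptions the result matches the listed Gene Length (1026 bp) and Protein Length (341 aa).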


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCGC AACAGCCGAA GGTGTCTCTC ACCGGGATCA AGCCGACGGG CGAGCCGCAC 
CTCGGCAACT ACATCGGAGC GATCCGGCCG TCGCTGGAGC TCACCTCGGC CTACGAGTCG
ATCTACTTCG TCGCCGACTA TCACGCCCTG ACATCCGTGC GTAGTCGCGA GGAGTTGGCG
AGCTATACCC GCTCGGTGGC GGCCACGTGG CTCGTGCTCG GGCTCGACCC GACGAGCACG
ATCTTCTACC GCCAGTCCGA CGTCCCCGAG ATCTTCGAGC TGACCTGGGT GCTGTCCTGC
GTCACCGGCA AGGGTCTGAT GAACCGGGCC CACGCCTACA AGGCGGCCCG TGACCGCAAC
GCCGAGGCCG GCGTCCCCGA TCTGGACGCC GGCATCAACA TGGGCCTGTT CAACTATCCG
ATTCTGATGG CCGTCGACAT TCTCATCATG AATGCCGACG TGGTCCCCGT TGGCCAGGAC
CAGTCGCAGC ATGTGGAATA CGCCGCCGAC ATCGCCGGCT CCTTCAACCA TCTGTTCGGC
GACGTGTTCA GCCTGAAGAT CCCGGAGGTG GTCCTCCCGG CCGGGAGCTC GGCGAAGGTA
CTGCCCGGCA TCGACGGCCG CAAGATGAGC AAGTCATACC GGAACACAAT CCCCCTCTTC
GCGCCGGAGA AGCAGCTACG GAAACTCGTC CGTGGGATCG TGAGCGACAG CACACCGCTG
GCGGACCCGA AGGACCCGGA CAGTTCCGCG GCCTTCGTTC TGCTCGAGAA CTTCGCGACT
CCAGAAACGA TCAAGGAGAT GCGCGGCCGT CTGGAGCAGG GCGGCACCGG GTGGGGCGAG
GTGAAGAACG CCCTGTTCGA GACGCTCAAC GACTGGCTGT CACCGCTGCG GGAGCGCTAC
ACCGAGCTCA TCGCCCCGGA CAGCGAGCTG GACGGCATCC TCGCGGCGGG TGCCGAGCGG
GCCCGCGACC GGGCCCGGCC GGTGCTCGCC GGCGTCCGGC GCGCGATCGG GATCTCGCAC
CTCTGA
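
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be normalized before its length or GC content can be recomputed. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the GC fraction is computed simply as (G + C) / total bases, which may differ from how the database rounds its own GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGGCAGCGC AACAGCCGAA GGTGTCTCTC ACCGGGATCA AGCCGACGGG CGAGCCGCAC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Pasting the full gene sequence in place of `first_line` would allow the Gene Length and GC content fields to be checked against the sequence itself.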
 
Protein sequence
MAAQQPKVSL TGIKPTGEPH LGNYIGAIRP SLELTSAYES IYFVADYHAL TSVRSREELA 
SYTRSVAATW LVLGLDPTST IFYRQSDVPE IFELTWVLSC VTGKGLMNRA HAYKAARDRN
AEAGVPDLDA GINMGLFNYP ILMAVDILIM NADVVPVGQD QSQHVEYAAD IAGSFNHLFG
DVFSLKIPEV VLPAGSSAKV LPGIDGRKMS KSYRNTIPLF APEKQLRKLV RGIVSDSTPL
ADPKDPDSSA AFVLLENFAT PETIKEMRGR LEQGGTGWGE VKNALFETLN DWLSPLRERY
TELIAPDSEL DGILAAGAER ARDRARPVLA GVRRAIGISH L