Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4892 |
Symbol | |
ID | 5673232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5870770 |
End bp | 5872104 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243747 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_001509163 |
Protein GI | 158316655 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase [TIGR03447] cysteine--1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00699487 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0480407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCTGCT CGTCGCCGGC CTGCCCGGCT TGTCCCGCGG CCGGCCCAGC CCCGGTTACC CTGACACGCA TGCAGGCGTG GCCCTTTCCC CAGCTCCCCA AGCTTCCCGG ACAGGGTCGC GACCTCCGCG TTCTCGACAC GGCGCACGGC GGAGTGCGGA CTCTGGACCT CGGGCCGACG GTCCGGCTCT ACGCCTGCGG GATCACCCCG TATGACGCCA CCCACCTCGG GCACGCCTTC ACGTACCTCA CCTACGACCT CGTCCAGCGC GTGCTGCGGG ACGCCGGCCA CGAGGTGCGG TACGTGCAGA ACGTCACCGA CGTCGACGAT CCGCTGCTCG AGCGGGCCAC CCGCGACGGC ATCGACTGGC GCGACCTCGC CCGCCGCGAG ATCGACCTGT TCCGGGCCGA CATGACGGCG CTGCGCATCC TGCCCCCCGA CCACTACGTC GGCGTCGTCG AGGCCGTCGG GCTCATCGTC GACATGGTCT CCCAGCTCGT CGAGCGCGGC GCCGCGTACT CCGTCGACGG CGATCTCTAC TTCTCCGTCG CCGCGGCGCC GGACTTCGGC CAGGTGGCCC ACCTCGACCC CGCGCAGATG CTGGTGTCCT GCGCCGAGCA CGGCGGTGAT CCCGGCCGGC CGGGCAAGAA GGACCCGCTC GACCCGCTGC TGTGGCGCGC GGAGCGGCCG GGGGAGCCGT CCTGGCCGTC GCCGTTCGGC CCGGGGCGGC CGGGCTGGCA CGTGGAGTGC TCCGCGATCG CCCGGCACTA CCTCGGCGCC ACGATCGACA TCCAGGGCGG CGGGAGCGAT CTCGCCTTCC CACACCACGA GTGCAGCGCC GCGCACGCCG AGGTCGCGAA CGGCGCCCGG CCGTTCGCCC GCGCCTACGT CCACACCGCG CTGGTCAGCC TGGACGGCCA CAAGATGTCG AAGTCGCGGG GGAACCTCGA GTTCGTCTCG AGGCTGCTCG CGCGCGGAGC GGACCCGGCC GCGATCAGGC TCGCCCTGTT GCAGCATCAT CACACCGTGG AGTGGGAGTG GACCGCGGCC GCCATGCCGG CGGCAGCCGA GCGGCTCGAC CGGTGGCGCG CCGCCGTCGC GCTGCCGTCC GGCCCAGACT TCCGGCCGGT GCTGGCCGAG GTCCGGGACC GGCTCGCCGA CGACCTCGAC GCGCCTGGAG CGCTCGCCGC GGTCGACGCG TGGGCGGCGG CCGCACTGGC CGCTGGCAGC GGCGGCAGCG GCGAGGCGGA CGACCAGGCG CCGGCCACCG TCCGCGACAC CGTCGACTCA CTGCTCGGTG TCGACCTTGG GCCTGTCCTA CCGAGAGGAA CCTAG
|
Protein sequence | MPCSSPACPA CPAAGPAPVT LTRMQAWPFP QLPKLPGQGR DLRVLDTAHG GVRTLDLGPT VRLYACGITP YDATHLGHAF TYLTYDLVQR VLRDAGHEVR YVQNVTDVDD PLLERATRDG IDWRDLARRE IDLFRADMTA LRILPPDHYV GVVEAVGLIV DMVSQLVERG AAYSVDGDLY FSVAAAPDFG QVAHLDPAQM LVSCAEHGGD PGRPGKKDPL DPLLWRAERP GEPSWPSPFG PGRPGWHVEC SAIARHYLGA TIDIQGGGSD LAFPHHECSA AHAEVANGAR PFARAYVHTA LVSLDGHKMS KSRGNLEFVS RLLARGADPA AIRLALLQHH HTVEWEWTAA AMPAAAERLD RWRAAVALPS GPDFRPVLAE VRDRLADDLD APGALAAVDA WAAAALAAGS GGSGEADDQA PATVRDTVDS LLGVDLGPVL PRGT
|
| |