Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0366 |
Symbol | cysS |
ID | 5668790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 434305 |
End bp | 435825 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239298 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_001504738 |
Protein GI | 158312230 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0158714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00296969 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTCTTC ACCTCTATGA CACGCGCACC CGGCGCGTGC GGCCGTTTGT GCCGCTGCGC CCCGGTCATG TGGGCGTGTA CGTGTGTGGG CCCACCGTCC AGTCTCCGCC GCATGTGGGC CACATCCGTA CCGCGCTCGC GTTCGACCTG CTGCGGCGCT GGCTGACGCA GAGCGGCCTG TCGGTGACGT TCGTCCAGAA CGTCACCGAC ATCGACGACA AGATCATCGT CAACGCCGAC CGGGCGGGGA CGAGCGTGTG GGAGCTCGCC ACCAGGCAGA CCCGGGCGTT CGAGGACGCC TACCGGGCCG TCGGGATCCT GCCCCCGACG ATCTCCCCGC GGGCCACCGG GCACATCCCC GAGATGCTCG AGATGATCGC CGTCCTGGTC GAACGGGGGT TCGCCTATCC CGGCGCGGGC TCGGTGTGGT TCCGCGTCGG CGCGTTCGCG GACTATGGTG CGCTCTCCCA TCAGCGGCCC GCCGCGATGC ACCCGTCCCC GGAGTCCGAG CCGGGCAAGG CCGACCCCAG GGACTTCGCC CTGTGGAAGG CCGTGAAGCC CGGTGAGCCG TCCTGGTCCT CTCCCTGGGG CCCTGGCAGG CCGGGCTGGC ACCTGGAGTG CTCAGCGATG GTGGGCAAGT ACCTCGGCGA CGTGTTCGAC ATCCACGGCG GCGGGCTCGA TCTGGTGTTC CCGCATCACG AGAACGAGCG GGCGCAGTCC GTGTGCGCGG CGGAGCTGGC CGCCGCGGGC GCCGCGCCCG CGGCGGCCAG CTCCCGGGGT GCCGTGGCGG CTGGCGCCGA CGTGCCGGGT GCCGGCGGGG CGGGCGAGAT GGCGCGCTAC TGGATGCACG TGGGTCTGCT CACCACCGGT GGCACCAAGA TGTCGAAGTC GCTGGGCAAC TCGTTCTTCG TGACCGACGC GCTCGCCGCG GTACGCCCCC AGGTGCTGCG CTACCACCTG CTCTCCGCGC ACTACCGCTC CTCGCTGGAG TACAGCGCGC AGACCCTGGA AGAGTCCGCG GCGGCGCATG ACCGGATCGA GACGTTCGTC CGCAACGCCC TGGACATCCT GGGCGGCCCG GCGGAGGCCG CGGCGCTTGC CGCCGAGGCC GATCGCGTGG CGGGCACCGA GGCTGCGGAC GCGGACGGGG CCTGGTCCGA GTTCGCCGCG GCGCTGGACG ACGACCTCGG CGTCGGCCGC GCGCTGGCCG CCCTGTTCGG CGTGGTCGGC CGGGGCAACC AGGTGCTGTC GAAGACCCAC AGCCGCGAGC TGGCTGGCTG GGTGGACGTC GCCCGCCGGA TGCTGACCGT CCTCGGGCTC GACCCGGTCG AGCAGTGGCC GACGGCGGGC GCCGAGCTGC GTCCCGCGCT CGACGGCGTC ATGGACGTCC TGCTGGACCT GCGCTCGGCG GCCAGGGCCC GGCGGGACTA CAGCGAGGCC GACTCCATCC GCTCCCGGCT CGCGGCGGCC GGTGTGGTCA TCGAGGACAC GCCCGAAGGG CAGCGCTGGC ACCTCACGTA G
|
Protein sequence | MGLHLYDTRT RRVRPFVPLR PGHVGVYVCG PTVQSPPHVG HIRTALAFDL LRRWLTQSGL SVTFVQNVTD IDDKIIVNAD RAGTSVWELA TRQTRAFEDA YRAVGILPPT ISPRATGHIP EMLEMIAVLV ERGFAYPGAG SVWFRVGAFA DYGALSHQRP AAMHPSPESE PGKADPRDFA LWKAVKPGEP SWSSPWGPGR PGWHLECSAM VGKYLGDVFD IHGGGLDLVF PHHENERAQS VCAAELAAAG AAPAAASSRG AVAAGADVPG AGGAGEMARY WMHVGLLTTG GTKMSKSLGN SFFVTDALAA VRPQVLRYHL LSAHYRSSLE YSAQTLEESA AAHDRIETFV RNALDILGGP AEAAALAAEA DRVAGTEAAD ADGAWSEFAA ALDDDLGVGR ALAALFGVVG RGNQVLSKTH SRELAGWVDV ARRMLTVLGL DPVEQWPTAG AELRPALDGV MDVLLDLRSA ARARRDYSEA DSIRSRLAAA GVVIEDTPEG QRWHLT
|
| |