Gene Information |
Locus tag | Franean1_1694 |
Symbol | alaS |
ID | 5670096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2025168 |
End bp | 2027846 |
Gene Length | 2679 bp |
Protein Length | 892 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240612 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_001506038 |
Protein GI | 158313530 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
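The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2025168, 2027846)
print(length)                      # 2679
print(protein_length_aa(length))   # 892
```

Under these assumptions the computed values agree with the Gene Length (2679 bp) and Protein Length (892 aa) fields.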
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0569274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.130258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTTATC CCGACGACCC TGGACGACGT GAACCGATGG ACACGGCCGA GATCCGCCGC CGCTTTCTGA ACCATTTCTC CGAGCGGGGT CACACCGTGG TCCCGAGTGC GTCCCTGGTG GCGCAGGACC CGACCCTGCT GCTGGTGAAC GCCGGCATGG TTCCCTTCAA GCCCTACTTC CTCGGGGACC TGAAGGCGCC GTGGAACCGT GCGACCAGCG TGCAGAAATG CGTGCGGACC GTGGACATCG ACAACGTCGG CCGCACCGCC CGACACGCCT CCTTCTTCCA GATGTGCGGG AACTTCTCCT TCGGTGACTA CTTCAAGGCC GAGGCGATCC CGTTCGCCTT CGAGCTGATC GTCGACGGCT ACGGCTTCAA CCCCGACGAC CTGTGGGCCA CCGTCTACCT GGACGACGAC GAGGCCGAGG CGATTTGGCG CACCCTGCTG CCCGCCGAGC GCATCCAGCG GCGCGGCAAG AAGGACAACT TCTGGTCGAT GGGTGTGCCC GGCCCGTGCG GCCCGTGCAG CGAGATCTAC TTCGACCGCG GCCCGGCGTA CGGGCGCGAG GGCGGCCCGG AGGCGGACGA GGACCGCTAC CTGGAGATCT GGAACCTCGT CTTCATGCAG TTCGCGCGGG GCGAGGGCAG CGAGTACGGC TACGAGATCG TCGGTGATCT GCCCGCCCGC AACATCGACA CCGGGATGGG CCTGGAGCGG ATGGCCACCA TCCTGCAGGG TGTCGAGAAC CTGTACGAGA TCGACATCTC CCGTCCGGTG CTCGACGCGG CCGGCCGGCT CACCGGCACC CGCTACGGCG CCGACCCGGA TTCCGACGTC CGGCTGCGGG TCGTCGCCGA CCACACCCGG ACCGCGGCGA TGCTGATCTC GGACGGGGTC TCGCCGTCCA ACGAGGGCCG CGGGTACGTC CTGCGGCGGA TGCTGCGCCG GGCGGTGCGT GACGCCCGGC TGCTGGGCGC CCGTGAGCCG GTCATGGACG AGCTGTTCGG CGTGGTCCGC GCGGCGATGG GCCCGATCTA CCCGGAGCTC GTCGACCAGG CCGAGGCGAT CACGGCGGTC GCGGTCGCCG AGGAGACGGC TTTCCTGGAG ACGCTGCGCA CGGGCACCAC CCTCTTCGAC ACCGCGGTCA CCCAGGCCCG GTCCAGCGGG TCGTCCCAGC TCAGCGGCGA GTCGGCGTTC CGGCTGCACG ACACCTACGG GTTCCCGATC GACCTGACCA TGGACATGGC GGCCGACGCG GGCCTGACCG TGGACGAGGC CGGCTTCCGC CGGCTGATGG AGCGCCAGCG CCAGGCGGCG AAGGCCGACC GGGCGTCCCG CCGCATCGGC AACCTGGACC TCTCCGCCTT CCGGCCGATC CTCGCCGCCT CCGGCCCGAC GACGTTCACC GGCTACACCG AGCTCGGGCG CGAGTCGGGC ATCGTCGGCA TCGTCGGCAT CGGCGACGGC GACAGCCTGA CCGCGGCCGG CGAGGGGGAG GAGGTCGGCA TCCTGCTCGA CGCGACCCCC TTCTACGCCG AGAGCGGTGG CCAGGAGGCC GACCTGGGCC GGATCCGGTT CGACGGCGGC GAGGCCGAGG TGCTCGACGT CCAGCGCCCG GTGCCCGACC TGGTCATGCA CCGGGTGAAG GTGCTCGGTG GCGAGCTGCG TGTCGGCGCG GACGTGTTCG CCGAGGTGGA CGTCGAGCGC CGGCGCGCGG TGTCGCGCTC GCACACCGCC ACCCACCTCG TGCACACCGC GTTCCGCCGG GCGCTCGGGG AGTCGGCGAC GCAGGCCGGG 
TCGCTGAACT CGCCGGGCCG GCTGCGCTTC GACTTCCACG CGCTCGGCGC GGTGCCCGAC TCCGTCCTCG CCGACGTCGA GGACGAGGTC AACGAGATCG CCCTGCGTGA TCTGGAGGTC CGCTGGTACG TCACCTCTCA GGAGGAGGCG CGCCGGCTGG GCGCGATGGC GCTGTTCGGC GAGAAGTACG GCGACCGGGT CCGTGTCGTG GACGTCGGGG ACTACGCCCG CGAGCTGTGC GGTGGTACCC ATGTGGCCAG CTCGGCCCAG CTTGGCGCGA TCAAGCTGCT GTCCGAGTCG TCGATCTCGG CCGGGACGCG CCGGGTGGAG GCGCTGGTCG GCATGGATGC CTTCCGGTTC CTGGCCCGCG AGCACGTGCT CGTCTCGCAG CTCTCCAGCA CGCTCAAGGC CCGTCCGGAC GAGCTCGCCG ACCGGGTCGC CGACATCGTC GGGCGGCTGC GAGACGCGGA GAAGGAGCTG GAGCGGCTGC GGGCACAGGC GGTGCTGGCC GGCTCGGCGG CGCTCGCCGC CGGCGCCGAG GACGTGGGGG GCGTGGCGCT GGTCACCGCG CAGGTGCCCG CGGGCACTCC GGCAGACGAC GTCCGCCTGC TCGCCCTGGA TGTGCGCGGC CGGCTCGCCG GCCGGCCGGC GGTGGTCGCG GTCGTCGAGG CCGCCGGCGC GGCAGTCGTC GTGGCGACCG ACGAGACCGC GCGGACCCGC GGCCTGCGGG CCGGCGACCT GGTCCGGCAC TCCTGGGCCG CGCTCGGAGG CAAGGGCGGC GGCAAGCCTG ACGTCGCCCA GGGCGGACGC GGTGACGCGG ACATGATCCC GAAGGTCTTC GCCCGGCTGC GCGAGCTGGT CGCCGACCAG AGCGCGTGA
|
Protein sequence | MPYPDDPGRR EPMDTAEIRR RFLNHFSERG HTVVPSASLV AQDPTLLLVN AGMVPFKPYF LGDLKAPWNR ATSVQKCVRT VDIDNVGRTA RHASFFQMCG NFSFGDYFKA EAIPFAFELI VDGYGFNPDD LWATVYLDDD EAEAIWRTLL PAERIQRRGK KDNFWSMGVP GPCGPCSEIY FDRGPAYGRE GGPEADEDRY LEIWNLVFMQ FARGEGSEYG YEIVGDLPAR NIDTGMGLER MATILQGVEN LYEIDISRPV LDAAGRLTGT RYGADPDSDV RLRVVADHTR TAAMLISDGV SPSNEGRGYV LRRMLRRAVR DARLLGAREP VMDELFGVVR AAMGPIYPEL VDQAEAITAV AVAEETAFLE TLRTGTTLFD TAVTQARSSG SSQLSGESAF RLHDTYGFPI DLTMDMAADA GLTVDEAGFR RLMERQRQAA KADRASRRIG NLDLSAFRPI LAASGPTTFT GYTELGRESG IVGIVGIGDG DSLTAAGEGE EVGILLDATP FYAESGGQEA DLGRIRFDGG EAEVLDVQRP VPDLVMHRVK VLGGELRVGA DVFAEVDVER RRAVSRSHTA THLVHTAFRR ALGESATQAG SLNSPGRLRF DFHALGAVPD SVLADVEDEV NEIALRDLEV RWYVTSQEEA RRLGAMALFG EKYGDRVRVV DVGDYARELC GGTHVASSAQ LGAIKLLSES SISAGTRRVE ALVGMDAFRF LAREHVLVSQ LSSTLKARPD ELADRVADIV GRLRDAEKEL ERLRAQAVLA GSAALAAGAE DVGGVALVTA QVPAGTPADD VRLLALDVRG RLAGRPAVVA VVEAAGAAVV VATDETARTR GLRAGDLVRH SWAALGGKGG GKPDVAQGGR GDADMIPKVF ARLRELVADQ SA
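The relationship between the two sequence fields can be illustrated with a minimal sketch that strips the 10-base block spacing, translates the reading frame, and computes a GC fraction. It is not the database's own pipeline. It assumes the amino acid assignments of translation table 11, which match the standard code, and it treats ATG, GTG and TTG (some of the initiation codons of table 11) as methionine when they occur as the first codon. The names `translate` and `gc_fraction` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Subset of the table 11 initiation codons (assumption: enough for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean(cds)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
fragment = "GTGCCTTATC CCGACGACCC TGGACGACGT"
print(translate(fragment))  # MPYPDDPGRR
```

The translated fragment matches the first ten residues of the protein sequence, including the GTG start codon read as methionine. Running `translate` and `gc_fraction` on the full gene sequence would allow a comparison with the Protein sequence and GC content fields.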