Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1957 |
Symbol | alaS |
ID | 6980696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2001344 |
End bp | 2003998 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396680 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_002281468 |
Protein GI | 209549551 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.103878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.2106 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGTG TGAACGATAT CCGGTCGACA TTCCTCGACT ATTTCAAGAA GAACGGTCAT GAGATCGTCT CGTCGAGCCC GCTCGTGCCG CGCAACGACC CGACGCTGAT GTTCACCAAT GCCGGTATGG TGCAGTTCAA GAACGTCTTC ACCGGCCTGG AGAAACGTCC TTATTCGACG GCGACGACTT CGCAGAAATG CGTGCGCGCC GGCGGCAAGC ATAATGACCT CGACAATGTG GGCTATACCG CCCGCCACCT GACCTTCTTC GAGATGCTCG GCAACTTCTC GTTCGGCGAC TATTTCAAGG AAAATGCGAT CGAGCTTGCC TGGAAGCTCG TCACCGAAGG TTTCGACCTG CCGAAACATC GCCTGCTGGT CACCGTCTAT TCCGAAGATG AGGAAGCAGC GACACTCTGG AAGAAGATCG CCGGCTTCTC CGACGACAAG ATCATCCGCA TCCCGACCTC CGACAATTTC TGGCAGATGG GCGATACCGG CCCCTGCGGT CCATGCTCTG AAATCTTCAT CGACCAGGGT GAGAACGTCT GGGGCGGCCC TCCCGGTTCG CCTGAGGAAG ATGGCGACCG GTTCCTGGAA TTCTGGAACC TCGTCTTCAT GCAGTTCGAG CAAACGGAGC CTGGCGTTCG CAACCCGTTG CCGCGTCCGT CGATCGATAC CGGCATGGGC CTGGAGCGCA TGGCGTGCAT TCTGCAAGGC GTTCAGAGCG TTTTCGACAC AGACCTGTTC CGCACACTGA TCGGCGCCAT CGAAGAGACG ATGGGCGCCA AGGCCAAGGG CAGCGCCAGC CACCGCGTCA TCGCCGACCA TCTGCGCTCT TCGGCCTTCC TGATCGCCGA CGGCGTGCTG CCGTCGAATG AAGGCCGCGG CTATGTGCTG CGCCGTATCA TGCGCCGCGC CATGCGGCAC GCCCAGCTTC TCGGCGCCAA GGAGCCGCTG ATGTACAAGC TGCTGCCGAC GCTGGTGCAG GAGATGGGCC GCGCTTATCC GGAACTCGTG CGCGCCGAAG CACTGATCTC GGAGACGCTG AAACTCGAAG AAGGCCGCTT CCGCAAGACG CTGGAGCGTG GCCTGTCGCT GCTGTCGGAT GCGACCACCG ATCTCGGCAA GGGCGACATG CTGGATGGCG AGACCGCCTT CAAGCTCTAC GACACCTATG GCTTCCCGCT CGACCTGACG CAGGACGCGC TGCGCGCCCG TGAGATCGGC GTCGATATCT CCGGCTTCAC CGATGCCATG CAGCGCCAGA AGGCCGAAGC TCGTTCGCAC TGGGCCGGTT CCGGCGAGAA GGCAACCGAA ACCATCTGGT TCGAGCTTAG AGAAAAGCAC GGCGCAACCG AATTCCTGGG TTACGACACC GAGACCGCTG AGGGCGTCGT GCAGGCGATC GTCAGGGAAG GCGCCGCTGC CGAAGAGGTC AAGGCCGGCG ACAAGGTGCA GATCGTCGTC AACCAGACGC CGTTCTACGG CGAGTCCGGC GGCCAGATGG GCGATACCGG CGTCATCTCG TCCGACCACG GCAGAATAGA GATCTCGGAC ACGCAGAAGA AGGGTGAAGG CCTCTTCGTG CATTCCGGCG TCGTTGTCGA AGGTGCATTC AAGGCTGGCG ATGCGGTTGT CCTGACCGTC GATCACGCCC GCCGTTCGCG CCTGCGCGCC AACCACTCGG CAACACACCT GCTGCATGAA GCGCTGCGCG AGGTGCTCGG CACCCATGTT GCCCAGAAGG GTTCTCTGGT TGCGCCCGAA CGGCTGCGTT TCGACGTGTC TCACCCGAAG CCGATGTCGG CCGAGGAGTT GAAGATCGTC GAAGATATGG CCAACGAGAT CGTGCTGCAG AATTCGGCCG TCACCACCCG CCTGATGAGC GTCGACGATG CCATCGAGGA AGGCGCGATG GCACTCTTCG GCGAGAAGTA CGGCGATGAA GTGCGTGTCG TCTCGATGGG CACCGGCGTT CATGGGGCAA AGAGCAACAA GCCTTACTCG GTCGAGCTTT GTGGCGGCAC GCATGTGTCG GCCACCGGTC AGATCGGCCT GATCCGCGTT CTCGGCGAAA GTGCTGTCGG CGCCGGGGTT CGCCGCATCG AAGCGGTGAC CGGTGAATCG GCGCGCGAAT ATCTCGCCGA GCAGGATGAT CGGGTGAAGA CGCTGGCTGC CTCGCTGAAG GTCCAGCCTT CGGAGGTTCT ATCGCGTGTC GAAGCCTTGA TGGACGAGCG CCGCAAGCTG GAAAAGGAAC TGGCCGATGC CAAGCGCAAA CTCGCCATGG GCGGCGGGCA GGGCGGCTCG GCCGATGCCG TGCGCGAAGT CGCCGGCGTC AAGTTCCTCG GCAAGTCGAT ATCGGGCGTC GATCCCAAGG ACCTCAAAGG GCTTGCCGAT GACGGCAAGA CCAGCATCGG CTCCGGCGTC GTCACGCTGA TCGGCGTGTC CGACGATGGC AAGGCGAGCG CCGTCGTCGC GGTGACGCCG GATCTCGTCG ATCGTTTCAG TGCTGTCGAT CTGGTGCGCG TCGCCTCGGC CGCCCTCGGC GGCAAGGGCG GCGGCGGCCG CCCCGACATG GCCCAGGCCG GCGGCCCGGA TGGCGCCAAG GCTGATGAGG CGCTTGAGGC TGTTGCAGCA GCACTCGCCG GCTGA
|
Protein sequence | MSGVNDIRST FLDYFKKNGH EIVSSSPLVP RNDPTLMFTN AGMVQFKNVF TGLEKRPYST ATTSQKCVRA GGKHNDLDNV GYTARHLTFF EMLGNFSFGD YFKENAIELA WKLVTEGFDL PKHRLLVTVY SEDEEAATLW KKIAGFSDDK IIRIPTSDNF WQMGDTGPCG PCSEIFIDQG ENVWGGPPGS PEEDGDRFLE FWNLVFMQFE QTEPGVRNPL PRPSIDTGMG LERMACILQG VQSVFDTDLF RTLIGAIEET MGAKAKGSAS HRVIADHLRS SAFLIADGVL PSNEGRGYVL RRIMRRAMRH AQLLGAKEPL MYKLLPTLVQ EMGRAYPELV RAEALISETL KLEEGRFRKT LERGLSLLSD ATTDLGKGDM LDGETAFKLY DTYGFPLDLT QDALRAREIG VDISGFTDAM QRQKAEARSH WAGSGEKATE TIWFELREKH GATEFLGYDT ETAEGVVQAI VREGAAAEEV KAGDKVQIVV NQTPFYGESG GQMGDTGVIS SDHGRIEISD TQKKGEGLFV HSGVVVEGAF KAGDAVVLTV DHARRSRLRA NHSATHLLHE ALREVLGTHV AQKGSLVAPE RLRFDVSHPK PMSAEELKIV EDMANEIVLQ NSAVTTRLMS VDDAIEEGAM ALFGEKYGDE VRVVSMGTGV HGAKSNKPYS VELCGGTHVS ATGQIGLIRV LGESAVGAGV RRIEAVTGES AREYLAEQDD RVKTLAASLK VQPSEVLSRV EALMDERRKL EKELADAKRK LAMGGGQGGS ADAVREVAGV KFLGKSISGV DPKDLKGLAD DGKTSIGSGV VTLIGVSDDG KASAVVAVTP DLVDRFSAVD LVRVASAALG GKGGGGRPDM AQAGGPDGAK ADEALEAVAA ALAG
|
| |