Gene Information |
Locus tag | P9303_28371 |
Symbol | alaS |
ID | 4777394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2504068 |
End bp | 2506860 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088360 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_001018832 |
Protein GI | 124024525 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
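The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that convention reproduces the stated 2793 bp) and that the final codon is a stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumption: Start bp / End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate interval."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length_bp = gene_length(2504068, 2506860)
print(length_bp)                  # 2793
print(protein_length(length_bp))  # 930
```

Under these assumptions, 2793 bp corresponds to 931 codons, of which 930 encode amino acids and one is the stop codon, matching the listed protein length.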
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.578023 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAAGTGC ATCTGGTTCC CATCTCTAGG GTTGTTTGGG GTGGCGGGCA ACGTCATAGG GAGGTCCAAA GGCGATCATG GCTGGTGGAT CCTTGTTCCA CGTCCGAATT CTTCATGGCC GTTGCAAGAT CATTGCGTTC TGGTGAGTCG GGACCTCGCA CTGGGTCAGA GATCCGCACT GCTTTTCTGA CATTTTTTGC TGAGCGTGCG CATCAGGTGA TTCCTAGTGC TTCGTTGGTT CCTGAAGACC CCACGGTGCT ACTAACCATC GCTGGCATGC TGCCGTTTAA GCCCGTTTTC ATGGGTCAGG CTGAACGCCC TGCGCCGCGG GCCACCAGTA GCCAGAAATG TATTCGCACG AATGACATCG AGAACGTGGG TCGCACAGCG CGGCATCACA CGTTTTTTGA GATGCTTGGC AACTTCTCGT TTGGCGATTA CTTCAAGCAA CAAGCGATTG AGTGGGCCTG GGAGCTCTCG ACTGAGGTGT TTGGACTTAA TCCGAAGAAT TTGGTGGTGA GCGTCTTTCG TGAGGACGAT GAGGCTGAGG CCATCTGGCG AGATGTGGTG GGGGTGAACC CCAAGCGCAT CATTCGTATG GATGAGGCTG ATAATTTTTG GGCTTCAGGG CCGACTGGCC CCTGTGGACC TTGTTCGGAG ATCTATTACG ACTTCAAGCC TGATCTGGGC AACGACGACA TTGATCTGGA AGACGACGGT CGTTTTGTTG AGTTCTACAA CCTGGTTTTT ATGCAATACA ACCGCGATGG GGAGGGCAAT CTCACCCCAC TCGCGAACCG CAATATTGAT ACCGGCATGG GCTTGGAGCG GATGGCTCAG ATTTTGCAGG GCGTCCCTAA TAACTATGAA ACTGACATTA TTTACCCATT GATCGAGACG GCTGCTGGCC TGGCGGGTCT CGATTATCAA AAGCTTGATG ACAAGGGGAA GACCAGCTTC AAGGTGATCG GCGATCACTG CCGCGCGATT ACGCATCTGA TCTGTGATGG GGTGACTGCT AGCAACCTTG GCCGCGGTTA CATCATGCGG CGTTTGCTAC GCAGGGTGGT GCGTCATGGG CGACTGGTCG GGATCGAGAA GCCTTTCCTG CAGGCAATGG GGGAAGCGGC GATTGCCTTA ATGGTGGAGG CTTACCCCCA GCTTGAGGAG CGCCGCAAGC TGATTCTGGC GGAACTCAAT CGCGAGGAGG CCCGCTTCTT GGAAACGTTG GAGCGTGGTG AGAAGGTGCT GGCTGATGTG TTGGTTGCTA ATCCCCAGAT GATTTCAGGG GGCCAGGCCT TTGAGTTGTA CGACACCTAT GGCTTTCCTT TGGAACTCAC CCAGGAGATT GCTGAAGAGC ATGGTTTGAC TGTGGATCTC CAAGGATTTG AGCAAGCGAT GGACCAGCAA CGTCAGCGGG CTAAGGCCGC TGCAGTGAGC ATTGATCTCA CGCTTCAGGG GGCTATCGAG CAAATGGCAG CTGAGTTGGA GGCCACTCGC TTCAAGGGTT ATCAGGTCTT GGAGCAGCCC TGCTGTGTCT TGGCCCTAGT CGTGAATGGG GAGTCGGCCG AACGAGCCAG TGCTGGTGAC AATGTGCAGA TCGTGCTCGA TACCACGCCC TTCTACGGTG AAAGTGGCGG CCAGGTGGGT GATCACGGTG TGCTTTCGGG TGAAGGATCC GGTGGCAATG GTGTGATCGT GGCTGTTGAC GATGTGAGTC GTCATCGCAA CGTATTTGTG CATTTTGGTC GTATTGAGCG CGGCACGTTA GCCCTGGGTG ACCTGGTTAA CGCTCAGGTA 
GATCGGGCCT GTCGTCGCCG TGCCCAGGCC AATCACACCG CAACTCACCT CTTGCAGGCG GCGCTCAAGC AGGTCGTTGA TTCGGGGATC GGTCAGGCAG GTTCTCTGGT GGACTTCGAT CGCTTGCGCT TCGACTTCCA CTGTTCGCGA GCTGTTACGG CCAAGGAACT CGAGCAGATT GAGGCTTTGA TTAACGGTTG GATCATGGAA TCTCATGATC TGATTGTTGA GGAGATGTCG ATCCAAGAGG CCAAGGCTGC CGGCGCTGTA GCGATGTTCG GAGAGAAGTA CGCCGATGTG GTGCGCGTGG TGGATGTGCC AGGTGTGTCG ATGGAACTTT GCGGCGGAAC CCATGTGGCC AATACAGCTG AGATCGGCTT GTTCAAGATC GTTGCTGAGA GCAGTGTTGC TGCAGGAATT CGGCGGATTG AGGCGGTGGC TGGTCCGGCG GTGCTGGCTT ATCTCAATGA GCGTGATGTT GTCGTCAAGG AGTTGGGCGA TCGCTTCAAG GCGCAGCCCA GCGAAATCAT CGAACGGGTG ATATCGCTGC AGGAGGAACT GAAGAGCAGC CAAAAAGCGT TGACTGCAGC ACGGGCTGAA TTAGCTGTCG CGAAGTCAGC GGCCTTGGCA ACCCAGGCGG TAGCTGTTGG TGAATACCAG TTGTTGGTGG CCCGTCTTGA TGGGGTGGAG GGTGCAGGCT TACAAAACGC AGCTCAGGGC TTATTGGATC AATTGGGAGA TGCCACTGCT GTTGTGTTGG GAGGTTTGCC TGATCCGAGC GACGAAGGCA AGGTGATTTT GGTGGCAGCT TTTGGCAAGC AGGTGATCGC TCAGGGTCAG CAAGCGGGCA AGTTCATTGG TTCGATTGCC AAGCGTTGCG GCGGCGGCGG CGGCGGTCGC CCCAATCTGG CCCAGGCGGG TGGACGCGAT GGAGCGGCTT TGGATGGAGC ATTAGAAGCG GCAAAGGTTG AGCTGAAGCA ATCCTTGGGC TGA
Protein sequence | MQVHLVPISR VVWGGGQRHR EVQRRSWLVD PCSTSEFFMA VARSLRSGES GPRTGSEIRT AFLTFFAERA HQVIPSASLV PEDPTVLLTI AGMLPFKPVF MGQAERPAPR ATSSQKCIRT NDIENVGRTA RHHTFFEMLG NFSFGDYFKQ QAIEWAWELS TEVFGLNPKN LVVSVFREDD EAEAIWRDVV GVNPKRIIRM DEADNFWASG PTGPCGPCSE IYYDFKPDLG NDDIDLEDDG RFVEFYNLVF MQYNRDGEGN LTPLANRNID TGMGLERMAQ ILQGVPNNYE TDIIYPLIET AAGLAGLDYQ KLDDKGKTSF KVIGDHCRAI THLICDGVTA SNLGRGYIMR RLLRRVVRHG RLVGIEKPFL QAMGEAAIAL MVEAYPQLEE RRKLILAELN REEARFLETL ERGEKVLADV LVANPQMISG GQAFELYDTY GFPLELTQEI AEEHGLTVDL QGFEQAMDQQ RQRAKAAAVS IDLTLQGAIE QMAAELEATR FKGYQVLEQP CCVLALVVNG ESAERASAGD NVQIVLDTTP FYGESGGQVG DHGVLSGEGS GGNGVIVAVD DVSRHRNVFV HFGRIERGTL ALGDLVNAQV DRACRRRAQA NHTATHLLQA ALKQVVDSGI GQAGSLVDFD RLRFDFHCSR AVTAKELEQI EALINGWIME SHDLIVEEMS IQEAKAAGAV AMFGEKYADV VRVVDVPGVS MELCGGTHVA NTAEIGLFKI VAESSVAAGI RRIEAVAGPA VLAYLNERDV VVKELGDRFK AQPSEIIERV ISLQEELKSS QKALTAARAE LAVAKSAALA TQAVAVGEYQ LLVARLDGVE GAGLQNAAQG LLDQLGDATA VVLGGLPDPS DEGKVILVAA FGKQVIAQGQ QAGKFIGSIA KRCGGGGGGR PNLAQAGGRD GAALDGALEA AKVELKQSLG
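The gene sequence as listed starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how the 10-base display blocks could be cleaned, translated, and checked for GC content using only the Python standard library. The helper names are hypothetical; the codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Sketch: clean a block-formatted DNA sequence, translate it, compute GC content.
# Codon assignments: standard genetic code, ordered by bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display blocks of the gene sequence above.
FIRST_30 = "ATGCAAGTGC ATCTGGTTCC CATCTCTAGG"
print(translate(FIRST_30))    # MQVHLVPISR
print(gc_fraction(FIRST_30))  # 0.5
```

The translated prefix agrees with the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would allow the 930 aa length and the 56% GC content to be checked in the same way.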