Gene P9303_28371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_28371 
SymbolalaS 
ID4777394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2504068 
End bp2506860 
Gene Length2793 bp 
Protein Length930 aa 
Translation table11 
GC content56% 
IMG OID640088360 
Productalanyl-tRNA synthetase 
Protein accessionYP_001018832 
Protein GI124024525 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0013] Alanyl-tRNA synthetase 
TIGRFAM ID[TIGR00344] alanine--tRNA ligase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.578023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGTGC ATCTGGTTCC CATCTCTAGG GTTGTTTGGG GTGGCGGGCA ACGTCATAGG 
GAGGTCCAAA GGCGATCATG GCTGGTGGAT CCTTGTTCCA CGTCCGAATT CTTCATGGCC
GTTGCAAGAT CATTGCGTTC TGGTGAGTCG GGACCTCGCA CTGGGTCAGA GATCCGCACT
GCTTTTCTGA CATTTTTTGC TGAGCGTGCG CATCAGGTGA TTCCTAGTGC TTCGTTGGTT
CCTGAAGACC CCACGGTGCT ACTAACCATC GCTGGCATGC TGCCGTTTAA GCCCGTTTTC
ATGGGTCAGG CTGAACGCCC TGCGCCGCGG GCCACCAGTA GCCAGAAATG TATTCGCACG
AATGACATCG AGAACGTGGG TCGCACAGCG CGGCATCACA CGTTTTTTGA GATGCTTGGC
AACTTCTCGT TTGGCGATTA CTTCAAGCAA CAAGCGATTG AGTGGGCCTG GGAGCTCTCG
ACTGAGGTGT TTGGACTTAA TCCGAAGAAT TTGGTGGTGA GCGTCTTTCG TGAGGACGAT
GAGGCTGAGG CCATCTGGCG AGATGTGGTG GGGGTGAACC CCAAGCGCAT CATTCGTATG
GATGAGGCTG ATAATTTTTG GGCTTCAGGG CCGACTGGCC CCTGTGGACC TTGTTCGGAG
ATCTATTACG ACTTCAAGCC TGATCTGGGC AACGACGACA TTGATCTGGA AGACGACGGT
CGTTTTGTTG AGTTCTACAA CCTGGTTTTT ATGCAATACA ACCGCGATGG GGAGGGCAAT
CTCACCCCAC TCGCGAACCG CAATATTGAT ACCGGCATGG GCTTGGAGCG GATGGCTCAG
ATTTTGCAGG GCGTCCCTAA TAACTATGAA ACTGACATTA TTTACCCATT GATCGAGACG
GCTGCTGGCC TGGCGGGTCT CGATTATCAA AAGCTTGATG ACAAGGGGAA GACCAGCTTC
AAGGTGATCG GCGATCACTG CCGCGCGATT ACGCATCTGA TCTGTGATGG GGTGACTGCT
AGCAACCTTG GCCGCGGTTA CATCATGCGG CGTTTGCTAC GCAGGGTGGT GCGTCATGGG
CGACTGGTCG GGATCGAGAA GCCTTTCCTG CAGGCAATGG GGGAAGCGGC GATTGCCTTA
ATGGTGGAGG CTTACCCCCA GCTTGAGGAG CGCCGCAAGC TGATTCTGGC GGAACTCAAT
CGCGAGGAGG CCCGCTTCTT GGAAACGTTG GAGCGTGGTG AGAAGGTGCT GGCTGATGTG
TTGGTTGCTA ATCCCCAGAT GATTTCAGGG GGCCAGGCCT TTGAGTTGTA CGACACCTAT
GGCTTTCCTT TGGAACTCAC CCAGGAGATT GCTGAAGAGC ATGGTTTGAC TGTGGATCTC
CAAGGATTTG AGCAAGCGAT GGACCAGCAA CGTCAGCGGG CTAAGGCCGC TGCAGTGAGC
ATTGATCTCA CGCTTCAGGG GGCTATCGAG CAAATGGCAG CTGAGTTGGA GGCCACTCGC
TTCAAGGGTT ATCAGGTCTT GGAGCAGCCC TGCTGTGTCT TGGCCCTAGT CGTGAATGGG
GAGTCGGCCG AACGAGCCAG TGCTGGTGAC AATGTGCAGA TCGTGCTCGA TACCACGCCC
TTCTACGGTG AAAGTGGCGG CCAGGTGGGT GATCACGGTG TGCTTTCGGG TGAAGGATCC
GGTGGCAATG GTGTGATCGT GGCTGTTGAC GATGTGAGTC GTCATCGCAA CGTATTTGTG
CATTTTGGTC GTATTGAGCG CGGCACGTTA GCCCTGGGTG ACCTGGTTAA CGCTCAGGTA
GATCGGGCCT GTCGTCGCCG TGCCCAGGCC AATCACACCG CAACTCACCT CTTGCAGGCG
GCGCTCAAGC AGGTCGTTGA TTCGGGGATC GGTCAGGCAG GTTCTCTGGT GGACTTCGAT
CGCTTGCGCT TCGACTTCCA CTGTTCGCGA GCTGTTACGG CCAAGGAACT CGAGCAGATT
GAGGCTTTGA TTAACGGTTG GATCATGGAA TCTCATGATC TGATTGTTGA GGAGATGTCG
ATCCAAGAGG CCAAGGCTGC CGGCGCTGTA GCGATGTTCG GAGAGAAGTA CGCCGATGTG
GTGCGCGTGG TGGATGTGCC AGGTGTGTCG ATGGAACTTT GCGGCGGAAC CCATGTGGCC
AATACAGCTG AGATCGGCTT GTTCAAGATC GTTGCTGAGA GCAGTGTTGC TGCAGGAATT
CGGCGGATTG AGGCGGTGGC TGGTCCGGCG GTGCTGGCTT ATCTCAATGA GCGTGATGTT
GTCGTCAAGG AGTTGGGCGA TCGCTTCAAG GCGCAGCCCA GCGAAATCAT CGAACGGGTG
ATATCGCTGC AGGAGGAACT GAAGAGCAGC CAAAAAGCGT TGACTGCAGC ACGGGCTGAA
TTAGCTGTCG CGAAGTCAGC GGCCTTGGCA ACCCAGGCGG TAGCTGTTGG TGAATACCAG
TTGTTGGTGG CCCGTCTTGA TGGGGTGGAG GGTGCAGGCT TACAAAACGC AGCTCAGGGC
TTATTGGATC AATTGGGAGA TGCCACTGCT GTTGTGTTGG GAGGTTTGCC TGATCCGAGC
GACGAAGGCA AGGTGATTTT GGTGGCAGCT TTTGGCAAGC AGGTGATCGC TCAGGGTCAG
CAAGCGGGCA AGTTCATTGG TTCGATTGCC AAGCGTTGCG GCGGCGGCGG CGGCGGTCGC
CCCAATCTGG CCCAGGCGGG TGGACGCGAT GGAGCGGCTT TGGATGGAGC ATTAGAAGCG
GCAAAGGTTG AGCTGAAGCA ATCCTTGGGC TGA
 
Protein sequence
MQVHLVPISR VVWGGGQRHR EVQRRSWLVD PCSTSEFFMA VARSLRSGES GPRTGSEIRT 
AFLTFFAERA HQVIPSASLV PEDPTVLLTI AGMLPFKPVF MGQAERPAPR ATSSQKCIRT
NDIENVGRTA RHHTFFEMLG NFSFGDYFKQ QAIEWAWELS TEVFGLNPKN LVVSVFREDD
EAEAIWRDVV GVNPKRIIRM DEADNFWASG PTGPCGPCSE IYYDFKPDLG NDDIDLEDDG
RFVEFYNLVF MQYNRDGEGN LTPLANRNID TGMGLERMAQ ILQGVPNNYE TDIIYPLIET
AAGLAGLDYQ KLDDKGKTSF KVIGDHCRAI THLICDGVTA SNLGRGYIMR RLLRRVVRHG
RLVGIEKPFL QAMGEAAIAL MVEAYPQLEE RRKLILAELN REEARFLETL ERGEKVLADV
LVANPQMISG GQAFELYDTY GFPLELTQEI AEEHGLTVDL QGFEQAMDQQ RQRAKAAAVS
IDLTLQGAIE QMAAELEATR FKGYQVLEQP CCVLALVVNG ESAERASAGD NVQIVLDTTP
FYGESGGQVG DHGVLSGEGS GGNGVIVAVD DVSRHRNVFV HFGRIERGTL ALGDLVNAQV
DRACRRRAQA NHTATHLLQA ALKQVVDSGI GQAGSLVDFD RLRFDFHCSR AVTAKELEQI
EALINGWIME SHDLIVEEMS IQEAKAAGAV AMFGEKYADV VRVVDVPGVS MELCGGTHVA
NTAEIGLFKI VAESSVAAGI RRIEAVAGPA VLAYLNERDV VVKELGDRFK AQPSEIIERV
ISLQEELKSS QKALTAARAE LAVAKSAALA TQAVAVGEYQ LLVARLDGVE GAGLQNAAQG
LLDQLGDATA VVLGGLPDPS DEGKVILVAA FGKQVIAQGQ QAGKFIGSIA KRCGGGGGGR
PNLAQAGGRD GAALDGALEA AKVELKQSLG