Gene NATL1_21621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21621 
SymbolaspS 
ID4780569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1818274 
End bp1820112 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content37% 
IMG OID640085460 
Productaspartyl-tRNA synthetase 
Protein accessionYP_001015982 
Protein GI124026867 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0173] Aspartyl-tRNA synthetase 
TIGRFAM ID[TIGR00459] aspartyl-tRNA synthetase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.403344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATTG AGAATTTTAT GCGCAATAAG ACTTGTGGAG AACTACGTGC TTCCGCAATT 
AGCGCAAATG TTCAACTATG TGGTTGGGTT GATCGCAGAA GGGATCATGG CGGAGTAATT
TTTATTGATC TAAGAGACCG TTCTGGAACA ATTCAAATCA CAGTTGATCC AGATCAAGGT
CAGGATCTTT TTAGCATCGC TGAGAGTCTT AGAAATGAGA CTGTTCTTCA GATCAATGGA
TTAGTAAGAG CAAGACCTGA CGAAGCTATT AATACAAAAA TCCCAACCGG TGAAGTAGAA
GTCTTAGCTA AAAATATAAA AATTCTCAAT ACTGTCACTA GTACACTTCC TTTCTCAGTG
TCAATTCATG ATGAGGAGAG TGTTAAAGAA GAAATCAGAC TAAGGCATAG ATATTTAGAT
CTCAGAAGAG AGAGAATGAA TAATAATCTT CGATTGAGAC ATAACACGGT CAAAGCGGCT
AGAAGTTTTC TTGAAAACGA AGGATTTATA GAAGTTGAAA CACCAATTTT GACTCGCTCA
ACTCCTGAAG GAGCCAGAGA TTACTTAGTA CCCTCACGTG TATGTGGTGG CGAGTTTTTT
GCTTTACCGC AATCCCCACA ATTATTCAAA CAATTGTTGA TGGTTGGTGG AGTTGAACGT
TATTACCAAG TCGCTCGTTG TTTTCGTGAT GAAGATTTAC GCGCAGACAG GCAACCAGAA
TTTACTCAAT TAGATATTGA AATGAGTTTT ATGGAGGAAA AAGAGATCAT CGAATTAAAT
GAAAAATTAA TTGTAAGTAT ATGGAAAAAA ATTAAAGGGA TTGATCTCCA AACTCCATTT
CCGAGAATGA CTTGGCAAGA ATCTATGGAT CGTTTTGGAA CTGACAGACC TGATACTCGA
TATGGAATGG AACTTGTCAA CACAAGTGAT TTATTTTCCA AAAGTGGATT TAAAGTTTTT
TCAAATGCTA TTTCTTCTGG TGGATGCGTT AAGTGCATCA CCATTGAGGA TGGAAATAAT
TTGATTAGTA ATGTAAGAAT AAAACCGGGT GGAGATATTT TTAGCGAAGC CCAAAAGGCT
GGTGCTGGTG GACTAGCATT TATCAGGGTT CGAGATGATC AAGAAGTCGA TACAATTGGA
GCCATAAAAG ATAATTTAAC TACCTCGCAA ATAAAAGAAC TCCTATTAAA AACCCAAGCT
AAACCTGGTG ATCTAATACT TTTTGGTGCT GGGCCCACAA ACATTGTTAA TAGAACCTTA
GATAGAGTTC GTCAATTTAT TGCGAAAGAT CTAAAGATAA TCTCAGACAA CGAATTAAAA
ACTCAGTGGA ATTTTCTTTG GGTCACTGAT TTTCCTATGT TTGAATTCAA TTCTGATGAA
AATCGTCTTG AAGCAATTCA TCATCCTTTC TGTGCTCCTA AGCCTGAAGA TATTGGTGAA
TCAGAAAGCC TATGGAAAGA CAAATTACCC AATTCAAATG CTCAAGCGTA TGATCTAGTT
CTTAATGGAT TAGAAATTGG CGGGGGATCT TTAAGAATTC ACAACTCAGA ACTTCAAAAA
ACCGTACTAG AAGTAATTGG TCTATCAAAA AATGAAGCAG AAGAGCAGTT TGGTTTTTTA
ATTGATGCCC TTGCCATGGG TGCTCCACCA CATGGTGGGA TTGCATTTGG ACTGGACAGA
ATAGTTATGC TCTTAGCCAA TGAAGATTCA ATTAGAGATA CTATTGCTTT TCCAAAAACA
CAACAAGCTC GTTGTTCTAT GGCTAAAGCG CCTGCAAACG TGGAAAACAA ACAATTAGAA
GACCTCCACA TAGCTTCTAC TTGGATAGAT CCTGATTGA
 
Protein sequence
MMIENFMRNK TCGELRASAI SANVQLCGWV DRRRDHGGVI FIDLRDRSGT IQITVDPDQG 
QDLFSIAESL RNETVLQING LVRARPDEAI NTKIPTGEVE VLAKNIKILN TVTSTLPFSV
SIHDEESVKE EIRLRHRYLD LRRERMNNNL RLRHNTVKAA RSFLENEGFI EVETPILTRS
TPEGARDYLV PSRVCGGEFF ALPQSPQLFK QLLMVGGVER YYQVARCFRD EDLRADRQPE
FTQLDIEMSF MEEKEIIELN EKLIVSIWKK IKGIDLQTPF PRMTWQESMD RFGTDRPDTR
YGMELVNTSD LFSKSGFKVF SNAISSGGCV KCITIEDGNN LISNVRIKPG GDIFSEAQKA
GAGGLAFIRV RDDQEVDTIG AIKDNLTTSQ IKELLLKTQA KPGDLILFGA GPTNIVNRTL
DRVRQFIAKD LKIISDNELK TQWNFLWVTD FPMFEFNSDE NRLEAIHHPF CAPKPEDIGE
SESLWKDKLP NSNAQAYDLV LNGLEIGGGS LRIHNSELQK TVLEVIGLSK NEAEEQFGFL
IDALAMGAPP HGGIAFGLDR IVMLLANEDS IRDTIAFPKT QQARCSMAKA PANVENKQLE
DLHIASTWID PD