Gene NATL1_03171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03171 
SymbolileS 
ID4780471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp293790 
End bp296693 
Gene Length2904 bp 
Protein Length967 aa 
Translation table11 
GC content34% 
IMG OID640083583 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001014146 
Protein GI124025030 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATAACA TCAATAAAAA TTCTCAAAAG GATCGTCCCA CTTACAAAGA CACTCTCAAC 
CTTTTGCAGA CTAATTTTGG AATGAGGGCA AATGCAACTC TAAGAGAACC TGAGTTACAA
GCTTTTTGGA GAGAAAAAAA TATAGATTTC GAATTAGGCT TAAATAATAC TGGAGAGACT
TTTACTTTGC ATGATGGCCC GCCATATGCA AATGGAACGC TTCATATGGG GCATGCTCTC
AACAAAGTAT TGAAGGACAT AATCAATAAA TTTCAAACAA TGAAAGGGAA AAAAGTTTGT
TATGTCCCTG GATGGGATTG CCATGGATTG CCTATTGAAT TGAAAGTTCT TCAAGCTATG
GATAAAAGTC AACGAGCTGA ATTAACACCT ATTAAGTTGA GAAAAAAAGC TGCTGCTTAT
GCAAAAAAGC AAGTTTCCCA ACAAATGGAT GGTTTTAAAA GATGGGGCGT ATGGGGTGAC
TGGGATCAAC CATATTTAAG TTTAGACAAA AAGTTTGAGG CCTCTCAAAT CAAGTTGTTT
GGTGAAATGG TCTTCAAGGG ATACATATAT CGAGGCCTAA AACCAGTTCA TTGGAGTCCA
AGTTCTCAAA CAGCTCTGGC CGAGGCGGAA TTAGAATATC CAACCGGTCA TACTAGCAAA
AGTATTTATG TGGGATTTAA AGTTAATCAA ATACCAAAAA GATTAACTCA AGAAATTTCT
AAGCAAGCTC CAGATCTTAT TAATTCTGAA GGGAAATTAA AAGAAGTAAA ACTTGTCATT
TGGACTACTA CTCCTTGGAC AATTCCTGCA AATGAGGCCA TTTCTGTTAA CCAAAAATTA
GAATATGTAA TTGCACAAAG TTCTGATCGT TCATTGATAA TTATTGCTAA CGATCTTTTG
GATGAAGTAT CTAAGAGTGT AGGAATTAAT TATGAAAAAA GAGTATTAAT CAAAGGATCA
ATCTTAGATG GAATTATATA TAAACATCCT TTATTTGATA AAATAAGCCC TGTTGTTTTA
GGAGGAGATT ATATTACAAC TGAATCCGGA ACTGGATTAG TACATACTGC TCCAGGTCAT
GGTGTTGATG ATTTTAATAC TGGTAAAAAA TATAATTTAT CAATTTCTTG CACAGTTGAT
GCAAAGGGTT TTCTAACGAA AGAAGCCGGT AAATATGAAG GTCTAAATGT ATTAAAAGAT
GCTAATAGTG TCATAATAAG TGATCTAATT AATTCTGGAT CTTTGCTTAA AGAAATTCCA
TATGAGCATA GGTATCCTTA TGATTGGAGA ACTAAAAAAC CAACTATTTT TAGAGCTACA
GAACAATGGT TTGCTTCCGT TGAAGGATTT AGAGATAAAG CCCTTTCTGC CATAGAAGAT
GTTATTTGGC TTCCTGAATC GGGAAAAAAT AGAATTAATT CTATGGTTAG AGAAAGAGGA
GATTGGTGTA TCTCCCGACA AAGGACCTGG GGAGTTCCAA TACCAGTATT TTATGAAAAG
AATGGACAAG AAATCTTGCT CAATAAAGAA ACTATTTCTC ATATAGCTGA TTTATTTTCT
GTTCATGGAG CAGATATTTG GTGGGAATAT GAAGTATCTG AGCTATTACC TCCTTCTTAT
TTAAATCAGG CAGATCGATG GCAAAAAGGT ACTGATACTA TGGATGTTTG GTTTGACTCT
GGCTCTAGTT GGTCTTCAGT TATTTCTAAG AAAGAAAATT TAAACTATCC AGCAGATTTA
TATTTGGAGG GATCTGATCA ACATCGGGGT TGGTTCCAGT CCTCTTTATT AACTTCGGTA
GCAGTGAATG AACATGCACC TTTTAAAAAG GTCCTTACTC ATGGTTTTGC ATTAGATGAG
AATGGTAGGA AGATGAGTAA ATCCTTAGGA AACATTATTG ATCCTTTAGT TATAATTAAT
GGTGGTTCAA ATAAGAAATT AGATCCTGCG TATGGAGCTG ATGTTTTGAG GTTATGGGTT
AGTTCTGTTG ATTATTCTGC AGATGTTCCT ATTGGATCAA ACATACTAAA GCAAATTTCT
GATGTTTATC GTAAGGTTCG AAATACGTCT AGGTATCTAT TAGGTAACCT CTATGATTTT
GATTATAAAA TTGATTCCAT TGATATTGCT AACTTACCAT TGTTAGATAA GTGGATGTTG
AATAGAACAG CTGAAGTAAT TGATGAAATA TCAGATGCAT ATAATAATTT TGAATTTTCT
AAATTTTTCC AAACAATTCA AAATTTTTGT GTTGTTGATC TATCTAATTT TTACTTAGAT
ATTGCAAAAG ATAGGTTGTA TGTGAGTTCT AAATCTGACT TTAGAAGAAG AAGTTGTCAG
ACAGTTTTAT CCTTGGTAAT TGAAAAAATA TCTGGATTAA TTGCACCTGT TTTATGTCAT
ATGGCAGAAG ATATTTGGCA GAATATTCCA TATGACTTAG AGGAAGCCTC AGTATTTCAA
AGAGGATGGC CTAATGTACC TAAATCATGG CGAAATAGTA GTTTTAATTG TCATGTGACT
GAACTCCGTA AACTCAGAGC AGTTATTAAT CGTATGTTGG AGAGTTGTAG AAATAATCAA
GCGTTAGGTT CTTCTTTGGA AGCATCAGTA AGGGTTGATA TATCTGATGA AAAAGTTCAA
GCTGCTATTG AATGGTTAGC TGAAAGCGAA TCTAATAATG TTGATGTATT AAGAGATTGG
TTCCTAGTTT CATCTTTACA AATTGGCGGT GAGCCATGGG CTGAGGTTTT AGTTAGTGAG
GACAATGATT ATGCTTCAGT CGAGATTGCA AAAGCAAGGG GATTTAAGTG TGAAAGATGT
TGGCATTATG AAATAGAAAT GAGCAAGAAT CCTCAACATA CAAATATTTG CAAAAGGTGC
GAAAAAGTAG TCTTAGCTAT TTAA
 
Protein sequence
MNNINKNSQK DRPTYKDTLN LLQTNFGMRA NATLREPELQ AFWREKNIDF ELGLNNTGET 
FTLHDGPPYA NGTLHMGHAL NKVLKDIINK FQTMKGKKVC YVPGWDCHGL PIELKVLQAM
DKSQRAELTP IKLRKKAAAY AKKQVSQQMD GFKRWGVWGD WDQPYLSLDK KFEASQIKLF
GEMVFKGYIY RGLKPVHWSP SSQTALAEAE LEYPTGHTSK SIYVGFKVNQ IPKRLTQEIS
KQAPDLINSE GKLKEVKLVI WTTTPWTIPA NEAISVNQKL EYVIAQSSDR SLIIIANDLL
DEVSKSVGIN YEKRVLIKGS ILDGIIYKHP LFDKISPVVL GGDYITTESG TGLVHTAPGH
GVDDFNTGKK YNLSISCTVD AKGFLTKEAG KYEGLNVLKD ANSVIISDLI NSGSLLKEIP
YEHRYPYDWR TKKPTIFRAT EQWFASVEGF RDKALSAIED VIWLPESGKN RINSMVRERG
DWCISRQRTW GVPIPVFYEK NGQEILLNKE TISHIADLFS VHGADIWWEY EVSELLPPSY
LNQADRWQKG TDTMDVWFDS GSSWSSVISK KENLNYPADL YLEGSDQHRG WFQSSLLTSV
AVNEHAPFKK VLTHGFALDE NGRKMSKSLG NIIDPLVIIN GGSNKKLDPA YGADVLRLWV
SSVDYSADVP IGSNILKQIS DVYRKVRNTS RYLLGNLYDF DYKIDSIDIA NLPLLDKWML
NRTAEVIDEI SDAYNNFEFS KFFQTIQNFC VVDLSNFYLD IAKDRLYVSS KSDFRRRSCQ
TVLSLVIEKI SGLIAPVLCH MAEDIWQNIP YDLEEASVFQ RGWPNVPKSW RNSSFNCHVT
ELRKLRAVIN RMLESCRNNQ ALGSSLEASV RVDISDEKVQ AAIEWLAESE SNNVDVLRDW
FLVSSLQIGG EPWAEVLVSE DNDYASVEIA KARGFKCERC WHYEIEMSKN PQHTNICKRC
EKVVLAI