Gene NATL1_05661 details

Gene Information

Locus tag: NATL1_05661
Symbol: proS
ID: 4780178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 512261
End bp: 514051
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 33%
IMG OID: 640083843
Product: prolyl-tRNA synthetase
Protein accession: YP_001014393
Protein GI: 124025277
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
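
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. The function names are illustrative and not part of the database. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(512261, 514051)
print(length)                  # 1791
print(protein_length(length))  # 596
```

Both results agree with the Gene Length and Protein Length fields of this record.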


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTCT CCCGCCTAAT GCTGAACACT CTTAGAGACG TCCCTTCAGA AGCAGATATA 
ATTTCACATC AGTTACTGGT AAGAGGTGGT TATATTAAGC GCATAACCGG AGGTATTTAT
GCATATATGC CATTACTTTG GAAGGTTCTA AAAAAAATTA CCTCAATAGT TGAAGAAGAG
TTATCAACAA AAGGTTGCCT GCAAACTCTT CTCCCTCAAC TTCAGCCTTC AGAAATATGG
GAAAGAAGTG GGAGGTGGAA ATCATATACA CAGGGAGAAG GTATTATGTT TAGTCTTAAA
GATAGACAAG GGAAAGAACT AGGACTGGGA CCAACGCATG AAGAAGTAAT TACGCAAATA
ATTTCTCAAA CTATTCACTC TTACAAACAA TTACCGATAA ATATATTCCA AATTCAAACA
AAATTTAGAG ATGAAATAAG ACCAAGATTT GGGTTAATGA GAAGTAGAGA ATTCATCATG
AAGGATGCTT ATTCCTTTCA TGCAAATGAA AATGATCTTC AATCAACTTA TTCAGACATG
AGAAATGCCT ATCAAAATAT ATTTACAAAA TGTGGTCTAG ATTTTGTTTG TGTCGACGCA
GATAGTGGAG CAATTGGGGG TGCAGCATCT CAAGAATTCA TGGTAACAGC TGAGTCTGGG
GAGGACTTAA TTTTGATAAG TTCTGATGGC AAGTATGGGG CTAATCAAGA AAAAGCTGTT
TCCATTATTG AAGAAGGAAA CTTATTAGAA CCTAATAAAC CATCGATAAT TAAGACTCCT
AATCAAAAAA CAATAGATGA ATTATGTAAT TACAATGATT TCCACCCAAG TCAAATTGTA
AAAGTATTAG CTTATCTAGC AACGTGTGAT GATAATAAAA AATACCCAGT TCTAGTAAGT
ATTCGGGGGG ATCAAGAAAT AAATGATATT AAACTTTCAA ATAAAATATC TCAAGAATTA
AAGAAAAATG TACTTGATAT TAGAATTATT TATAATGAAG ACATGCAAAA GCAAGGCATT
ACTAATATAC CATTTGGTTT TATAGGTCCT GATCTTAGCG ATAATTTACT TGCACAATCA
AAAGGATGGG AAAAAAAATT CATAAGAATC GCTGACAATT CTGCAAAAGA TCTTAAAAGT
TTTATATGTG GAAACAATAT TAAAGATGAG CATAAAATAT TTTATAATTG GAATCTAATT
AATACTGTGC AACTGATATG TGATATTAGA AAAGCCAAAC CAGGAGACAG GTGTATTCAT
GATAAAACAC AAAAACTTGA AGAATGTAGA GGGATAGAAA TAGGGCATAT ATTTCAATTA
GGAACTAAGT ATTCTAAATC ATTAAATGCT ACTTTTACCA ACGAAAAAGG TATTGAAGAC
CACTTGTGGA TGGGGTGCTA TGGAATTGGT ATTTCCAGAT TAGCTCAAGC AGCAGTAGAA
CAAAATCATG ATGATTTAGG TATTATCTGG CCGACATCAA TTGCCCCTTT TACAGTAATA
ATTATCATTG CCAATATAAA GAATAATGAT CAAAAATGTT TAGCTGAAGA TATCTATCAA
AAATTAATAC AAAATCGAGT TGATGTTCTT CTTGACGATA GGGATGATAG GGCTGGGATC
AAGTTTAAAG ATGCAGACCT TATTGGAATC CCATGGAGGA TTGTTGCTGG GCGAGAAGCT
AGTTCGGGAC TAGTTGAATT ACATAATAGA AAAACAAAAA CTACAGAGTT GTTAGATCTG
AACTCCGTTT TAAAAAAGCT TTCTGAAGAA TTTAATACTG AAAAACTATA A
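
The gene sequence is displayed in blocks of ten bases separated by spaces, so it needs to be joined before any computation. A minimal sketch of that cleanup, together with a GC fraction calculation, is given here for the first row only. The helper names `clean_sequence` and `gc_fraction` are hypothetical. The GC content field of this record refers to the full gene, not to this single row.

```python
def clean_sequence(text: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence.
first_row = "ATGCGCGTCT CCCGCCTAAT GCTGAACACT CTTAGAGACG TCCCTTCAGA AGCAGATATA"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_fraction(seq))  # 0.5
```

Passing all rows of the gene sequence as one multi-line string to `clean_sequence` works the same way, because `str.split()` also splits on newlines.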
 
Protein sequence
MRVSRLMLNT LRDVPSEADI ISHQLLVRGG YIKRITGGIY AYMPLLWKVL KKITSIVEEE 
LSTKGCLQTL LPQLQPSEIW ERSGRWKSYT QGEGIMFSLK DRQGKELGLG PTHEEVITQI
ISQTIHSYKQ LPINIFQIQT KFRDEIRPRF GLMRSREFIM KDAYSFHANE NDLQSTYSDM
RNAYQNIFTK CGLDFVCVDA DSGAIGGAAS QEFMVTAESG EDLILISSDG KYGANQEKAV
SIIEEGNLLE PNKPSIIKTP NQKTIDELCN YNDFHPSQIV KVLAYLATCD DNKKYPVLVS
IRGDQEINDI KLSNKISQEL KKNVLDIRII YNEDMQKQGI TNIPFGFIGP DLSDNLLAQS
KGWEKKFIRI ADNSAKDLKS FICGNNIKDE HKIFYNWNLI NTVQLICDIR KAKPGDRCIH
DKTQKLEECR GIEIGHIFQL GTKYSKSLNA TFTNEKGIED HLWMGCYGIG ISRLAQAAVE
QNHDDLGIIW PTSIAPFTVI IIIANIKNND QKCLAEDIYQ KLIQNRVDVL LDDRDDRAGI
KFKDADLIGI PWRIVAGREA SSGLVELHNR KTKTTELLDL NSVLKKLSEE FNTEKL
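
The protein sequence corresponds to the gene sequence translated with the translation table given in the record (11). A minimal sketch of that translation, assuming the NCBI codon ordering (bases in the order T, C, A, G) and relying on the fact that table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. The `translate` function is illustrative, not the tool used to produce this record, and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCGCGTCT CCCGCCTAAT GCTGAACACT"))  # MRVSRLMLNT
```

The output matches the first ten residues of the protein sequence shown in this record.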