Gene NATL1_19421 details


Gene Information

Locus tag: NATL1_19421
Symbol: rne
ID: 4779501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1594832
End bp: 1596754
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 33%
IMG OID: 640085232
Product: ribonuclease E/G
Protein accession: YP_001015762
Protein GI: 124026647
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1530] Ribonucleases G and E
TIGRFAM ID: [TIGR00757] ribonuclease, Rne/Rng family
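
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names span_length and expected_protein_length are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
start_bp = 1594832
end_bp = 1596754
gene_length_bp = 1923
protein_length_aa = 640


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record: 1596754 - 1594832 + 1 = 1923 and 1923 / 3 - 1 = 640.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)  # prints: True True
```

Under these assumptions the reported gene length and protein length agree with the coordinates, which is what one would expect for a gene that is neither spliced nor a pseudogene.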


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.839153
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAGC AAATTGTCAT TGCTGAGCAT TTGCGAATTG CCGCTCTGCT CACTGATGAG 
AGAATAGATG AATTAATCGT CGCTCAAGGT AGCTACCAAA TCGGAGATAT TTTCTTAGGA
ACCGTTGAAA ATGTTCTTCC TGGGATAGAC GCTGCTTTTG TCAATATTGG TGAAAGTGAA
AAAAATGGCT TTATTCATGT AAATGATTTA GGTCCGCTTA GATTAAAAAA AGCAACTGCA
GGAATAACAG AGTTGCTCGA ACCAAGGCAA AAGGTTTTGG TTCAGGTAAT GAAAGAACCC
ACCGGGACAA AAGGGCCTAG ATTAACTGGA AACATAGCTC TTCCTGGTAG ATATCTAGTA
CTCCAACCTT ATGGACAAGG GGTAAATATT TCGAGAAGAA TAAGTACTGA AAGTGAAAGA
AATCGACTTA GAGCATTAGG AGTATTAGTT AAGCCTCCAA GTACTGGACT TTTAATAAGA
ACAGAAGCGG AAGATATTTG TGAAGAATTT TTAATTGATG ATCTTGAAAA TCTTCTTAAA
CAATGGGAGC TTATTCAACA AGCATCAGAG AGTTGCTCTC CGCCAATTCT TTTAAATAGA
GATGAAGACT TTATTCATAG AATTCTTCGA GATCATACAG GTCAAAATCT TACTGAGATT
GTTGTAGATA ATTCTGAAGC AATTGGTCGA GTTAAAAACT TCCTTGGCAA GGATAGTAAT
GAATTAACAA TAGAATTACA TAACGATTCG GAAAATATTT TGGAGAAATA CAAAGTAATA
TCTTCAATTA ATGAAGCATT AAAACCTAGA GTTGATCTAC CTTCGGGTGG CTATATAATA
ATAGAGCCAA CAGAAGCTTT GACAGTTATT GATGTCAACT CAGGCTCATT TACACGATCT
GCGAATTCCA GAGAAACAGT ATTATGGACT AATTGTGAAG CCGCTATTGA GATAGCAAGA
CAATTAAAAT TAAGGAATAT TGGTGGGGTT ATTATTATTG ATTTTATTGA TATGGATACA
AAAAGAGATC AACTTCAATT ATTAGAACAT TTTACATCTG CTATTAATGG AGACTCGGCG
CGGCCACAAA TAGCTTCACT TACAGAACTT GGACTTGTTG AGCTCACTAG AAAAAGACAA
GGTCAAAATA TATATGAATT ATTTGGAAAG ACTTCTCCTA ATTCTCAAGG GCAAGGTTAT
CTTCCAAGCA TTACTATTCA AGACATAAAT CCAACAACCC CATCTGAAGC TGGCGTAATC
AATGCAACTT TAATATCAGG CGAAGATATT CAATCTTTAC AAGAAACTAA TAACAAAAAA
AAGCGTATAA ATAAAACAAG AGATATAGAA GCAAACTTAA GCAATGAAGA AAATAAATCA
TCTACAGACA ACTCAAAAGC TATCTCCACG GATACAATTA CTGAAGATAT TCAAAAAGAG
AGTAATAATA AAAGGAAAGA AACAACGATA ATAAATATCA ATATGAATCA GAATGAAGAG
ATTGTATATA GTTTGATGGG ATTAGATCCT ATTTTACTTT TAGAGAAACC TCCACTATCT
GAAAACTATA AGGTTAATAT AATCAGACCT GGGAAAAAGG AAGCTAGAGA AGAAAAAAAT
AACATACCTG AGGATAATCA ACAAAAAATA GTTGATGATT CTATTAGCAA ACATCAAAAT
AATAACAAGG ATATTATTCG TCTTAAAAAC AAAAGTAATA TTGAACAAAA ATCAACCAAT
TCTGACGTAA AAGAGAGTAT TGAAGAAGAA AATATAAATG TAGCTTTGGA TCAAGAAACA
AATGAATTAA TAAATATTAA TCATAATTCA ATAAGCGAAA AGAATGAATT ACCTTCCACC
GATTCACAAG AAGTTAATGA GGATCCAAGA CGAAAAAGAA GAAGGTCTTC AGCCTCTTCT
TAA
 
Protein sequence
MPQQIVIAEH LRIAALLTDE RIDELIVAQG SYQIGDIFLG TVENVLPGID AAFVNIGESE 
KNGFIHVNDL GPLRLKKATA GITELLEPRQ KVLVQVMKEP TGTKGPRLTG NIALPGRYLV
LQPYGQGVNI SRRISTESER NRLRALGVLV KPPSTGLLIR TEAEDICEEF LIDDLENLLK
QWELIQQASE SCSPPILLNR DEDFIHRILR DHTGQNLTEI VVDNSEAIGR VKNFLGKDSN
ELTIELHNDS ENILEKYKVI SSINEALKPR VDLPSGGYII IEPTEALTVI DVNSGSFTRS
ANSRETVLWT NCEAAIEIAR QLKLRNIGGV IIIDFIDMDT KRDQLQLLEH FTSAINGDSA
RPQIASLTEL GLVELTRKRQ GQNIYELFGK TSPNSQGQGY LPSITIQDIN PTTPSEAGVI
NATLISGEDI QSLQETNNKK KRINKTRDIE ANLSNEENKS STDNSKAIST DTITEDIQKE
SNNKRKETTI ININMNQNEE IVYSLMGLDP ILLLEKPPLS ENYKVNIIRP GKKEAREEKN
NIPEDNQQKI VDDSISKHQN NNKDIIRLKN KSNIEQKSTN SDVKESIEEE NINVALDQET
NELININHNS ISEKNELPST DSQEVNEDPR RKRRRSSASS
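
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of that clean-up step; clean_sequence and gc_fraction are hypothetical helper names, and the example uses only the first row of the gene sequence as a short illustration, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above, used only as a short illustration.
first_row = "ATGCCCCAGC AAATTGTCAT TGCTGAGCAT TTGCGAATTG CCGCTCTGCT CACTGATGAG"
row = clean_sequence(first_row)
print(len(row), round(gc_fraction(row), 2))
```

Applying the same two functions to the complete gene sequence would give a whole-gene value that can be compared with the GC content field in the record.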