Gene NATL1_16741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_16741 
Symbol 
ID4779748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1365451 
End bp1366854 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content30% 
IMG OID640084958 
Producthypothetical protein 
Protein accessionYP_001015495 
Protein GI124026379 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTTCAA AAGAAAAAGA ATCTGTGGGA GAACAAGAAG GAAAGAAAAA AGTCACTGAA 
GTAAAAACAT TTCCCATTCC TTTTGCTTTA GAAGAAATAA AAGAAAATAT CACCCTTAAC
ACCAAGACTA AATCCCAATT GCCTAAAGAA CAAATAATTA ATCAAGCTTT TAAATTTCAT
TCACAAGGAA ATATTTCAAA GGCAACAAAA TATTATCAGA TTTGTATAAA ACAGGGATTT
AATAATCCAC AAGTTTTTTC AAATTTTGGG ATTTTATTAA AAGAGATAGA TCAATTAAAA
GAGGCGGAAA AAATGATTAA ACAAGCTATT AAATTAAAAC CTGATTATGC TATAGCATAT
AATAACTTGG GAAATATATT AATAGATTTA GGCAGACTAA AAGAGGCAGA AATATATACT
AAAAAAGCTA TTGACTTAAA ACCTGATTAT GCAAATGCTT ATAATACATT AGGAAATATA
TTAAAAGAAT TGGACAATTT AAAAGATGCC GAAATTTGCT TTTCAAAGGC AATTTCATTG
GAGCCAGATC ATGAATCAGC AATTATTAAT AGAGGTCAAT TATATTTTGA TAAAGGAGAA
TTTAAGAAAG CCTTAAAAGA CTCTGACTTA TGTAATACAA AACAATCTAG AGCATTTTCT
TTGGAAATTC TTTATTCATT AGGGAGTATC AATGAAATTT ATAATAGAAT TGAAAAGACC
TATGCATTTG ATGATAAAAA CTTAAGGTTG GCAGCATTCT CTTCATTTAT ATCAGAACGG
GAAAATAAAT ATACTCATCA TAATTTTTGT CCAAAGCCAC TTAAATTTCT ACATTTCAAC
AATCTTAAAA ATCAACTTAA CAATTATGAG GAATTTATAA AAGGACTACT TAAAGAATTA
TCTGAGATTA AAACCGTTTG GGAACCACCA AAAAAAACAA CTCATAATGG ATTTCAAACT
CCAAGTTATA TAAATTTGTT TTCAGAATCT TCAATAAAAA TTTCAAAACT AAAGGCCATA
ATCTGTAATG AATTAGATTC TTATTATCTA AAATTCAAAA GAGAGTCTTG TTCTTATATT
AAAAAATGGC CTTCACATAA AAAGCTTTTG GGATGGCATG TAATCCTGAA GAAGCAAGGA
TATCAAGAGG CGCACATACA TCCAGCTGGC TGGCTAAGTG GAGTTATTTA TTTAAAGGTT
GTCCCTTCAC TAGGGAAAGA TGAGGGGGGA ATTGAATTTA GTTTAAATGG GCCGAATTAT
TCCAATATCA ACTCTCCACA ATTAATTCAT CAACCAGAAG TAGGTGATAT GGTTTTTTTC
CCCTCTTCAC TTCACCACAG GACTATCCCT TTCTCTACAG ATACAGATCG AATAGTCGTG
GCTTTTGACT TGATGCCAAA TTGA
 
Protein sequence
MLSKEKESVG EQEGKKKVTE VKTFPIPFAL EEIKENITLN TKTKSQLPKE QIINQAFKFH 
SQGNISKATK YYQICIKQGF NNPQVFSNFG ILLKEIDQLK EAEKMIKQAI KLKPDYAIAY
NNLGNILIDL GRLKEAEIYT KKAIDLKPDY ANAYNTLGNI LKELDNLKDA EICFSKAISL
EPDHESAIIN RGQLYFDKGE FKKALKDSDL CNTKQSRAFS LEILYSLGSI NEIYNRIEKT
YAFDDKNLRL AAFSSFISER ENKYTHHNFC PKPLKFLHFN NLKNQLNNYE EFIKGLLKEL
SEIKTVWEPP KKTTHNGFQT PSYINLFSES SIKISKLKAI ICNELDSYYL KFKRESCSYI
KKWPSHKKLL GWHVILKKQG YQEAHIHPAG WLSGVIYLKV VPSLGKDEGG IEFSLNGPNY
SNINSPQLIH QPEVGDMVFF PSSLHHRTIP FSTDTDRIVV AFDLMPN