Gene NATL1_16831 details


Gene Information

Locus tag: NATL1_16831
Symbol:
ID: 4780386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1370736
End bp: 1372643
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 39%
IMG OID: 640084967
Product: cell division protein FtsH3
Protein accession: YP_001015503
Protein GI: 124026388
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
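
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1370736
end_bp = 1372643

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1908
print(protein_length)  # 635
```

Both results agree with the Gene Length and Protein Length fields.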


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTTAACT TAGGTTTAAT TAAATTTTTG CTAATGCCAA TACGACAGGA CGAAAATCAA 
CCAAATAAAC GCTTTGGAAT AATTAATTTA GTTTTAATTG GTTTTGGAGC GCTTCTTCTC
TTTAGTAGTT TTTTTCCAAG CCAAAATACA CAAGTACCTA GAGTTCCTTA TTCACTTTTC
ATAAATCAAG TTGATGATGG AGAAGTTAAA CGTGCATACA TAACACAAGA TCAAATCAGA
TACGAACTTT CTACAGTTGA AGAAGGAGCC CCCTCCGTTC TAGCCACGAC TCCCATTTTT
GATATGGAGT TACCTCAAAG GCTCGAGAAA AAAGGTGTCG AGTTCGCAGC TGCCCCTCCC
AAAAAACCCA ATATATTTAC CACTATTCTT AGCTGGGTTG TACCACCACT AATATTTATT
CTTGTATTAC AGTTTTTCGC TCGAAGAAGC ATGGGTGGAG GTGGAGCACA AGGAGCCCTA
AGTTTCACAA AAAGTAAAGC TAAAGTTTAT GTCCCTGACG ATGAATCGAA GGTTACTTTC
GAGGATGTTG CAGGAGTTGA TGAAGCAAAG AACGAATTAA CTGAAATAGT TGATTTCCTT
AAGAAGCCAC AAAGATATAC AGATATTGGT GCAAGAATTC CAAAAGGTGT TTTATTAGTT
GGACCACCAG GAACTGGTAA GACTCTTTTG TCTAAGGCTG TAGCAGGTGA AGCAGAAGTT
CCTTTCTTTA TCATTTCCGG TTCTGAATTT GTTGAATTAT TCGTTGGTGC TGGTGCGGCG
AGAGTTAGAG ATTTATTTGA ACAAGCTAAG AAAAAAGCTC CGTGCATAAT TTTTATCGAT
GAATTAGATG CTATTGGTAA AAGTAGATCT GGATCAATGG GAGTAGTTGG AGGTAATGAT
GAAAGAGAGC AAACTCTTAA TCAACTACTT ACAGAAATGG ATGGGTTTTC TTCAGCTGAC
AAGCCAGTAA TTGTTCTTGC GGCAACAAAC CAACCCGAAG TACTTGATGC GGCGTTATTA
CGTCCGGGTA GATTTGATAG ACAAGTTCTT GTGGATAGAC CTGATTTATC TGGTAGGAAA
ACTATTTTAG AAATTTACAC AAAGAAAGTA AAACTCTCGG CAAAAATAGA TCTTGATAGA
ATTGCTCAAG CTACAAGCGG ATTTGCAGGC GCCGATTTAG CAAACATGGT AAATGAAGCG
GCTCTTTTAG CGGCAAGAGC CTACCGCCCA GAAGTCGAGC AGCAAGATTT AAATGAAGCC
ATTGAAAGAG TTGTCGCTGG GCTTGAGAAA AAAAGCAGAG TCTTGCAAGA TGATGAGAAA
AAGATTGTTG CTTATCACGA GGTTGGCCAC GCAATAGTGG GACACTTAAT GCCTGGTGGA
AGCAAAGTGG CAAAAATTTC AATAGTACCA AGAGGTATGA GCGCTTTAGG CTATACTCTT
CAACTCCCAA CAGAAGAAAG ATTCCTAAAT TCCAAAGAAG AACTACAAGG TCAAATCGCT
ACTCTTCTTG GGGGAAGATC TGCAGAGGAG ATAATTTTCG GAAAAGTTAC TACAGGTGCT
TCAAACGACT TGCAAAGAGC AACAGATATT GCTGAGCAAA TGGTAGGTAC ATATGGAATG
AGCGATATCC TAGGTCCATT GGCATATGAC AAACAAGGAG GGGGTCAATT CCTTGGAGGG
AACAATAATC CTAGAAGAGA ATTAAGTGAT GCTACTGCTC AAGCAATTGA TAAAGAAGTC
AGAAGTTTGG TAGATGATGC ACATGAAAAA GCTCTAAATA TCCTTAAAAA TAATCTTTCA
TTACTTGAAG ATATTTCTCA AAAAATCCTT GAGAAAGAAG TTATAGAAGG AGATGATCTA
ATTAAAATGC TATCAACCAG TGTTATGCCT GAAAAAGTTT CTAATTAA
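
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that check; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool.

```python
def clean_sequence(text):
    """Remove the block spacing and line breaks and return uppercase bases."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Return the percentage of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first two blocks are shown here as a placeholder.
example = """
TTGTTTAACT TAGGTTTAAT
"""
print(len(clean_sequence(example)))
print(round(gc_percent(example)))
```

With the full sequence pasted in, the two printed values can be compared against the Gene Length and GC content fields.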
 
Protein sequence
MFNLGLIKFL LMPIRQDENQ PNKRFGIINL VLIGFGALLL FSSFFPSQNT QVPRVPYSLF 
INQVDDGEVK RAYITQDQIR YELSTVEEGA PSVLATTPIF DMELPQRLEK KGVEFAAAPP
KKPNIFTTIL SWVVPPLIFI LVLQFFARRS MGGGGAQGAL SFTKSKAKVY VPDDESKVTF
EDVAGVDEAK NELTEIVDFL KKPQRYTDIG ARIPKGVLLV GPPGTGKTLL SKAVAGEAEV
PFFIISGSEF VELFVGAGAA RVRDLFEQAK KKAPCIIFID ELDAIGKSRS GSMGVVGGND
EREQTLNQLL TEMDGFSSAD KPVIVLAATN QPEVLDAALL RPGRFDRQVL VDRPDLSGRK
TILEIYTKKV KLSAKIDLDR IAQATSGFAG ADLANMVNEA ALLAARAYRP EVEQQDLNEA
IERVVAGLEK KSRVLQDDEK KIVAYHEVGH AIVGHLMPGG SKVAKISIVP RGMSALGYTL
QLPTEERFLN SKEELQGQIA TLLGGRSAEE IIFGKVTTGA SNDLQRATDI AEQMVGTYGM
SDILGPLAYD KQGGGQFLGG NNNPRRELSD ATAQAIDKEV RSLVDDAHEK ALNILKNNLS
LLEDISQKIL EKEVIEGDDL IKMLSTSVMP EKVSN
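
The protein sequence can be reproduced from the gene sequence using translation table 11, which shares the standard codon-to-amino-acid assignments but accepts alternative start codons such as TTG. The sketch below is a simplified illustration, not a full implementation of the table: it assumes the input is a complete coding sequence in frame, reads the first codon as methionine without checking that it is a valid start codon, and stops at the first stop codon. The function name `translate_cds` is hypothetical.

```python
# Standard codon assignments in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna_text):
    """Translate a block-formatted coding sequence into a protein string."""
    dna = "".join(dna_text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break  # stop codon: end of the protein
        # Assumption: the first codon is a start codon and is read as M,
        # which is how TTG at the start of this gene yields methionine.
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate_cds("TTGTTTAACT TAGGTTTAAT TAAATTTTTG"))  # MFNLGLIKFL
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (TCT AAT TAA) translate to the closing residues S and N followed by a stop.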