Gene NATL1_10341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_10341 
Symbol 
ID4780908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp952977 
End bp954704 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content34% 
IMG OID640084313 
Productcell division protein FtsH4 
Protein accessionYP_001014857 
Protein GI124025741 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.194795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000108584 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAAAA GTTATAGTCA ATTATTAAGG GATATTGAAA GTGGAGAAAT TATTTCAATT 
ATTTTAATTC CAAATAGAAG AGAGGTCATT GTTGAATTAA TAAATGGAGA GAAAAAATTA
ATACCTATAT TTTATAATGA TCAGAAAATT CTTCGAATAT CAGAGGAATA TAATGTTCCT
CTTACTGTAA GAGATATTAG ATCAGATCAG AGATTAGCAA ACTTCATAAC AGGTTTTGGC
CTCACTTTGA TTTTTGTTTT CTCTCTTGTA TTTCTAATTA GAAGATCATC CAAGTTATTA
AACAATCTGC AAAGTTTTTC TGGTCGTTCT TCTCAAGTAA ATGAAGATGA TATTAGAAAA
TACACATTCG ATGATGTGGC TGGATTAAAT CAAGAATCAG ACGAATTAAA GGAAATAGTA
ATCTTTTTAA AGAATCCACA GACATTGAAA GACCTGGGAG CTAAAACGCC GAAGGGAGTT
TTATTAGTGG GTCCACCTGG AACCGGTAAG ACATTACTAG CTCGCTCAAT TGCAGGAGAA
GCTGATGTTC CTTTCTTTTC TATTTCTGCA TCGGAATTTG TTGAAATGTT CGTAGGTGTT
GGAGCTGGAC GTGTCCGTGA TTTATTTAAA AGTGCTAAAT CAAAGGCTCC TTGCATAGTC
TTCATAGATG AAATTGACTC TATTGGACGA CAAAGGGGAG CTGGAATAGG AGGAGGCAAC
GATGAGAGAG AGCAAACCCT TAACCAGCTC TTAACCGAGA TGGATGGATT CGAAGCAAAT
AATGGAGTCA TAGTTATTGC AGCTACGAAT AGACCCGATA TTCTTGACCG AGCATTAACT
AGGCCTGGTA GATTTGACCG TCGTATTGAC ATATCCCTTC CTGATCGAGA AGCTAGACAC
AAAATTTTGT CCGTTCACGC AAGGACAAAA CCTTTATGTG ATTCAGTGAA TCTGAAAGAC
TGGGCAACTA AGACGCCTGG CTACTCAGGT GCAGATTTAC AAAACCTAAT GAATGAGGCT
GCAATCTATG CTGCTAGAAA TAATAAGTCA GTCATAAGTA GTATTGAACT AGAAAATGCA
CTTGAAAAAA CACGATTTGG AATACTGTCT AAGCCACTTT CAGATCAGAT AAAAAAAAGA
CAAATTGCTT ATCAGGTTAT TGGTAAAACC CTTGTTGCTT TATTGATTCC GACCCAAGAT
AAATTAGAAA AAATCTCTTT ATTTAAGTCT CTCGGAAATA TTTCTGGAAT GACTTATTTC
ACACCAGATG AGGAGACCAT AGACAGCGGA CTTTTAACAC GTAACTACAT TTATAACAAA
ATTGTTATTT CACTTGGTTC TAGGGCTGCT GAAATGATTA TTTTTGGCTC TAAGGAAGTT
ACACAAGGAT CGCAAAAGGA ACTAGAAAAT GTATATTTTT GGGCAAATCA AATGGTTACT
AAATTTGGAT TCTCAGATCT TGGACCTATT ACTTATGATT CAGAAAAAGA TACAATTTTT
CTTGGAAAAG ATCTTATGAA GAATAAAAAT GAATATTCTC AGAAGACAAG CAGGGAAATT
GATAAGCAAA TTATTTCAAT AGCAAATAAG GCTGTTAATC ATGCAATATT TCTTCTTTCA
GATAAAGTTT CCTTAATGGA TAATCTTGTT GATGAATTAA TAGTTAAGGA AACTCTTGAA
TCTGATTTTA TAATTGATTC ACTTAATTCT TACTTATCTA GCAACTAA
 
Protein sequence
MSKSYSQLLR DIESGEIISI ILIPNRREVI VELINGEKKL IPIFYNDQKI LRISEEYNVP 
LTVRDIRSDQ RLANFITGFG LTLIFVFSLV FLIRRSSKLL NNLQSFSGRS SQVNEDDIRK
YTFDDVAGLN QESDELKEIV IFLKNPQTLK DLGAKTPKGV LLVGPPGTGK TLLARSIAGE
ADVPFFSISA SEFVEMFVGV GAGRVRDLFK SAKSKAPCIV FIDEIDSIGR QRGAGIGGGN
DEREQTLNQL LTEMDGFEAN NGVIVIAATN RPDILDRALT RPGRFDRRID ISLPDREARH
KILSVHARTK PLCDSVNLKD WATKTPGYSG ADLQNLMNEA AIYAARNNKS VISSIELENA
LEKTRFGILS KPLSDQIKKR QIAYQVIGKT LVALLIPTQD KLEKISLFKS LGNISGMTYF
TPDEETIDSG LLTRNYIYNK IVISLGSRAA EMIIFGSKEV TQGSQKELEN VYFWANQMVT
KFGFSDLGPI TYDSEKDTIF LGKDLMKNKN EYSQKTSREI DKQIISIANK AVNHAIFLLS
DKVSLMDNLV DELIVKETLE SDFIIDSLNS YLSSN