Gene Information |
Locus tag | NATL1_10341 |
Symbol | |
ID | 4780908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 952977 |
End bp | 954704 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640084313 |
Product | cell division protein FtsH4 |
Protein accession | YP_001014857 |
Protein GI | 124025741 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
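The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon; the helper names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag NATL1_10341.
START_BP, END_BP = 952977, 954704

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1728 575"
```

Both results agree with the Gene Length (1728 bp) and Protein Length (575 aa) rows, which is consistent with the record marking the gene as neither spliced nor a pseudogene.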
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.194795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000108584 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAAAA GTTATAGTCA ATTATTAAGG GATATTGAAA GTGGAGAAAT TATTTCAATT ATTTTAATTC CAAATAGAAG AGAGGTCATT GTTGAATTAA TAAATGGAGA GAAAAAATTA ATACCTATAT TTTATAATGA TCAGAAAATT CTTCGAATAT CAGAGGAATA TAATGTTCCT CTTACTGTAA GAGATATTAG ATCAGATCAG AGATTAGCAA ACTTCATAAC AGGTTTTGGC CTCACTTTGA TTTTTGTTTT CTCTCTTGTA TTTCTAATTA GAAGATCATC CAAGTTATTA AACAATCTGC AAAGTTTTTC TGGTCGTTCT TCTCAAGTAA ATGAAGATGA TATTAGAAAA TACACATTCG ATGATGTGGC TGGATTAAAT CAAGAATCAG ACGAATTAAA GGAAATAGTA ATCTTTTTAA AGAATCCACA GACATTGAAA GACCTGGGAG CTAAAACGCC GAAGGGAGTT TTATTAGTGG GTCCACCTGG AACCGGTAAG ACATTACTAG CTCGCTCAAT TGCAGGAGAA GCTGATGTTC CTTTCTTTTC TATTTCTGCA TCGGAATTTG TTGAAATGTT CGTAGGTGTT GGAGCTGGAC GTGTCCGTGA TTTATTTAAA AGTGCTAAAT CAAAGGCTCC TTGCATAGTC TTCATAGATG AAATTGACTC TATTGGACGA CAAAGGGGAG CTGGAATAGG AGGAGGCAAC GATGAGAGAG AGCAAACCCT TAACCAGCTC TTAACCGAGA TGGATGGATT CGAAGCAAAT AATGGAGTCA TAGTTATTGC AGCTACGAAT AGACCCGATA TTCTTGACCG AGCATTAACT AGGCCTGGTA GATTTGACCG TCGTATTGAC ATATCCCTTC CTGATCGAGA AGCTAGACAC AAAATTTTGT CCGTTCACGC AAGGACAAAA CCTTTATGTG ATTCAGTGAA TCTGAAAGAC TGGGCAACTA AGACGCCTGG CTACTCAGGT GCAGATTTAC AAAACCTAAT GAATGAGGCT GCAATCTATG CTGCTAGAAA TAATAAGTCA GTCATAAGTA GTATTGAACT AGAAAATGCA CTTGAAAAAA CACGATTTGG AATACTGTCT AAGCCACTTT CAGATCAGAT AAAAAAAAGA CAAATTGCTT ATCAGGTTAT TGGTAAAACC CTTGTTGCTT TATTGATTCC GACCCAAGAT AAATTAGAAA AAATCTCTTT ATTTAAGTCT CTCGGAAATA TTTCTGGAAT GACTTATTTC ACACCAGATG AGGAGACCAT AGACAGCGGA CTTTTAACAC GTAACTACAT TTATAACAAA ATTGTTATTT CACTTGGTTC TAGGGCTGCT GAAATGATTA TTTTTGGCTC TAAGGAAGTT ACACAAGGAT CGCAAAAGGA ACTAGAAAAT GTATATTTTT GGGCAAATCA AATGGTTACT AAATTTGGAT TCTCAGATCT TGGACCTATT ACTTATGATT CAGAAAAAGA TACAATTTTT CTTGGAAAAG ATCTTATGAA GAATAAAAAT GAATATTCTC AGAAGACAAG CAGGGAAATT GATAAGCAAA TTATTTCAAT AGCAAATAAG GCTGTTAATC ATGCAATATT TCTTCTTTCA GATAAAGTTT CCTTAATGGA TAATCTTGTT GATGAATTAA TAGTTAAGGA AACTCTTGAA TCTGATTTTA TAATTGATTC ACTTAATTCT TACTTATCTA GCAACTAA
|
Protein sequence | MSKSYSQLLR DIESGEIISI ILIPNRREVI VELINGEKKL IPIFYNDQKI LRISEEYNVP LTVRDIRSDQ RLANFITGFG LTLIFVFSLV FLIRRSSKLL NNLQSFSGRS SQVNEDDIRK YTFDDVAGLN QESDELKEIV IFLKNPQTLK DLGAKTPKGV LLVGPPGTGK TLLARSIAGE ADVPFFSISA SEFVEMFVGV GAGRVRDLFK SAKSKAPCIV FIDEIDSIGR QRGAGIGGGN DEREQTLNQL LTEMDGFEAN NGVIVIAATN RPDILDRALT RPGRFDRRID ISLPDREARH KILSVHARTK PLCDSVNLKD WATKTPGYSG ADLQNLMNEA AIYAARNNKS VISSIELENA LEKTRFGILS KPLSDQIKKR QIAYQVIGKT LVALLIPTQD KLEKISLFKS LGNISGMTYF TPDEETIDSG LLTRNYIYNK IVISLGSRAA EMIIFGSKEV TQGSQKELEN VYFWANQMVT KFGFSDLGPI TYDSEKDTIF LGKDLMKNKN EYSQKTSREI DKQIISIANK AVNHAIFLLS DKVSLMDNLV DELIVKETLE SDFIIDSLNS YLSSN
|
| |
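The relationship between the gene sequence and the protein sequence can be illustrated on the first 30 nucleotides of the record. This is a sketch under stated assumptions: the displayed gene sequence is taken to be the coding strand (it begins with ATG even though the gene lies on the minus strand), and sense codons are decoded with the standard assignments, which translation table 11 shares. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same sense-codon assignments as the standard code; it differs
# only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, copied from the record.
prefix = clean("ATGAGTAAAA GTTATAGTCA ATTATTAAGG")
print(translate(prefix))  # prints "MSKSYSQLLR"
```

The output matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` is how the GC content row could be checked; the 30-base prefix alone is too short to be representative of the 34% reported for the whole gene.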