Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_18671 |
Symbol | |
ID | 4779391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1524972 |
End bp | 1527755 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640085156 |
Product | putative DNA helicase |
Protein accession | YP_001015687 |
Protein GI | 124026572 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCAT CAGAAAGAGA TACGTCTGAA AACGAAATCC TCCAAAACAA TTTAAATCCA GAGCAGATCT TTCCTTTTGC ATTGGATGAA TTTCAACTTA AAGCAATTGA TTCACTCAAT CAAGGGCACT CTGTAGTTGT AAGTGCCCCT ACAGGATCAG GAAAAACACT AATAGGTGAG TATGCGATTT ATAGGGCTAT CTCTCATGGT AGTAAGGTGT TTTATACAAC GCCTTTAAAA GCTTTATCTA ACCAAAAGCT AAGAGACTTT AGAAATCAAT TTGGTTCAAG CAATGTGGGT CTTTTAACAG GTGATTTAAG CCTGAATAGA GAGGCTTCGA TTCTTGTGAT GACTACTGAA ATTTTTAGAA ATATGCTTTA TGCAGCAGCA GATAGAAATG ATGACCCTTT GCTTGATATA GAAACTGTCG TTCTTGATGA ATGTCATTAC ATGAATGATG CTCATAGAGG GACAGTGTGG GAGGAATCAA TTATTCATTG TCCTAAATCA GTTCAGTTCG TTGCTCTATC GGCAACCGTT GCCAATGCAG GTCAATTAAC TGATTGGATT GAGCAAGTTC ATGGACCTAC AGATTTGATA TCTAGTGATT TAAGACCAGT TCCACTGGAA TTCAATTTTT GTAGTGCAAA AGGACTTCAT CCATTACTCA ATGATAAAGG AACTGGATTA CATCCAAACT GTAAAATTTG GCGCCCAACA AAATCACATA AGAAGAGAGG ACGCTTATCC AAGCCTACTC AACCTGAATC TCCTTCACTG GGCTTTGTGA TATCAAAGTT GGCAGAAAGA AACATGTTAC CCGCTATTTA TTTCATATTT AGTCGTCGTG GTTGTGATAA GGCTGTGAAA ACGATAGCTA GCACTTGTTT AGTGAACCAA GAAGAAAGAA AATCAATTCA AGATCGATTT GAAAAATATG TAATTTTAAA TTCAGAAGGT TTAAGAGATG ATTTACATAT ACAAGCTTTA TTTAATGGGA TCGCATCTCA TCACGCAGGA GTTCTTCCCG CCTGGAAAGA ATTGATTGAG GAATTATTTC AAGAAGGATT GATAAAGGTT GTTTTTGCAA CTGAAACCTT AGCGGCAGGA ATCAATATGC CGGCGAGAAC CACAATTATA TCTACTTTGT CGAAGAGATC AGATAATGGA CATCGTCAAT TAATGGGGAG TGAATTCTTA CAAATGGCTG GTAGGGCTGG AAGAAGAGGT CTTGACTCTA GAGGCTATGT GGTCACTTTA CAAACTCGAT TCGAAGGGGT TCGAGAAGCT GGTCAATTAG CGACAAGCCC AGCCGATCCA CTTATTAGTC AATTCACGCC TAGCTATGGC ATGGTACTGA ATCTATTACA GCGTTATGAG TTAGATAAAT CAAAAGAATT GATAGAAAGA AGCTTTAGTA GATACTTGGC GAGCTTGGAT TTAGTTGAAG AAGAAGAAGA GTTATGCAGA TTAAAAGAAG AGTTTAAAGA GTATAAAAAT TTTGCAGAAG ACATACCATG GTCAGATTTT GAAAAATATG AGAAACTAAA AAGTCATCTT AAGGAAGAAA GAAGACTATT AAAAATTCTC CAAAAACAAT CAGCAGACAC CTTATCTAAT GAATTGATCT CAGCCTTAGA ATTTGCAAAT AATGGAACCC TCATAAGTCT GAAGACATCA CAATTGCGAG GAAAAGTTAC ACCTGCAGTC ATTATTCAAA AAATACAAAA AAGAGAAAGA CATACTCAAT TGTTATGCTT AACAGATGAA AACATTTGGA TACTTATTGC TTGTAAGGAA GTTGTTAGTC TTTATGCTGA CTTGACTTGT TTAGACGTAT CTCATCTTAC GACTCCCGAA ATTAGTAGAT TAGGTGAAAT ACATCATGGA GATTTACTAA GTAATGAAAT AGCTTCAATG ATTTCAAATT TGGCTCAGAA AAATGATATG AGAACAGCTC AATATGATCT TGCTAGTGAG GTTTTATCTC AAGCTAAATT AGTTAAATCT CTGGATGATG AATTATTGAT TCAGCCAGCT CATCGATGGG GGGATAAGAA AAAATTAAAG AGACATAGAC GAAAAATGGA TGAACTGGGT ATTGAAATAC ATGAACGTGA GGAGATGCTC TATGACAGGT CCAATCGTCA CTGGGAAACT TTCTTATCGC TAATTAAAGT ACTAAACCAT TTCGGTTGCT TGGACGATTT AAACCCTACT GAGATCGGCA GGAGTATTGG ATCTCTGAGA GGTGAAAATG AGTTATGGTT AGGCTTGGTT TTGATGAGTG GACATCTTGA TGAACTAACA CCAACAGAAT TAGCTGGAGT TGTCCAGTCA ATTGCTACAG AAGTTAATCG CCCTGATCTA TGGTCTGGAT TTATTCCAAG TGCAGTTGCG GACGAAGCAT TTAATGATTT ATCAAATATT CGAAGAGAAT TGTTTAGAGT TCAAGAACGA TTTGGTATAG AAATTCCAAT CTTATGGAGT TCAGAGTTAA TGGGTTTAGT AGAAGCATGG GCTAGAGGAA GCTCTTGGAC GGATTTAATT TCAAATACTT CGTTAGATGA GGGAGATGTA GTAAGAATTC TTAGAAGGAC AAATGATTTA CTTTCACAGA TTCCCTATTG CGAGTCTGTG AGTAGACAAC TCAGAAATAA TGCTAAGGCA GCGATGAAAT TAATGGATAG ATTTCCAGTT CGTGAAGCTG AAGATATTAA TCAAGCAAAA GAAAAAAATC ATGAACTTAC TAATCCAGCG ACTGAAAGAC ATAATGGAGA TTGA
|
Protein sequence | MTSSERDTSE NEILQNNLNP EQIFPFALDE FQLKAIDSLN QGHSVVVSAP TGSGKTLIGE YAIYRAISHG SKVFYTTPLK ALSNQKLRDF RNQFGSSNVG LLTGDLSLNR EASILVMTTE IFRNMLYAAA DRNDDPLLDI ETVVLDECHY MNDAHRGTVW EESIIHCPKS VQFVALSATV ANAGQLTDWI EQVHGPTDLI SSDLRPVPLE FNFCSAKGLH PLLNDKGTGL HPNCKIWRPT KSHKKRGRLS KPTQPESPSL GFVISKLAER NMLPAIYFIF SRRGCDKAVK TIASTCLVNQ EERKSIQDRF EKYVILNSEG LRDDLHIQAL FNGIASHHAG VLPAWKELIE ELFQEGLIKV VFATETLAAG INMPARTTII STLSKRSDNG HRQLMGSEFL QMAGRAGRRG LDSRGYVVTL QTRFEGVREA GQLATSPADP LISQFTPSYG MVLNLLQRYE LDKSKELIER SFSRYLASLD LVEEEEELCR LKEEFKEYKN FAEDIPWSDF EKYEKLKSHL KEERRLLKIL QKQSADTLSN ELISALEFAN NGTLISLKTS QLRGKVTPAV IIQKIQKRER HTQLLCLTDE NIWILIACKE VVSLYADLTC LDVSHLTTPE ISRLGEIHHG DLLSNEIASM ISNLAQKNDM RTAQYDLASE VLSQAKLVKS LDDELLIQPA HRWGDKKKLK RHRRKMDELG IEIHEREEML YDRSNRHWET FLSLIKVLNH FGCLDDLNPT EIGRSIGSLR GENELWLGLV LMSGHLDELT PTELAGVVQS IATEVNRPDL WSGFIPSAVA DEAFNDLSNI RRELFRVQER FGIEIPILWS SELMGLVEAW ARGSSWTDLI SNTSLDEGDV VRILRRTNDL LSQIPYCESV SRQLRNNAKA AMKLMDRFPV REAEDINQAK EKNHELTNPA TERHNGD
|
| |