Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_18141 |
Symbol | |
ID | 4780893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1478820 |
End bp | 1480745 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640085103 |
Product | hypothetical protein |
Protein accession | YP_001015634 |
Protein GI | 124026519 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0376333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.70207 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGATTA GTCTTGTTAA TGCTTCAACA GACTTTGGGA TAAAAAATCT TTTTAAAAAT TTAGATCTTC ATGTAAATAA AAAAGAGAGA CTTGGTTTGA TTGGTCCAAA TGGATCTGGC AAGTCAACAC TTTTGAGAGT CATTGCAGGA ATTGAACCTT TGATGGAAGG AGAAAGGAGA TGTTTATCAT CTTTGCGGAT ATCTTTAGTT GGGCAAGAAA CAAGTTACAA CAGTGAAAAA AGTATTTTGG AAGAAGTCCT TGAAGGGTGT GGAGAAAAAA GAAAATTATT ACTTAATTTC AGTCAGCTAA GTAGAAAAAT CGCTCAAAAT CCAGAAGATG AAGACCTTTT GAAAAAACTT GGTCAGGCGA GTGAACTTAT GGATGCCGCT GGGGCATGGA ATTTAGAACA ACAATGCCAA GATGTTTTAA GAAGATTAGG TATAAAAGAC TTAGATAAGC CAGTAAAAGA GCTTTCTGGT GGTTATCGCA AAAGAGTGGG ACTTGCCGCT GCGCTTGTCT CTAAACCAGA CGTCTTACTT CTTGATGAAC CTACTAACCA CCTAGATGCA TCTGCAGTGG AATGGCTTCA AAATTGGTTA GACCATTATG AGGGTGCGCT GGTCTTGATA ACTCACGATA GATATGTTCT TGATCGCATT ACCAATCGGA TGGTCGAAAT TAACAATGGA GAAACTCGCA AGTATTCGGG CAATTATCGT GAATTTCTTC AACAAAAAGT TGAACAAGAG CAATCAGAGG CATCTACACA GAAAAAGTTT CAGGGTGTTT TAAGAAAGGA ATTAGCTTGG TTAAGACAGG GCCCCAAAGC AAGAAGTACA AAACAAAAAG CACGTATTCA ACGGATTGCT GAAATGCAAG CGAAACCTAA AAGTCATGTC AAAGCTAATT TAGAAATGAA TTCATTGAGT AGAAGAATTG GAAAAATCGC AATTGAGGCT GAAGGTGTAG GGCTATCTCT CAACAATAAA GAGAATAATT TGGATCTTTT ATGTGATTTT ACTTATAGCT TTAGTCCAGA GGACCGAGTA GGAATTATTG GTCCAAATGG GAGTGGTAAA TCCACTCTTT TAGATCTAAT TTCGGGTAAA AGATTGCCTA CAAGTGGGAA AATAAAACTT GGAGAAACGG TTCATATTGG CTACTTAGAT CAACATACAA ATGACTTAAA TCAAGGGAGT GGCTTAAACC GCAAAGTTAT CGATTTTGTG GAGGAGGCTG CATTACGAAT TGATCATGGA GGGAAACAAA TTACAGCATC ACAACTCTTA GAAAAATTCC TCTTTCCACC CAGTCAACAA CATAGTCCTC TGCTAAAACT TTCAGGGGGA GAAAAAAGAA GACTTGCTCT ATGCAAAATG CTCATACAAG CTCCCAACGT ATTATTGCTT GATGAGCCTA CAAATGATTT AGATATACAA ACACTAAGTG TGCTAGAAGA TTTTCTTGAT GATTTTAAAG GTTGTGTCGT AGTCGTATCG CATGATAGAT ATTTTCTTGA TCGCACTATT GATCGAATTT TTAATTTTGA AAACGGTCAC TTGCGAAGGT ATGAAGGAAA TTACTCTCGA TTTCTTGATC ATAAAATATT AGAGGAGCGA AACAATGAAA CCAAAAAACG AGCCAAAATA GTCAATAATT CACAAAATAA GCGTGGACAA GAAATAAAAT TAGATTCTAA AAATGATTCG AGACGATTGA GTTTTAAAGA AGCTAGAGAA TTAAAAGAAT TAGATGTGAG ACTACCTATG TTAGAAAAAA AGAAAATATG TTTAGAAAAA AAGATTACTG ATAGTGATGT AGACATTAGT GAAATTAGTC ATCAATTAGC AGAACTAATT GAATCTATTC AAGAGCATGA GGATAGATGG ATTGAGCTAA GTGAGTTGTC TGAGTCAGCA AAGTAA
|
Protein sequence | MLISLVNAST DFGIKNLFKN LDLHVNKKER LGLIGPNGSG KSTLLRVIAG IEPLMEGERR CLSSLRISLV GQETSYNSEK SILEEVLEGC GEKRKLLLNF SQLSRKIAQN PEDEDLLKKL GQASELMDAA GAWNLEQQCQ DVLRRLGIKD LDKPVKELSG GYRKRVGLAA ALVSKPDVLL LDEPTNHLDA SAVEWLQNWL DHYEGALVLI THDRYVLDRI TNRMVEINNG ETRKYSGNYR EFLQQKVEQE QSEASTQKKF QGVLRKELAW LRQGPKARST KQKARIQRIA EMQAKPKSHV KANLEMNSLS RRIGKIAIEA EGVGLSLNNK ENNLDLLCDF TYSFSPEDRV GIIGPNGSGK STLLDLISGK RLPTSGKIKL GETVHIGYLD QHTNDLNQGS GLNRKVIDFV EEAALRIDHG GKQITASQLL EKFLFPPSQQ HSPLLKLSGG EKRRLALCKM LIQAPNVLLL DEPTNDLDIQ TLSVLEDFLD DFKGCVVVVS HDRYFLDRTI DRIFNFENGH LRRYEGNYSR FLDHKILEER NNETKKRAKI VNNSQNKRGQ EIKLDSKNDS RRLSFKEARE LKELDVRLPM LEKKKICLEK KITDSDVDIS EISHQLAELI ESIQEHEDRW IELSELSESA K
|
| |