Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03321 |
Symbol | |
ID | 4779110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 305207 |
End bp | 307141 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640083598 |
Product | hypothetical protein |
Protein accession | YP_001014161 |
Protein GI | 124025045 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.377617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCTTTAT TTGAGACCTC TATAGGAAAG GAGCATTTTC TCGGAAAATC AGTTATTGAT AAACACTATA ATTTTTCTGT TGGTAATTTA GAAGAAAATG ATAAAATATT ATCCTCAAGA AAATTACAAC AGAATGAAGT TACATCTTTA ACTCTTAAAA TATTTGCTGA TAAGCAATAT GATTATGATC AAAATATTTA TCTAGCAGAG GGGAATGTCA AGGCTCTAAT AAATGGTGGA ATCTTAAGAT CTGACTTATT AAGTTATGAT AAATCAACCG GGATCTTGAT CGCTGAGGGT AATATTAGAT TCAGAAAAGG AGGGCAATAT TTTAGAGCTA AAGAATTCAA GTTTGATTTA TTAAAGAAAG AAGGTAGTAT TAAAGATGCC TATGGGATAT TAGATTTAAA AAATGTATTA AATGATTTGA AAATTGATGC TATTTCAAAT CAGGCTAAAG CTCTACATAG CACTAATGAT AAAGAAATAA ATACTTACGA TGATGGAATA GAATTTTCTT TTGGAAATAT TAAATTACCA GATAATAAAA TCACAAGATC TAATAAATCT ATTGGTTTAA TTAATAACTG GAGATTTAAA TCTAATTCAA TAAGTATTGA GGAACATGGA TGGAAATCTA ACAGGATTGT TTTTACTAAT GATCCATTAG ATCCTAATCA AATTTCTTTT GAGGGTATAG ATGTTATCGC AGAAGAAGAA GAAGATGGTA GATTACTTAT TACTAGTTCT AAAACTAATT TAATTCTTGA GAACAGATCA AAAATTTTTA TAGGAAAAAG AATATTTGGA CAAAAAAAGA AAAAGAGAAG TAAATTTCAA TTGATATTAG ATGGTAAAGA TCGTGACGGA TTAGTCTTAG TAAGAAGAAG TAATTCTACG AATATTGCTG ATAATATAAA ACTTGAACTT CAACCTCAAT TTTTAATTAG TAGAGCTATT TTAGGTAAAA CTAATAGTTA TAAAAATAAC AAAGATAAAA ATATTAACTT TTCTGATTTA TTTGGTTTAA ATATAAATCT GAATGCAAGT AATCCCAATT GGAGTTTTGA TAGTTTAAAT GATTTAAGTA CATTAAATAC CTCCAGATTA TTTAATGGAT TAAGGCATTC AAGCTCTTTT AGTATACCTA TTTTAGAAGA ATCAACTTTC AATATTTTTA CTGCTTATAG ATCGAGAGCT TGGAATGGAA CAATTGGTGA GACTGAGATT AAATCTGCAT TTGGAGGCTT TATCGAAAAG TCGCAGCATT TTACGACTGG CAGTTTAAAA AATAACTTAA ATATTAGGAT AGGAACAGGT AGATATGAAA CAGAAAAATT TGAAAATAGT GAAATGATTA GCCATTGGCG CTCTAGTATC TTTTCTTCTT TAGATAGTGA ATATCAAATA TGGAAAAGTA ATAAAAAAAA TCTTTATCAA AATCATTTAA CGCCTTTATC CCCTGTTTTA CTTAACTCTG AATTAGTCTT AAGAACTAAT ATTGATTCAG CTTATTTCAA CTATTTGAAT GATAGTGATC AAGGCTTTCT CAAGCTTAGT ATTGGTCCCG AAATTAGACT AGGTAATTTA GAGAGAGATT ATTTTGATTA TACAAAACTT TCGGTTATGC CAGGCGTGAA ACTTAAATTT GGCAATAGTC CATTTAAGTT TGATAAAGCA ATAGATTTGA AAACTATAAA TATAAGTTTA ATACAACAAA TATATGGACC TTTAATGTTT GATGTTATTT CTAATTTAAA TATTGATACT ACCTCCAAAA ACTATGGAGA ATATTATGAT ACAAAGTTAG GGATCTTATG GCATAAAAGA GCATACGAAT GTGGGATTTA CTATCATCCT AATAATGATG CTGGAGGATT ATATTTTCGT ATAAATGGAT TTAAATTTGG TAATTCTACT AAAGCAGTAT TTTAG
|
Protein sequence | MSLFETSIGK EHFLGKSVID KHYNFSVGNL EENDKILSSR KLQQNEVTSL TLKIFADKQY DYDQNIYLAE GNVKALINGG ILRSDLLSYD KSTGILIAEG NIRFRKGGQY FRAKEFKFDL LKKEGSIKDA YGILDLKNVL NDLKIDAISN QAKALHSTND KEINTYDDGI EFSFGNIKLP DNKITRSNKS IGLINNWRFK SNSISIEEHG WKSNRIVFTN DPLDPNQISF EGIDVIAEEE EDGRLLITSS KTNLILENRS KIFIGKRIFG QKKKKRSKFQ LILDGKDRDG LVLVRRSNST NIADNIKLEL QPQFLISRAI LGKTNSYKNN KDKNINFSDL FGLNINLNAS NPNWSFDSLN DLSTLNTSRL FNGLRHSSSF SIPILEESTF NIFTAYRSRA WNGTIGETEI KSAFGGFIEK SQHFTTGSLK NNLNIRIGTG RYETEKFENS EMISHWRSSI FSSLDSEYQI WKSNKKNLYQ NHLTPLSPVL LNSELVLRTN IDSAYFNYLN DSDQGFLKLS IGPEIRLGNL ERDYFDYTKL SVMPGVKLKF GNSPFKFDKA IDLKTINISL IQQIYGPLMF DVISNLNIDT TSKNYGEYYD TKLGILWHKR AYECGIYYHP NNDAGGLYFR INGFKFGNST KAVF
|
| |