Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_17071 |
Symbol | |
ID | 4781156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1390933 |
End bp | 1392987 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640084991 |
Product | hypothetical protein |
Protein accession | YP_001015527 |
Protein GI | 124026412 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.331624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAATTAC CTATTGATCA TTTTCGCTTA TTGGGGGTTA GCCCTTCTGC AAATGCTGAA GAAGTCCTCA GGGCTTTTCA GCTAAGGCTA GATCGTCCTC CCAAGCAAGG CTTTACTTAT GAAGTTTTAG CTCAGAGGTC TGAGCTTCTA AGGCTTTCTG CTGATCTTTT ATCTAATCCT GCCGAACGAC AATCTTACGA ACTCGCTCTG ATTGAGGGTT CGTCTGGACT TGAATTATCT TCAAATAGAG AAGTTGCTGG TTTACTTCTC TTGTGGGAAT CTAATGCTTC TTTCCAAGCT TTCAAGCTTG CAAAAAAAGC ATTACAACCT CCACAAGCCC CAGCCTTGGG AAGTGGTAGA GAGTCAGATT TAACATTAAT AGCAGCTTTA GCTTGTAGAG ATGCTTCTAT TGAGGAGCAA GCTTGCAGAA GATACGCTTC AGGTGCTGAT TTGCTTCAGG AAGGTATACA GTTATTGCAG AGGATGGGAA AGCTTGTTGA AGAAAGAAAA ACCCTTGAAT CCGATTTAGA GTCTTTACTT CCATACAGAA TTCTTGATTT ATTAAGTAGA GAGAAAGAAG AAGAAAAATC TCATCAGGAA GGCCTGATGT TGCTGGAAGA CTTTGTTAAT AAAAGAGGTG GACTCGAAGG AAAAAGAAAT TCAGAAAAAA TAGCAGGATT AAATCAAAAT GATTTTGAGC TGTTTTTCCT CCAAATCAGA AAATTTTTAA CTGCTAAAGA ACAGTCAAAA ATTTATGTAA ACTGGTATAG AAGAGGTTCC GAAGATGCTG GCTTTCTTGC TGCTTTTGCT TTGATTGCTT CTGGCTATTC TTATAGGAAA CCAGAACTCT TGCAAGAAGC TCGGAAATAT CTTCGAAATA TCAACATTAA TGGCTTTGAC CCTATGCCAT TAATTGGATG CCTAGATCTT TTATTAGGGG ATGTAACGCA AGCAGAGTCT CGTTTTCGAA GTAGTTCAGA TGAGAAATTA AAAGACTGGT TAGACAATTA CCCAGGCGAA ACATTGGGAG CTTTATGCGA CTACTGTAGA AATTGGTTGA AAAAAGATGT ATTAGTAGGT TTTAGCGATG TTGAGATACA AACTGTAAAT CTTGATGATT GGTTCGCTAG CCAAGAAGTT CAAGTTTATG TAGAGCAGTT GGAATCAAAA GGGGCATTAG GCATTGCAAA AGCTGGATTT TCTTTCTTAT CGTCATTGAC CCCTGAACAA CAAATTGAGA ATAATTCATC AAGAAACTTA GAGGAAAAGG CAGATTTGCC AATGCCGGGT GGGGCTTTAG GTGAGAATTT TAATGAAAGC TCTTTCAAAT CACGATTAGA TATTAAAGAA TTTTTCTTAA GATCTAATTT GGTTGAAAAG ATAGTTTCAA AATATTATTC AATATTTGAA TTGATAAAAA ACTCAGATTT CAAATCTTTC ATATTAAAAC GACCAATATA TACAAGTTCT TTAGCATTTA TTGGTTTATT TATTGTTGGA ACAAGCTTAG GAGTCCTTAC GCAAAGAAAA CCCTCGCAAA ATAATGATCT CAGCAATATC TCTAAGTCTG AATTGGTTAA ACCAGAAGAT ATTAAAAATA GAGATAATGG ATCAACTAAA ATAGAAAATA ATAAAGAAAA ATTAGATTTA AAAAAATCAA TTCCTCTTAC TTCATTAGAC CCCTCAAATC AAGAAATTAA ATCTCTTGTT GAATCATGGT TGGAAGGTAA GGCGGACATT CTGAATGGTT CGGAAAGTCA GTTTCTTTCT TCTGTTGCTA GAGCCTCTCT ATTCAATAGA GTTATCGAAC AGAGAAAGAA AGATAAACTT TTAGGACAAA GACAGATTAT TAATGCAGAT ATAACTTCAA TCAATATTGT TCAAAAATCT GACAGGAGAA TTGCAGCAGA TGTTGAATTA AACTATCAAG ATAAATTGAT TAGTTCTTCG GGTGAGATTT TATCTGAAAC GGTTATTCCT TCTTTGAAAG TTAAATATAT AATAGGTAAG AATAAAAAAA ACTGGCTAAT AGTTGACTAT ATTAGTGGAA ATTAA
|
Protein sequence | MELPIDHFRL LGVSPSANAE EVLRAFQLRL DRPPKQGFTY EVLAQRSELL RLSADLLSNP AERQSYELAL IEGSSGLELS SNREVAGLLL LWESNASFQA FKLAKKALQP PQAPALGSGR ESDLTLIAAL ACRDASIEEQ ACRRYASGAD LLQEGIQLLQ RMGKLVEERK TLESDLESLL PYRILDLLSR EKEEEKSHQE GLMLLEDFVN KRGGLEGKRN SEKIAGLNQN DFELFFLQIR KFLTAKEQSK IYVNWYRRGS EDAGFLAAFA LIASGYSYRK PELLQEARKY LRNININGFD PMPLIGCLDL LLGDVTQAES RFRSSSDEKL KDWLDNYPGE TLGALCDYCR NWLKKDVLVG FSDVEIQTVN LDDWFASQEV QVYVEQLESK GALGIAKAGF SFLSSLTPEQ QIENNSSRNL EEKADLPMPG GALGENFNES SFKSRLDIKE FFLRSNLVEK IVSKYYSIFE LIKNSDFKSF ILKRPIYTSS LAFIGLFIVG TSLGVLTQRK PSQNNDLSNI SKSELVKPED IKNRDNGSTK IENNKEKLDL KKSIPLTSLD PSNQEIKSLV ESWLEGKADI LNGSESQFLS SVARASLFNR VIEQRKKDKL LGQRQIINAD ITSINIVQKS DRRIAADVEL NYQDKLISSS GEILSETVIP SLKVKYIIGK NKKNWLIVDY ISGN
|
| |