Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_06561 |
Symbol | |
ID | 4780440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 601920 |
End bp | 603275 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083934 |
Product | hypothetical protein |
Protein accession | YP_001014483 |
Protein GI | 124025367 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0586116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAAATC ATTCATTAAC AAAATCTCCT CTCTCCTTGC CTTCTTTTTC GATTCCAAAA ATTAGTCTTT TTATTGGGCT AACTATCACA GGTCAATGGG TTTTGAGTGA TGTGGCCCAT ATTCCTGGGG GTGGTCTTGG ATTGCTATTA GGACTTGGTT GTATTTTTTA TTTTTTAAAA CCAGGGAAGG TTTCATTTGA TGCTCCCTCA ACTGTTCAAG GATGGGTAAG AAGATGTCAT GACGTTTTAG AGAATTTTGA GTACTTACTT GAGGATGGAG AGCAAAGTGA AAGAAAAAAA GAAAGAATAA ATTCCTTGCA AAAAATTATT GATAGAAGCG AAGATCAAAG CATTGGTTTC TTGAAAACAA AAGGCGTAAA ATTACCTGAT GAACAGCAAT TGGAAAAAGT TTTAGGAATA AATAACCAAA TAAAAGTTTC TTTTCCACCA GCTCTTCCTG TAAGAGATCG AAATTGGATT TTGCCAGATT TAATCCAAGA GCAAGATTTT ATTGTTTATT CTTTGACACT TCCAATGAGC GCAGCTGATC TTTTGTGGAT TAAAAATATC CCTACTGATC AACCAGCCTG GCTAATGGTT GCCAGTAAAG AATCTACTGA TTGGTCTGAT GAGCGAAATG CATTAGAGGC TCAATTACCA GATAGATGGA CTAACAGAGT ATTGAAATGG GATGGATCTC AAACAGAAAT GGCAACGGTT CTTTCTCCAA TCAAGAAACT TCTTGAAAAT CCAAAGAGGA ATACAGACAT TACTAAGCAA AGACTTTTGT CTCGGTTGCA TACTTCTTGG CAAAAAGATT TAGAAAAATT AAGAAGAGAA AAATTCAAGG TTATTCAAAC AAGATCTCAG TGGATAGTTG CTGGTATCGT TTTCGCCTCT CCTGTCGCCT CAACTGATTT GCTTGCAGTT GCAGTGGTTA ATGGCTTGAT GATCAAAGAA ATGTCGAAAA TATGGTCTTC CAAAATGAAG CCAGAATTAC TTGAGGCAGT CTCACGACAA CTAGCAATGG CTGCAATTGC TCAAGGAGTG GTCGAATGGA GTGGACAGTC CTTGTTGAGC TTGGCAAAGC TTGATGGCTC CTCTTGGGTT GCTGCTGGAA CAATTCAGGC CTTGAGTGCT GCTTATTTAA CAAGAGTGGT TGGGAGATCG ATGTCTGATT GGATGGCTCT CAATAATGGA GTAACTCAAC CTGATTTAGA ACTTATTAAG CAACAAGCTC CTCAACTAGT ATCAAAAGCT GCTGAGCTAG AAAGAGTTGA TTGGGTGGCT TTTTTAAAGC AATCAAAAGA ATGGATTCAG TCTCAATCTA ATAATTACAA AGTTAAATCC GTGTAA
|
Protein sequence | MENHSLTKSP LSLPSFSIPK ISLFIGLTIT GQWVLSDVAH IPGGGLGLLL GLGCIFYFLK PGKVSFDAPS TVQGWVRRCH DVLENFEYLL EDGEQSERKK ERINSLQKII DRSEDQSIGF LKTKGVKLPD EQQLEKVLGI NNQIKVSFPP ALPVRDRNWI LPDLIQEQDF IVYSLTLPMS AADLLWIKNI PTDQPAWLMV ASKESTDWSD ERNALEAQLP DRWTNRVLKW DGSQTEMATV LSPIKKLLEN PKRNTDITKQ RLLSRLHTSW QKDLEKLRRE KFKVIQTRSQ WIVAGIVFAS PVASTDLLAV AVVNGLMIKE MSKIWSSKMK PELLEAVSRQ LAMAAIAQGV VEWSGQSLLS LAKLDGSSWV AAGTIQALSA AYLTRVVGRS MSDWMALNNG VTQPDLELIK QQAPQLVSKA AELERVDWVA FLKQSKEWIQ SQSNNYKVKS V
|
| |