Gene Information |
Locus tag | NATL1_11771 |
Symbol | |
ID | 4780258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1043077 |
End bp | 1043925 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640084456 |
Product | Rossmann fold nucleotide-binding protein |
Protein accession | YP_001015000 |
Protein GI | 124025884 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
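
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the 849 bp listed) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the record above.
START_BP = 1043077
END_BP = 1043925


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the coordinates listed here, this reproduces the Gene Length and Protein Length fields.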
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0454643 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCAC AAAATAGGAG TGACGATCTA GATCTGGTCA GCAAAAACCT TGAATTAATT ATTAGTTCGA GTAACTACCA ATTAGCCCAT GAAGATAGAA AGTTACTCAA TAGTGATGAA ATGAGAGGCG TGCGAATGCT TCTAGAAATA AATAAACCAG AAAAAATCTT GGAAGAGCAG AATATTTTAT CAACAATCAT CGTCTTTGGA GGGGCAAGTT TATCTGATAA AAGCTCTATA GATCGCAGAA TTGAACTAGT TAAGAACTCT CTCACGAAAG ATCCAAGCTC ATCAAATTTA GATAGAGAAT TAACACGACT AAAGAATTTA CAGTCAATCT CTCATTACTA CGACTCTGCT AGAGAATTCG CGAAGATTGT CTCCAGACAA AACCAAAAAG AGCATTGCAA TTCACATGTA ATTGTCACTG GTGGTGGTCC AGGGATTATG GAGGCTGCCA ATCGAGGTGC TTTTGATGCG GACTGTAAAT CAATAGGATT AAACATAAGT CTCCCAAACG AGCAACATCC AAATGCATAT ATCACTCCTG GACTTTGCTT CAAATTTAAT TATTTCGCCT TACGAAAATT TCATTTTGTG ATGCGATCAG TTGCAGCTGT CTTTTTCCCT GGGGGATTTG GAACATTTGA TGAACTCTTC GAATTACTCA CTCTTCGTCA AACAGGAATG AAAACAGAAA TTCCAATTAT TCTTTTTGGT CGAGATTATT GGTCGAAAGT GATCAACTTT CAATTCCTTT CAGATCACGG ACTTATCTCA GATGAACACA TGAAACTCTT TCAATACGCC GATAGTGCTT CAGAAGCATG GGACATAATC AAACAATAG
|
Protein sequence | MSSQNRSDDL DLVSKNLELI ISSSNYQLAH EDRKLLNSDE MRGVRMLLEI NKPEKILEEQ NILSTIIVFG GASLSDKSSI DRRIELVKNS LTKDPSSSNL DRELTRLKNL QSISHYYDSA REFAKIVSRQ NQKEHCNSHV IVTGGGPGIM EAANRGAFDA DCKSIGLNIS LPNEQHPNAY ITPGLCFKFN YFALRKFHFV MRSVAAVFFP GGFGTFDELF ELLTLRQTGM KTEIPIILFG RDYWSKVINF QFLSDHGLIS DEHMKLFQYA DSASEAWDII KQ
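
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, and the CDS check is deliberately simplified (translation table 11 also permits alternative start codons, which this check ignores).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the display spacing (blocks of ten) and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence, used only as a short demo.
fragment = clean_sequence("ATGAGTTCAC AAAATAGGAG")
print(len(fragment), gc_percent(fragment))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` and `looks_like_complete_cds` gives a quick consistency check against the GC content and Gene Length fields.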
|
| |