Gene Information |
Locus tag | NATL1_05891 |
Symbol | |
ID | 4779924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 535445 |
End bp | 536485 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640083866 |
Product | dehydrogenase |
Protein accession | YP_001014416 |
Protein GI | 124025300 |
COG category | [R] General function prediction only |
COG ID | [COG5322] Predicted dehydrogenase |
TIGRFAM ID | |
|
|
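The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the Start and End coordinates are 1-based and inclusive (an assumption, though it is consistent with the values listed) and that the protein length excludes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(535445, 536485)
print(length)                  # 1041, matching the Gene Length field
print(protein_length(length))  # 346, matching the Protein Length field
```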
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0334693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGGTC TAATTGGACA TTCAACAAGT TTTGAAGATG CCAAGCGAAA GGCATTAGGC TTGGGGTATG ACCATATCGC TGAAGGGGAT TTGGACGTGT GGTGTACTGC TCCTCCTCAG TTGGTTGAGA ATGTGAAAGT TGTGAGTGCG ATCGGAAAGA CTATTGAGGG AGCATATATA GACTCATGCT TTGTTCCAGA GATGCTAAGT CGCTTCAAAA CTGCAAGAAG AAAGGTTTTG AATGCAATGG AATTGGCGCA AAAGAAAGGG ATCAGTATCA CTGCTCTAGG TGGCTTTACA TCAATAATTT TTGAGAATTT TAATTTGCTC CAAAATCAAC AAGTTAGAAA TACGACTCTT GATTGGCAAA GGTTTACCAC TGGTAATACT CATACAGCTT GGGTGATATG TAGACAGTTA GAGCAAAATG CACCTCGAAT AGGAATTGAT TTGAGTAAAT CAAAAGTAGC TGTTGTTGGC GCTACTGGTG ATATTGGCAG CGCTGTTTGT AGGTGGCTTT CCAATCGAAC TGGTGTTTCT GAACTCCTTT TAGTAGCCAG GCAGCAAAAG CCTTTACTTG AACTTCAATC TCAACTTGGA GGAGGAAGAA TTCTTAGTCT TGATGATGCT TTACCAGAAG CAGATATTGT TATTTGGGTT GCAAGCATGC CTAAAACCTT AGAAATCGAC CCATCTAAAA TTAAAAGACC TTGTTTAATG ATTGATGGTG GATACCCTAA AAATTTGGGC GAGAAGTTTT CAGGACCTGG AATACACGTT TTAAAAGGAG GAATAGTTCA ATTTTTCAAA GATATTGGTT GGAGCATGAT GGAATTGGCT GAGATGGAAA ATCCTAAAAG AGAAATGTTT GCTTGTTTTG CTGAAGCAAT GCTTCTTGAA TTCGAGAATT GTCATACCAA CTTCAGTTGG GGAAGAAATA ATATAACGTT GGAAAAGATG GATTTCATCG GTAAGGCTTC TGAAAGGCAC GGGTTTTCGG CTGTTGGTTT GAAATCAAAT ATTCAGACAT TAACCGTCTG A
|
Protein sequence | MFGLIGHSTS FEDAKRKALG LGYDHIAEGD LDVWCTAPPQ LVENVKVVSA IGKTIEGAYI DSCFVPEMLS RFKTARRKVL NAMELAQKKG ISITALGGFT SIIFENFNLL QNQQVRNTTL DWQRFTTGNT HTAWVICRQL EQNAPRIGID LSKSKVAVVG ATGDIGSAVC RWLSNRTGVS ELLLVARQQK PLLELQSQLG GGRILSLDDA LPEADIVIWV ASMPKTLEID PSKIKRPCLM IDGGYPKNLG EKFSGPGIHV LKGGIVQFFK DIGWSMMELA EMENPKREMF ACFAEAMLLE FENCHTNFSW GRNNITLEKM DFIGKASERH GFSAVGLKSN IQTLTV
|
| |
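The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below, in plain Python with no external libraries, strips the spacing, computes a GC fraction, and translates a coding sequence. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in which start codons are permitted, which this sketch does not model). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = clean_sequence("ATGTTTGGTC TAATTGGACA TTCAACAAGT")
print(translate(prefix))  # MFGLIGHSTS, the first block of the protein sequence
```

Passing the full Gene sequence field through `clean_sequence` and `translate` should reproduce the Protein sequence field, and `gc_fraction` on the cleaned gene sequence can be compared with the GC content field.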