Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_20791 |
Symbol | |
ID | 4780220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1721838 |
End bp | 1722854 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085375 |
Product | putative carbohydrate kinase |
Protein accession | YP_001015899 |
Protein GI | 124026784 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGCTG GAAGTGTCAT TGCGATTGGA GAAGCTTTAA TAGATCGGCT TGGACCTCTT GGTGGAGCCC CATCTAGTGA TTTGCCAGTA ACAGATTGTT TTGGTGGCGC TCCAGCTAAT GTTGCTTGTG CCTTGAGCAG ACTAGGAGCG AAGGTCTCTT TTATTGGCTC TTTAGGAAAT GATGCTTTTG GAGAGGATTT TAATAATCTA TTAATCCAAA GGAGAATTAA TACCTCTGGA TTACAGCAAG ATACTCTTCG TCCAACAAGA GTTGTATTGG TACGTAGAGA CTCTGATGGA GAAAGATGTT TTGAAGGATT TGAGGGTGAT AAAGGCTTGG GATTTGCCGA TCAAGCCATA TCTTTGGAAC AAATTATTCG AGACTGGCCA TTGGTTGCGG AAAATGCGCA ATGGTTAGTG GCAGGAACAA TTCCTTTGGC TTCAGAAATA TCATCCAAAG CTTTTTTGTG GTGTATCGAA AATGCTATGC ATTCAGGAAT AAAGATTGCC CTTGATTTGA ATTGGCGCCC AACTTTTTGG CGAAACCAAG TTTCAAACGT CTCAGAACCC TCTGTGAAAG AAAAAAATCA AATATTGTCT ATTTTAAAAA ACGTTTCGTT AATAAAACTC GCTAAAGAGG AGGCTCAATG GTTTTTTCAT ACCTCTGATC CAACTGAAAT TTCTTCATCT CTTCCACAAA GACCATCTGT TGTAGTTACC GATGGATCAA ATCCCATTTT ATGGCGACTC AATAATCACG TTGGCAAATC ATTTGCGATT ATCCCCTCTT CTGTGGTTGA TACAACTGGA GCTGGTGATG CATTCACTGC AGGATTAATT TATAAACTCA TCTCTGTTGA ATTAGATCAA ATCAGTGAAC AAAGTGCTAA AGATATTATT CAATTTGGAA TTGCATGTGG CTCGCATGTT TGCAAGGGAG TAGGAGCGAT AGAACCACAA CCTTACTTAG ATGATATTGA TAATTTATTG TCTTTATCTA AAGGAGGAAT CAGCTGA
|
Protein sequence | MRAGSVIAIG EALIDRLGPL GGAPSSDLPV TDCFGGAPAN VACALSRLGA KVSFIGSLGN DAFGEDFNNL LIQRRINTSG LQQDTLRPTR VVLVRRDSDG ERCFEGFEGD KGLGFADQAI SLEQIIRDWP LVAENAQWLV AGTIPLASEI SSKAFLWCIE NAMHSGIKIA LDLNWRPTFW RNQVSNVSEP SVKEKNQILS ILKNVSLIKL AKEEAQWFFH TSDPTEISSS LPQRPSVVVT DGSNPILWRL NNHVGKSFAI IPSSVVDTTG AGDAFTAGLI YKLISVELDQ ISEQSAKDII QFGIACGSHV CKGVGAIEPQ PYLDDIDNLL SLSKGGIS
|
| |