Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08621 |
Symbol | |
ID | 4779332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 792687 |
End bp | 793934 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640084137 |
Product | glycosyltransferase-like protein |
Protein accession | YP_001014685 |
Protein GI | 124025569 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.784838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000054897 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAAAGA TTTGGATGAT AAATCAGTTT GCAAATACAC CTGATTTGCC AGGGCATACA AGAAACTACG AAGTTGCTAA ATATTTAGTT AAAAATGGAT GGAGAGTAGA TTTATTTGCA TCTGACTTTA ATCTTAGTCA GAGAAAGTAT TGCAAGTTAA AGAAATTTGA ATTATTTAAA ATCAATAAAA TTGATGGAAT AAAATGGCAT TGGTTAAGAG TATCTTCTTA TTCAATTAAT AATTGGAAAA GATATCTAAA CATATTAAGT TTTTCAATTC ATATATTTTT ATTTCTTATA TTAAAATCTA TTCTATCTTT AAAAAGAGAG CAATTACCAA ATATAATATT AGCTAGTTCT CCTCAATTAC CAGCAGCATA TTTTTGCTTA ATAGTTTCAA AAATTCTCCA TATACCATTG GTTTTAGAAA TAAGAGATCT ATGGCCACAA ATTCTTATTG ATCTAGGAGA TAAAAGCAAA GAAAACATAT TATATAGGAT ATTATATTGG ATGGAGCTTT TATTATATAA GGAATCAAAA ATTATAGTTA TTTTATCTAA AGGGTCAAAG GAACATGTAG TAAAAAAAGG AGGGAAACTT ATAAAATGGT TACCAAATGG GCCAGATCTA AGTAAATTCA AATTTTCAAG TCTACCTAAT GAAGAATTAA CTTTTTCTTT TAATAGACCC TTTAGATTGG TATATGCGGG AGCTCACAGC CAAGTTAATG GTTTGATGTA CGTTTTGAAT GCTGCTAAGT TATTGATCAA TCACCCAATT GAAATTACAT TGATAGGGGA TGGTCCAGAG AAAAATAATC TTGTTGAACA ATCAAAAAAA CTTGCTTTAA ATAATGTAAA ATTTTTAAAA CCTCATTCTA AAGATAATAT TCCAAAAATT CTATCTACAT TCGATGCAAT ATTACTTTCA CTAATAGATT CTGACCTATT TAGATATGGA ATATCTCCCA ATAAGTTATA TGATGCTTAT GCTCTTGGAA GACCAGTAAT AACTACTGTC CCAGGAATGA TAAATGATGA AGTAGAGTCA AATAAATTAG GGACTACTTC AAGAGCATGT GATTCATCTT CATTATCATT AGCAATAAAA AGATTAATGA ACACATCAAG GAGAGATAGA GAAATGATGG GGATCAGAGC TAGATCTATA GCCGAAAAAA CATATTCAAG AAATCGAATC AACAAAGAAT ATGATAAACT TCTGCGCTCA TTAATACCTA ATGAATAA
|
Protein sequence | MLKIWMINQF ANTPDLPGHT RNYEVAKYLV KNGWRVDLFA SDFNLSQRKY CKLKKFELFK INKIDGIKWH WLRVSSYSIN NWKRYLNILS FSIHIFLFLI LKSILSLKRE QLPNIILASS PQLPAAYFCL IVSKILHIPL VLEIRDLWPQ ILIDLGDKSK ENILYRILYW MELLLYKESK IIVILSKGSK EHVVKKGGKL IKWLPNGPDL SKFKFSSLPN EELTFSFNRP FRLVYAGAHS QVNGLMYVLN AAKLLINHPI EITLIGDGPE KNNLVEQSKK LALNNVKFLK PHSKDNIPKI LSTFDAILLS LIDSDLFRYG ISPNKLYDAY ALGRPVITTV PGMINDEVES NKLGTTSRAC DSSSLSLAIK RLMNTSRRDR EMMGIRARSI AEKTYSRNRI NKEYDKLLRS LIPNE
|
| |