Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21031 |
Symbol | |
ID | 4780346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1747478 |
End bp | 1748701 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640085399 |
Product | glycosyltransferase |
Protein accession | YP_001015923 |
Protein GI | 124026808 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.428649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTT TATTTGTTCA TCAAAACTTC CCCGGACAAT TTAAGTTTTT AGCTCCCGCT TTAATTAAAA GAGGGCATTC TGTCTCAGCG TTATGCTTTG AAAAAAATCA AAAGCAATTT TCAAAAGAAA TTAATGTTTT TTACTATCAA ATTAATAGGG GTACAACAAA AGAAGCTCAT CCTTACATAA CTGATTTTGA AGCTAAGGTA ATTCGAGGTG AAGCATGCTA TTTAAAGGCA ATTGAAATTA AGAAAAAAGG GTTTTCTCCA GATGTAATTA TTGCTCATCA TGGTTGGGGG GAAAGTATGT TTCTACATCG AGTGTGGCCA AAATCTCAAA TAGCTCTTTA TTGTGAATTT TTTTATAAAG TATCTGGTGC TGATGTTGCA TTTGATCTTG AGTTTGAATC AAAAATATCA TCCGACCATA ATCGGATTCA ATTGAAAAAT ATTAATAATT ATATTCATTT TGAAAATGCT AATAAAGCCA TTAGCCCAAC GTTATGGCAA GCAAGTACAT TTCCAGAGTT TTTCCGGAAA AATATAACTG TAATACATGA TGGTATAGAT ACTAAAGCTT TAAAACCTAA CGATGACGCT AATTTAATTA TTAATGGGAG TATAAGTATT AATTCTGGTG ATGAAATAAT TACTTTTGTG AATAGAAACT TAGAACCATA TAGGGGCTAC CATATTTTTA TGCGATCACT TCCTAATATA TTAAAGGAAA GGCCAAATGT GAAAGTTTTG ATTGTTGGTG GAGATGATGT TAGTTATGGG AAAAAAGCAC CAAATGGAGG TAATTGGAAG AACATTTTTT ATGAGGAAAT TAAGTCAAAA TTATCAGACA GTGAAAGAAA AAGAATTTTT TATCTTGGCT ATATTTCCTA TGAACATTAT GTCCTATTGC TTCAAATATC ATCTGTACAC ATTTATTTAA CTTATCCATT TGTACTTAGT TGGAGTTTAC TTGAAGCGAT GAGTGTAGGC TGCGCAATTG TTGCGAGTGA TACGAAACCT TTGCAAGAAG TTATAACTGA TCAAGAGAAT GGATTGTTAT TCGATTTCTT TGATTTTAAT AGACTTTCTA ACCTAGCGAT TAAGTTGTTA GAAGACTCTT CTTTAAGAAA CAAAATTGGT CATAATGCGA GAGAATTTGC TATAAAAAAT TATGATAAAG ATCTATGCTT AAAAAAGCAA TTAGAGTGGG TTGAAAATTT TTAA
|
Protein sequence | MKILFVHQNF PGQFKFLAPA LIKRGHSVSA LCFEKNQKQF SKEINVFYYQ INRGTTKEAH PYITDFEAKV IRGEACYLKA IEIKKKGFSP DVIIAHHGWG ESMFLHRVWP KSQIALYCEF FYKVSGADVA FDLEFESKIS SDHNRIQLKN INNYIHFENA NKAISPTLWQ ASTFPEFFRK NITVIHDGID TKALKPNDDA NLIINGSISI NSGDEIITFV NRNLEPYRGY HIFMRSLPNI LKERPNVKVL IVGGDDVSYG KKAPNGGNWK NIFYEEIKSK LSDSERKRIF YLGYISYEHY VLLLQISSVH IYLTYPFVLS WSLLEAMSVG CAIVASDTKP LQEVITDQEN GLLFDFFDFN RLSNLAIKLL EDSSLRNKIG HNAREFAIKN YDKDLCLKKQ LEWVENF
|
| |