Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21281 |
Symbol | |
ID | 4780909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1786081 |
End bp | 1787397 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640085425 |
Product | glycosyl transferase family protein |
Protein accession | YP_001015948 |
Protein GI | 124026833 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.133481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTAG CCAAAATCAG TGGAGAAAAC CGACGCATTA AATCAATATT GTTTTTGATT TGCTGCGCTT TGGCAGGAAT TTTTCCTCAT CTGATGGCGC CTAGCCAGAA CTTATTCCCT TCAATTACTT TGGCTGTCTT GCTGGGTGGA TATGGATTAA GAGTTGTTTT ACGAGATCGC AGGGACAATT ATCAATTGCA AAACGAGTAT CTAATTAAGG GAGAAAAATT TGTTGAGACT CTTCCCAAAG TAGATGTGTT GGTGGCAGCA CGTGATGAAG AAAATGTAAT AGAGCGATTA GTTTCACGAC TTTTATCAAT TGAATATCCA GAGGAGAAGA TTTCTTTATG GATAATTGAT GATGGAAGTC AAGATCGAAC ACCTGATTTA TTAAAAAAAT TAGGTACAAG CTTTTCTAGG ATTAATGTTT TAAGTCGCCC TTTGATGAGT GGTGGTGGAA AATCAGGAGC TCTGAATTCT GTATTGAATA AAACTGATGG AGAGTGGTTA TTTATTCTTG ATGCTGATGC ACAACTACAG AGCAATGTAT TACTTAGAGC AATAGCTCTT GCTTTGCATG GTGGATGGTC TGCGGTGCAG TTAAGAAAAT CAGTTGTAAA TTGTGAGTTG AATCAAATTG CTTCATATCA AGCAATGGAG ATGGCAATGG ATGCCGTTAT TCAAAGGGGC AGACTTGCTA GTGGGGGTGT TTCGGAGTTA AGAGGTAATG GACAATTAAT TAATAGAAAA GTTCTTGAAT GTTGTGGAGG CTTTAATGAA CAAACTATCA CTGACGATTT AGATTTAAGT TTCCGGTTTT TATTAACACG ATCGCCAATA GTAATAATGT GGGATCCACC TATTCAAGAG GAAGCAGTGG AATCATTGTC GGCTTTATTC CGACAAAGAA AAAGATGGGC CGAGGGAGGT TTGCAAAGAT TTTTTGATTA CTGGCCCTTA TTGATCTCAA ATCGACTTTC AAAATTTAAA AAAATAGATT TATCTTGCTT TTTCTTATTG CAATACGTTT TACCGGTTGT ATCATTTATA GATTTTATTG TATCAATTAT TTTATTTGAA ACGCCACTTT ATTGGCCACT ATCAATAGTT GCATTTGGTA TTTCTAGTTT AGCTTTTTGG AAAGGATGTT CTCAAAATAG CGAAGGTCCA AGATTACCAT CACCTAATTT TATTAATATA CTTGGAGCAA CAATTTACCT TGCACATTGG TTTATAGTTA TACCCTTTAT AGCAGTTAAG ATGTCATTAT TTCGTAAGAC GTTGATTTGG GAAAAAACCG ATCATATTGG TGCTTGA
|
Protein sequence | MALAKISGEN RRIKSILFLI CCALAGIFPH LMAPSQNLFP SITLAVLLGG YGLRVVLRDR RDNYQLQNEY LIKGEKFVET LPKVDVLVAA RDEENVIERL VSRLLSIEYP EEKISLWIID DGSQDRTPDL LKKLGTSFSR INVLSRPLMS GGGKSGALNS VLNKTDGEWL FILDADAQLQ SNVLLRAIAL ALHGGWSAVQ LRKSVVNCEL NQIASYQAME MAMDAVIQRG RLASGGVSEL RGNGQLINRK VLECCGGFNE QTITDDLDLS FRFLLTRSPI VIMWDPPIQE EAVESLSALF RQRKRWAEGG LQRFFDYWPL LISNRLSKFK KIDLSCFFLL QYVLPVVSFI DFIVSIILFE TPLYWPLSIV AFGISSLAFW KGCSQNSEGP RLPSPNFINI LGATIYLAHW FIVIPFIAVK MSLFRKTLIW EKTDHIGA
|
| |