Gene Information |
Locus tag | PHATRDRAFT_37761 |
Symbol | |
ID | 7202767 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 10414 |
End bp | 12051 |
Gene Length | 1638 bp |
Protein Length | 507 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181824 |
Protein GI | 219123006 |
COG category | |
COG ID | |
TIGRFAM ID | |
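
The gene length listed above is consistent with the start and end coordinates if both are treated as 1-based and inclusive. A minimal sketch of that arithmetic follows; the inclusive-coordinate convention is an assumption inferred from the numbers in this record, and the function name `gene_length` is hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above (Start bp, End bp).
length = gene_length(10414, 12051)
print(length)  # 1638
```

Because the gene is marked as spliced, this genomic span is longer than three times the protein length; the difference would be accounted for by introns.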
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAG GAGATGTACG TTGTAAAATT GGGGGAAGGG TGACTGCAAA AGCTTGTCAC GTTGTTTTTC TTGGTGAATG TGCTCGTAGA TATGGTGCAT TGAAGACTAC CAAGGTCATT GTTGGGACTG TTGTAGAGGT CACAACCACC AGGAAGCCTC AAACCAACCG TACTTCCACC TTTGTTACTG CTGACTTTGA TTTGGGGGGT GGAGCAGTGA AGTGAAGCAC GCTGAACATC CGTAGCGTCA AGGCCTTTGT ACCAGACTTG ACCACGTCAC CAAATGACGC TGATACAGCA GCTGTGGCAG CGCCCAACAA TATGCTCGAA AACATTGATG CGCTTCCAAC CGTAACAGCG GATACTGCAA CTGAGACTGA TTTGTTTCAT ATGGAAGCAG GGGTTGATCA ACATCCAATT CTCGAACAAG ACACGGAGTC ACTGGTGCAG CTACCAGTAC TTCCAATAGT GCCCAATGCT GACTTTGGTA ACAATAACTT TTCCGAGGCA GAAGCTGTCC CTGCTGCAGT AGCACATGGC ACAAAGTGGT ATGAAGATGA TGAAGCTACT CTAAATGATA CAAATGGCAG TGTGCCAATA AAAGACTTCG GTATTTCCAC GCCTGTTGGT GAAGTCTTGG GTCCAAACTC TGATATTGGC GGAAAGTACT CGAGACTGGA ATACTTCCTT CTGATGTTTC CACCCAAACA GCTCACAACT ATGTGTCAGC TAACAAATAA CGCTCTGGTG CAACAGAACA AGCACATCAT CACTACTGGC GAGCTGCTTC GCTTTTTTGG AATAGTCATT CTGACAACAA AGTTTGAGTA CACAAGCCGA TCCCAGCTGT GGTCAACAAC TGCACTTTCA AAATATATTC CTGCTCGATG CTTTGGACGG ACAGGAATGT CAAGACAGCG ATTTAACGAT ATATGGCAAT GTCTTTGCTG GAGTGAGCAG CCTCCTGAGC GGCCAGAAGG TATGAGTTCG CAGAGCTACA GATGGAAACG TGTTGATGGC TTTGTAGCCA GGTACAATGA TCACCAAAGT ACAGCTTTCA AGCCCTCTCA CATGATTTGT GTTGACGAGT CCATCTCTCG CTGGTATGGC CAAGGGGGGA ATTGGATTAA TCATGGGCTG CCTATGTATG TTGCCATAGA TCGAAAGCCA GAGAACGGTT GCGAGATCCA AAATGCGGCA TGTGGATGTT CCGGAATTAT GCTTCGGTTG AAACTGGTCA AGTCAAAGAC TGCTCGGGAA GAAGGGGATG AGGGTGGTCT AAGCGACAAT CATCTTTTAC TTGGCACAAG GATTCTCAAA GAGCTAGTTA CTCCTTGGGC ATGGACAAAC CAAGTTGTAT GTGCTGATTC CTATTTCGCT TCTGTTGGTG CTGCATTGGA GTTGAGACAA ATAGGTTTGG GATTTATTGG GGTTGTGAAG AGTGCAACAA AGCACTTTCC AATGGCTTAT CTTTCGAGAC TGGAGTTCAA TCATCGAGGA GACCGAAAAG GATTGTTGAT GAAAGACGGA CTCAATGGAA GTAGCTTGAT GGCGTTTGTA TGGATTGATC GTGATTGCCG ATACTTTATA TCAAGTGTGT CCAGTCTTGA TGCCGGCAGT CCATTTGTTC GATATTGA
|
Protein sequence | MSEGDTTKVI VGTVVEVTTT RKPQTNRTST FVTADFDLGG GAVNVKAFVP DLTTSPNDAD TAAVAAPNNM LENIDALPTV TADTATETDL FHMEAGVDQH PILEQDTESL VQLPVLPIVP NADFGNNNFS EAEAVPAAVA HGTKWYEDDE ATLNDTNGSV PIKDFGISTP VGEVLGPNSD IGGKYSRLEY FLLMFPPKQL TTMCQLTNNA LVQQNKHIIT TGELLRFFGI VILTTKFEYT SRSQLWSTTA LSKYIPARCF GRTGMSRQRF NDIWQCLCWS EQPPERPEGM SSQSYRWKRV DGFVARYNDH QSTAFKPSHM ICVDESISRW YGQGGNWINH GLPMYVAIDR KPENGCEIQN AACGCSGIML RLKLVKSKTA REEGDEGGLS DNHLLLGTRI LKELVTPWAW TNQVVCADSY FASVGAALEL RQIGLGFIGV VKSATKHFPM AYLSRLEFNH RGDRKGLLMK DGLNGSSLMA FVWIDRDCRY FISSVSSLDA GSPFVRY
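
The sequences above are displayed in space-separated blocks of ten characters. A small sketch of how such a block-formatted string might be joined back into a contiguous sequence and its GC fraction computed is given below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not expected to match the GC content reported for the full gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace separating display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence shown above.
prefix = join_blocks("ATGTCAGAAG GAGATGTACG TTGTAAAATT")
print(len(prefix))  # 30
print(gc_fraction(prefix))
```

Applying `join_blocks` to the full "Gene sequence" field and passing the result to `gc_fraction` would give a value to compare against the "GC content" field.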