Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40679 |
Symbol | |
ID | 7198498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 130387 |
End bp | 131517 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184652 |
Protein GI | 219128927 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTTG GCACATACCG ATGGTTCCCG TCCGCGGGTC CTAACAATCG ACTACGTATG GACAAACTAC GCGCGCCGGT GGGTCGTTCA TCAAGCACAT CCCGGACGGC CACGAAGCGC GGTTCCGTAA TTCCACCAAC TTCCGCTGAC ACCACCAGCA TCTTGTCCCC ATCGAGCACC TCGTTTGCGG TATCGCTCGG CGGTCGGGCC AAGGAAGAAC TCCGCCGAAA TTCGATAGCC TACACTTCGC GTCTCATGTC GGGGTCCGAT CTTACTTTGG CAGGAGCGGA AGCCTCGGCT CTTCTGGTAA GCTGTGGATA CTGGGACCAT GGCGTTCGCG TTCACGGTTT GGACAACAAT CTTAGAGTGT TGGCCACCGA GGCCGGTGGT CACCGTGGTC CCATACTATG CTTGGCCGTT GCTCAGGATG ATGCACTCAT GGTGACGGGA GGTGAGGATT GTACGTGTCG CGTATGGGTG GTTGACCATT CCGACCTGGC CGTGGCACTG TCTGACGGAT ACGTACAAAC CGCACTGGGA TCTGCCAATA CTGGCGAAAG TGTTTTAAGT TGCTGTCACG TCCTGTGGGG CCACGAAACG CCCATCACTT GTGTGGCTTT AGATTCTTCC CTAGACGTGG TGATTTCGGG TAGTAGAGAA GGCAAGATTT GCGTGCATAC GTTGCGTCGG GGTGAATTCG TCCGTTTCTT CACGCCCCCC GTATCCGGCG GCACCCCGCC GGCCATCGCA CGCGTGGCGC TGCACCCCAC AGGAACCGTC GTGGTACACG CGCGGGACCA GAGTTTGCAC GCCTTTAGCG TCAACGGCGT GCGATTGGCG AGCGTCAACG CTGGCGAAGA ACTGTACGAC CTGCAATTCT GTAACGAATT TGTCGTGACG GGCGGGACGC GGGGTCAAGT GTGTGTGCGA TCTTTGAGCG ACCTCCAAAT CCAATCCGTG GTGGACTTGT CTCGGCACGG ACCCGTACAC TGTTTGGCTT TGACGAATCC CGAACTCAAC CCGATTCCTC AATTCCTTTT CGTTGGCAGT GCCGACGGAA TGTTAACCAT CGTGGACGTG GATCCAACTC AAGAGCAACA ACACGTATCC GACGCAGTGG TGACTTTATA A
|
Protein sequence | MSVGTYRWFP SAGPNNRLRM DKLRAPVGRS SSTSRTATKR GSVIPPTSAD TTSILSPSST SFAVSLGGRA KEELRRNSIA YTSRLMSGSD LTLAGAEASA LLVSCGYWDH GVRVHGLDNN LRVLATEAGG HRGPILCLAV AQDDALMVTG GEDCTCRVWV VDHSDLAVAL SDGYVQTALG SANTGESVLS CCHVLWGHET PITCVALDSS LDVVISGSRE GKICVHTLRR GEFVRFFTPP VSGGTPPAIA RVALHPTGTV VVHARDQSLH AFSVNGVRLA SVNAGEELYD LQFCNEFVVT GGTRGQVCVR SLSDLQIQSV VDLSRHGPVH CLALTNPELN PIPQFLFVGS ADGMLTIVDV DPTQEQQHVS DAVVTL
|
| |