Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41002 |
Symbol | |
ID | 7198923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 107494 |
End bp | 108921 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185051 |
Protein GI | 219129764 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGG ACCGTTCCGA CCTTTGGATC GGCCTTGTAG CAGGATCTCT GAGCACTGTG ATAGCCACAT GGTGTCTCCA GCGATATCAA TCGACGAAAA ATCCGCAGAC GTTGTACTCA CCGACACTTA ACCAGACGCC AACGGCATCC ACCCTCTTAC CCGACGACAT TCGCGACGAG CAACTCTCCC GGCATTTGCT TTACTTTGGC GAGGATGGCA TGGACCGACT GAAACGTTGC AAGATTTGTG TCGTTGGACT GGGTGGAGTG GGAAGTCACA CTGCTCATAT GTTGGCCCGC GCCGGGGTGG GGTACCTCCG TCTCATTGAT TTTGACCAAG TCACCTTATC CAGTCTCAAT CGACACGCCT GCGCCGTCCT CGCTGACGTT GGCACTCCCA AAGCAACCTG TCTAGCGAAG TTTTGTCGCC GCATTTGTCC CGATCCGACG AAACTGGTTC TCGACACACG TGTGGAAATG TACACCGCCG ACACCGGCGC CGCGTTGCTG TCTCTGCCAG ACGGCGAGCA CTGGGATTTG GTCGTGGACG CCATTGATGA CGTACCGACC AAGGCGGTGC TTCTGGCTCG TTGCTGCCAA ACCCAAACAC GCGTAGTCTC TTGTATGGGG GCCGGAGGCA AAGCCGACGT TACGCGCTTG CACGTGTCCG ATTTGCGCAC GGCATCCCGC GATCCTCTGG CCACCAAGCT ACGGCAACAT CTCAAAAAAT ACATGGCGGA CCACAGCGAC GACCAAAAAA GTGACTACCT CGATAATATG GACAAAATAT CCATCGTGTA CAGTACCGAA AAGCCGGTGG TCAAGTTGGC GGATTTTACC GCCGAACAAA AAGAAGCCGG CGTGCACCAA TTTGGAGCCG TCGACGGGAT GCGAATCCGA GTGATTCCCG TGCTCGGTAC CATGCCTGCC ATTATGGGGC AGGCATTGGC GGCCATGGTC CTAACGCAAG TTGGTAACAA ACCCTTTCAA CCCGTGACGG GAGAACGAGT GGGAAAAAAT GTACGCAACA AATTGTTTCA GCATTTGCAA ACACGGGAAG ACCGCATCCA AAAGCGAGTA CTGCAAAATA CCACACGCGA CGACGTAGCA ACCATCGCTA CAACCGGTGG TACCGTTGTC GACAGTGTCT GGATCGGCCC GTTGCAGATC GACCGGGACG ACGTGGAATA CTTGAACGAA ATATGGCGGA ATCGGTGCGG CGTCACCAAC GCTCGCTTGG GCACCACGCT GGAGCTCGTC CGCTGGAATA ATGCAAAACC TTCACGATGT GACAATCTAG TGCTCATGTG CACCGCCGCG ATCCAAGCTT TTGATAAACC AGGGGGAAAG GAGAAAATTC CCGCCTACGT CGTTCAACGC ATCGAAGAGC GGTTGGCAAC CTGCCAAAAT GATAGATTAG CCTACTAA
|
Protein sequence | MKKDRSDLWI GLVAGSLSTV IATWCLQRYQ STKNPQTLYS PTLNQTPTAS TLLPDDIRDE QLSRHLLYFG EDGMDRLKRC KICVVGLGGV GSHTAHMLAR AGVGYLRLID FDQVTLSSLN RHACAVLADV GTPKATCLAK FCRRICPDPT KLVLDTRVEM YTADTGAALL SLPDGEHWDL VVDAIDDVPT KAVLLARCCQ TQTRVVSCMG AGGKADVTRL HVSDLRTASR DPLATKLRQH LKKYMADHSD DQKSDYLDNM DKISIVYSTE KPVVKLADFT AEQKEAGVHQ FGAVDGMRIR VIPVLGTMPA IMGQALAAMV LTQVGNKPFQ PVTGERVGKN VRNKLFQHLQ TREDRIQKRV LQNTTRDDVA TIATTGGTVV DSVWIGPLQI DRDDVEYLNE IWRNRCGVTN ARLGTTLELV RWNNAKPSRC DNLVLMCTAA IQAFDKPGGK EKIPAYVVQR IEERLATCQN DRLAY
|
| |