Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44629 |
Symbol | |
ID | 7198116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1096988 |
End bp | 1098630 |
Gene Length | 1643 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178362 |
Protein GI | 219115133 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGCAGTTGA ATCTCTGTCC AGGAAGACAA GGAGCAGCTC TTTCCACAGT GTACCGATTT GTTAAGAAAG CTAGAAGACG ATCATATCTA GTAGAGCTTC GCACCATGGA GAAGAATAGA AGAGCTACAG GATCGGTCAG CAGGAAGCGG GTGGGAGCAT TCGTCTGCTT CTCATTGCGA TGCCTTTGCA GCCCGATCGT CACTGCCATG GGACTGGAGC GCGTGGAAAC GATTCCGGAG GTTGTACCGA TGCCCACAAT TGCAGCTACC CGGTCGCAGC GCTTTCCATC CGCGGAAGAA AGGCTGAAGA TATACATGTC GAACTGGTAT GCTCCCAGCT GTCCGGGTTA TACAGAAGGT CTGGTTCGAT ACGCCTACGA TAAACCGTCT ACCGAAAAGC GATGGACCAG CTTGATCCTG CGCGAAATGG AAGACCAGGC CAATGCCACT ACTTGGCAGT TCGAAAGCAT AATAGCGCCC GACACGGCCT TCTTTCTGGA CCGCGCTACG CTTATGGACT GTCTCCGTGG AGAAACCGAA GGTTCTCCGG AGTTTGCCGA TCGAATTGAA TTCCGCCACA ATATGCACAT GTACTGCTCC GATGTTGCGG ACTCGGTCAT GACCGCTATC GATCATGTGA GTTTGGAAGC GGGCAAACAA TCAACCGTCC CAACACTCCT GCAATTCGGG GATCTTCCAC ATTCGCACGT GTTCCGCTTT CTAAATATTC CGGTCTTCAA AAAATTTCGT ACTGGGGCGA CCCGCCTTGC CGTAGCAGAT TCCGTTGCCC CAACATGCGT GGACGGGCCC CGTGCTGCGC TTGAAACCTT ACAAAAGACG ACCTACCGAC AACCTATTGT TTGGAAACTG GCCACCAATC GCCACTACCG GCTAATTCAC AAAGTTTATC GGGAAGACAT GCCTTGGGCA GGCAAAAGGA ATATGGCTGT CTTTCGGGGC CAATTGACGG GCTCTCTGAC CTACAACAAG AGAGTTTCGG ACGAAGAGAA TTGTATGAGT ATGCGGCGTT GTCGTCTTGT GTACACGCAC GCCAACAGTA CCTTGGTAAA CGCGCGACTA ACATCCACCC GCAACCGCCT ACCCAACGTT TTGAACGGCG TTTCCCTCGT GGAGGAATCA GTACCGCTTC GTAGATTACT ATCTTTTAAA GCCATTATCA TGTTGGAAGG AAACGATGTT GCATCGGGTC TCAAATGGGC GCTGCTATCC CAGTCAGTGG TATTGATGCC GCCACCGAAG GTCACATCCT GGGCACTAGA AGAATTGCTG GAACCTTGGG TGCACTACAT TCCTCTCAGT CCCGACGCGA GTGATCTTGA GGAGAAAATG GCGTGGGTGA TTGACAACGA CGATATCGCA CAGACTATCT CGGAACGAGG AACGCTGTGG ATGGAGGACT TGTGCTTTCA TCCAGATGCC AACGATGATG ATAGATGGAT TCAAGAGGAA ATGGTGCGTC GGTATCGCAG CCATTTCGTG GAAGCGAGAA AACCGTTGGA GTTGACTGCG TGAGTGACAC CTTTGTATAG CGCGCAATCC TGGTTTATCT CTACAGGGAA GTAACATTCT GCTCAGAGGT TTGCAGCACC TGAGTAAAAT AGAAATGCTT GTATATAGCT TCT
|
Protein sequence | MEKNRRATGS VSRKRVGAFV CFSLRCLCSP IVTAMGLERV ETIPEVVPMP TIAATRSQRF PSAEERLKIY MSNWYAPSCP GYTEGLVRYA YDKPSTEKRW TSLILREMED QANATTWQFE SIIAPDTAFF LDRATLMDCL RGETEGSPEF ADRIEFRHNM HMYCSDVADS VMTAIDHVSL EAGKQSTVPT LLQFGDLPHS HVFRFLNIPV FKKFRTGATR LAVADSVAPT CVDGPRAALE TLQKTTYRQP IVWKLATNRH YRLIHKVYRE DMPWAGKRNM AVFRGQLTGS LTYNKRVSDE ENCMSMRRCR LVYTHANSTL VNARLTSTRN RLPNVLNGVS LVEESVPLRR LLSFKAIIML EGNDVASGLK WALLSQSVVL MPPPKVTSWA LEELLEPWVH YIPLSPDASD LEEKMAWVID NDDIAQTISE RGTLWMEDLC FHPDANDDDR WIQEEMVRRY RSHFVEARKP LELTA
|
| |