Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47176 |
Symbol | |
ID | 7201956 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 717403 |
End bp | 720295 |
Gene Length | 2893 bp |
Protein Length | 435 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181253 |
Protein GI | 219121812 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTATCCCTC TCTCTCTCTG TGTGTGTCAA ACGCGGCAAG CCAGTGAGAG TCAGTCGACA GCGCTGATGC AACGAAAGGA CTTCCATTAT GAACGCCTCG GATCGGCCCA CGCTACTCTC CATTGAGGAG TTAAGCGTCC TCGTTTCATC CCCCGTTCGC ACGCACTTGG CTGGTGCGTA TCGTGGTCTG TATCCCGCCG CCAACGATCC AGCACAAAAG CTGCAGCACG AACAGCATCT CGAAAAGGTG CTTGCCGCCA TGAAAGTTTC GCCCCGGACG TCGACTTTTC GCGTCAATAC AATCCGCGCA ACGCGCACAG AGGTTGTGGA CAAACTGGTT CTCGCAATGA ATGAGTGGGC CGAGCGAGAA GGCCTCTGTA CGCCCAATCC CTTGCCAATC TCCGTCGCGG CTCATGCCGT GCTCGAGGAT ATCGTATCGG TTGATATTGC CCAGTCTCCG ACGACACTCT CCTCATGTCG AATACCGCCC CTCCAGGTGG AGCTCGACAC CGACTCATCG TTCGGGGACG AGGCGCGACG CGCCCGCCGT CACCGCCTGG GCTGGCCCAC CACGTGTAAG GTGATTGTTT GCGATCGCTT ATGCGGAGAA GCGGTTCTCC GCGGATCGCA CGTTTTCGTC CGCGGAATCC TCGCGGCAGA TCCCTCCATT CGAGTGGGGC AGGCCCTGGC CGTCTACGCT GACTTGCCGC AACCCCAACG ACCGTCTTGG CCTCGTGGAC TTGCCGTTGA GTCCTACACT GGAACCTGCG TCTTTGTGGG AATCGGCCAG GCGTGTTGCA CCCACGCCAG TTCTTTCGAC AGTCACAGGG ACTGGGTGTT CGCATGCGCT CCCTGCCACG AGTGGGGCCC CTCCTGCCCC CACTTTCCGG TGTACTGGAC ACGCACGCTT TTTTACAAAA CCTACCTTCC ACCGTCGTGG TACACGCACT GAATCCTCAA CCCGGCGATA CAATCCTCGA TCTCTGTGCC GCTCCAGGTG GCAAAGCCTC GCACGTGGCG AGTTTCACAA AAAATCAGGC CGTGATCTTG GCCGCTGATC GGAGTCGGTC CAAGGTCGTC GCCATGAAAC AACGCTTCCT GGCACTAGGA TGTACCAGTA TCGTACCCTT GCATCTTAAC GCAACTGCAT GCTGTATGGA CGAGCCCGGG CGGCCTCGCT GCAGCGTGGC CGAGGTACGT ACGCGCCTGT GGATGGGTGT CGGAGTAGCA GCTGTTCCTT GAGGAAGTGT GTGTGTTTCG CCACTTTCTC ATTCTGAGCA ATAAATTTGG TTTGTTAGAT CCGCGCCGCG GCCAAAGTCT CGGAGAGGGA CGGACTCTTG AACGTGAAAT ACTTTTTCCC GGGATCGTTC GATCGTATTT TGTTGGATCC ACCCTGCAGC GCTTTGGGTC TACGCCCCAA ACTGCAGATT GGACCGACCC GGGTGGACGA TCTGCTCGCT TTTGCCACAT ATCAGCGCAA ATTTGTCCCA CCGGCAGTGG CCTTGCTGAA ACCGGGTGGT ATTCTGACTT ACAGTACCTG TACCTTTCAC GTGGAGGAGA ATGAACGCAT GGTGCGGCAT ATTCTGGACG ACTATCCGGT CATGGAGTTG GTGCCCATCA CCATTGGCGT TGGTCTACCG GGTTTGCCCG GACACGGATT AACGACGGAC GAATGCCAAC GTGTGCGACG ATTCGACCCC ACCGACAGTG CCGATACTAT GGGATTTTTC ATCGCCAAGT TTCGGAAACG AGCGACTACC GCCTACTGTA ACAGTACAGC AGGATCACTC CTATAATTTT TTTAAGTGCC TAATGAATCG AACAGTTGTT ACTGTTCCCA CCAGTTAAAC CCGCCAGCGC CGCCGGCTCC TTGTCCGTAG TAACTCGATG GATCCCAACC AGTAGTCTCT TCTCCCGGAC GCACCGGAGG CTGTTGCGAC TGTGGAGCTC CCGGAGGCTG TTGCTGCTGC TGATGGTGAG CTTGTTCGGC CTGATAGCCT TGTTGCTGTG GCGCACCGTA TCCGTTGTTA TGGTAGCCTT GTTGTTGTTG GTGGTGGTGC GGCTGCTGCC CACCGTAAGT ACCGTAGGGA CCCGATGGGT GCTGCTGTTG TTGGTGTTGC TGACCAGGAT AGGCAGGAAC GCCAGGCGGT GGGGGCAAGT TGGAACCACC AGTAGGGGGC GGGGGAAGAT TGCCCGATGG TGGTGGGGGA GGCAAGTTGC TTACGGCGGC GGGCGGCGGT GGTGGCAAAA CAAAAGCCGG CGAGGCCCCA CCAGGTGGAT TGGGCGGACC ACCGTTGGCC GCTTCCGCCG CCGAGGTGGG TTTGGGGCCG TCTAATTCGG CCATGAAGGA CAAATACTCC GTATCGATTT CTTGAGCATT CTTCTGGTCC GTCGGCGGCG CCACTTTTTG TCGACAATCC CTCGTCGGGT GACTCGTGTC TCCGCAAATG GCACACTTGA CCGTCAGCGT GGGTTTGTTG ACCGAAAACC GACGGGGACA CTCAAAGGAG CGGTGTCCTT TTTCGGCACA AATCTGACAG AACTCTTCGT CCTTGAGTGT GCCGTTGAGC AGAGCCAACT CTCGCAGCTG TTGTTGCTTG TGTACGTTCT TTTCGTCGTC AATGACGACG AGCATTTGGT CGATCATCTC GGAGGCCGCA TCCACGGCCC GCTGGTCGTC TCCCGTAATG ACAACGTGTA GGGGTTCGTT GTCGCCTTCC ATGATTTTTC CGTCACGTCG TCCCCGGGCG CCATCCTTGA CGGAACCCCG GCCCCGAATA GCAATCTTGC AACCCGTCTT GGATTCGAGT TCCTTCTGCG TTTTCCCCCG CGGACCAATA ATGAGACCAA TAAAGTTGTA AGTAGGGTGT TTCTCGATAG GGATGCGAAT TTT
|
Protein sequence | MNASDRPTLL SIEELSVLVS SPVRTHLAGA YRGLYPAAND PAQKLQHEQH LEKVLAAMKV SPRTSTFRVN TIRATRTEVV DKLVLAMNEW AEREGLCTPN PLPISVAAHA VLEDIVSVDI AQSPTTLSSC RIPPLQVELD TDSSFGDEAR RARRHRLGWP TTCKVIVCDR LCGEAVLRGS HVFVRGILAA DPSIRVGQAL AVYADLPQPQ RPSWPRGLAV ESYTGTCVFV GIGQACCTHA SSFDMGPLLP PLSGVLDTHA FLQNLPSTVV VHALNPQPGD TILDLCAAPG GKASHVASFT KNQAVILAAD RSRSKVVAMK QRFLALGCTS IVPLHLNATA CCMDEPGRPR CSVAEIRAAA KVSERDGLLN VKYFFPGSFD RILLDPPCSA LGLRPKLQIG PTRVDDLLAF ATYQRKFVPP AVALLKPGGI LTYRE
|
| |