Gene Information |
Locus tag | PHATRDRAFT_35637 |
Symbol | |
ID | 7201083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 411973 |
End bp | 413880 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180229 |
Protein GI | 219118925 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCCT TAAATTCTGC GAGCGAAAAC AAAGCGCAAG CGATAGTGAT CATTACCGGG GGCGGTGGAT TCTTAGGGCA ATCGTTGGCG TCGGCCTTGC TGGAACGTCA AACGATCCAA GGTAACGGGG TTGTGCTTTC ACTGGGCTTG CTCGTTTTGG CCGACGTTGT TTTCCCCGAA ATCTTACAAC CAGTGCTTGA AACATCCAAG TGGGATAAGC TTGTCAAGCT TCAGGGAGAC ATTTCCGACC CTACCTTTGT CGATAACTTG TTCGGCCTAA TCCCTTCCGA CGCCGCCCAT GTATCGATTT TCCATCTGGG CGCCGTCATG AGCGGTGACG GGGAACGGGA CTTTGATTTA TGCATGAATG TGAATCTATA CGGCTTTTTA CATCTGATAC AAGGAGCCCG TAAGTACGTG TACGAGCGTC TCGGTTTCCC CGCCAAGTTC ATTCTGGCGT CGGCCGGAGC AACCATTGGA TCCGGCGCAC CGACCGACTA CATTGGCAAG GACGACATTA TTTCGGACGC TACACGAGCA ACACCGCACA CAACCTACGG GGCCACCAAG GCCTGTGCCG AATTACTTTT GAGCGATTAC AGTCGCCGGG GATTCGTAGA CGCCCGCGGA CTACGACTTC CTACCATTGT TGTACGGGCC GGTAAGCCCA ACGCCGCCAC GACGGGTTGT TTTTCCGGTG TTGTACGCGA ACCACTTGCT GGGGTCGATA CGACCTTGCC CATTGCCAGG GATGTACTGC ATGCCGTTAC TGGTAAACGT CACGCAATCG ACGCCATGCT GACACTCCAC AACGCAAGCC TGGAACAAAT CGAATCTGTT TTGGGCTACG ATCGGACCGT ATTCTTGCCA GCCGTAGCTC TGAGTCTGGG AGATCTCGAA GACGCCCTTT GGAAAACGGT CACACCTGAT ACGCAACACA AGTTGGGAAA GATCACGTAC CAGGTGGATG CCCATTTATC GGCCGTGGTG GCAAGTTTTC CGACCAAAAT CGATGCCCGA AGAGCTCGAC GGCTCGGCAT TCCGTCCGCG CCGGATGCGG ACACTTTGAT TCGCCAATAC GTCGCCGACT TTTCCTCGGC TATCGCCTCG GGTATTGAGC TTGTTGCTCC ACAAAGTGGC AACATTGCCG CTTTCCCCAA GGAAAGCAAA GTGGCCGTCA TTACAGGAGC CGGCAGTGGC ATTGGACAGG CCGTCGCTCA AAGATTGTCT CGAGGTGGTT GGATTGTAGT CTTGGCGGGA CGTCGCAAAA CGACGTTACG AGAAACAGCC AAGACTCTTG AAGGGCGCGC GTGTTTGTGC GTCCCGACGG ATGTGACCAT CGAGTCGGAA GTAGAAGCGC TCTTTGAAAC CGTCCACACC AACTACGGTA CGATTGATCT GTTGTTTAAC AACGCTGGTA TCAACAGCAC AGCGGCCAGT TTCGCAGACG TGGAGTTTGC CGACTTTGAG CGTGTGCTAC GTACCAACGT GTGCGGCCCG TTCTTGTGCG GCAAAGCGGC CATGAAACGC ATGGCCGCCA ATGGTGGCGG CCGAATCATC AACAACGGTA GCCTGTCGGC GCAAACGCCC CGACCCGGGT CCGCCTGCTA CACCGCCTCC AAACATGCTG TGCTGGGGCT AACAAGATGC ATGGCACTTG ATGGACGTGC GTTCAACGTG GCGTGTGGTC AGATCGATTT TGGCAACGTG GTGAGTGAAA TGAGCTTGCG TACTAACAAG GTAGGGACCG GGGCGTTGCA GCCCAACGGA ACCACTCTCG TTGAGTCTTC CATGAGTCTC 
AAGGATGCCG CCGAGACCGT CTGGAGCATG GTCAATCTAC CTCTGGAAGC CAATGTATTG CAGATGACGG TCATGGCCAC AACAATGCCG TTTGTCGGGC GTGGATGA
|
Protein sequence | MASLNSASEN KAQAIVIITG GGGFLGQSLA SALLERQTIQ GNGVVLSLGL LVLADVVFPE ILQPVLETSK WDKLVKLQGD ISDPTFVDNL FGLIPSDAAH VSIFHLGAVM SGDGERDFDL CMNVNLYGFL HLIQGARKYV YERLGFPAKF ILASAGATIG SGAPTDYIGK DDIISDATRA TPHTTYGATK ACAELLLSDY SRRGFVDARG LRLPTIVVRA GKPNAATTGC FSGVVREPLA GVDTTLPIAR DVLHAVTGKR HAIDAMLTLH NASLEQIESV LGYDRTVFLP AVALSLGDLE DALWKTVTPD TQHKLGKITY QVDAHLSAVV ASFPTKIDAR RARRLGIPSA PDADTLIRQY VADFSSAIAS GIELVAPQSG NIAAFPKESK VAVITGAGSG IGQAVAQRLS RGGWIVVLAG RRKTTLRETA KTLEGRACLC VPTDVTIESE VEALFETVHT NYGTIDLLFN NAGINSTAAS FADVEFADFE RVLRTNVCGP FLCGKAAMKR MAANGGGRII NNGSLSAQTP RPGSACYTAS KHAVLGLTRC MALDGRAFNV ACGQIDFGNV VSEMSLRTNK VGTGALQPNG TTLVESSMSL KDAAETVWSM VNLPLEANVL QMTVMATTMP FVGRG
|
| |
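
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is only an illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the gene sequence above ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 411973
END_BP = 413880
GENE_LENGTH_BP = 1908
PROTEIN_LENGTH_AA = 635


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of amino acids for an unspliced CDS.

    Assumes the CDS length includes one terminal stop codon,
    which does not code for an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the values listed in the record.
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons match protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 413880 - 411973 + 1 gives 1908 bp, and 1908 / 3 - 1 gives 635 aa, which agrees with the listed gene and protein lengths.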