Gene Information |
Locus tag | PHATRDRAFT_42714 |
Symbol | |
ID | 7196115 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 873885 |
End bp | 875829 |
Gene Length | 1945 bp |
Protein Length | 580 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176672 |
Protein GI | 219109838 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGG CGCTGACAGA CGCTGCTGAC TTGGATGAGA ACGAAGAGGC GTCTTCAACA ACCGTGATGC CTCTTGCTGC TGACGTCCAC AGTGTCTCTG GGCCTGTGGC CCCTCCTCCT ACGCCAGTGA CTACAATACA ACAGCCACCA TCAACTGTTT TACCATTACA CCAAGGATCA AAATCAGACA CAATTAACTT TAGCAACTTC AACCACCCAG CGTTGGATGG CAGCCTATCG CCACAAAACC CCGCAATGGG TGTGTCAGGA GACAATCACA TCAATTTCCC CGAGGCAGTC TCTTCTACCA CTGCGTCGTC AGATACAGGA GACCAACAAG CACAACTGAG AGCCATGTAT CTAGCTGGCT TTCGTGCAGC TCAGGTGCAT AACGATCGTT TATCCTTAAA GGACAACTTT GAAATAGCCA AGCATGACTC ACAACCAGGG ACTCTAACCA TGGAAGGAAC AAACCCTCTA GCAGCGCCTG CGATTAATGC CGGCACGTTT CTCATGCCAG TAGCGACTGG AGAAGCAGCA GGATTGGTTG CAGCGAGCCC CACGTCATCG AATTTTCCAC CGGGCAATGG GAGCACCATG CTGACCCGCA GGCATAGCGA TCTTCCAGAC TCAGGGGTTG CGACTCGACG CATCACAAGA ACGGCCTCGT CAACGAGCTC CATGGCAGCT TCCCCCGCCC TATCGGCAAC TGCCTCGCCC AGTGGAGGGG GAAGTTCGGG CTCGAATCCG TTCCCGCGTA AGCTAATGGA CATGCTGCGC AAAGAAGATT CATCCGTTGT TGCATGGCTC CCTAGTGGTG ATTCCTTCTC GGTACGAGAC TCGGACCGTT TTGTGGCGGA TATTCTACCC AGATACTTTC GGCATACCAA ACTTACTTCG TTTCAGCGTC AACTAAATTT ATACGGGTTT CGACGAATGA CAAAGGGTCC CGACGCCGGT GCATATCGTC ACGACATGTT CAGGCGAGAC GATCCCGATC TGTGCCTACA GATGAAGCGA ACCAAGCAAA AGGGATCAGC GTCTCCTCAA TTGAGACCGA ACGGACGAGG TGGTTCTAGC TCGGTTACGT CGTCACCTCT TATGACTCCC GATCAAAGCC CTAGTCTATA TGCTTTGGAT CCCGATGCTC TCAGCCGGAG TGCGCCCTCT ATACTATCTG CATCCGTGAT GGGACAGTAA GTAACATTTG ATGGATTGAA ATGTGGGAAA TGTTGATTGA ATCTCACAGT TTTTTCTCTC GCAAGCCCGA ATGAGCCTCC TCCGTTCAGC CTCAACCCTC CCAGTGAGCA CAGACGAGCT GATTTTCGGA GTAATCCACC TGGCCATCCA GGGATTAACA TGGCGCAGAC AGGACTATCG ATCCTTATGG GAGATAATAG TGTCCAACAT CAGTCTTCGT CTTCAGTTCC ACAAGGAAAG TCACTGGGGA AATTGACTGC CGAGCAGCTA ACTCAGTATC AAGCCGATCT GATAGACAGG GAACGGCAAG CCAGCGCCTT AGCAGCCGCA GGGATGGTGG CGGAAAGTGT CAATAAGACT CAGGCCACCC ACGGTCACAG CATCGCGCAG GGCCTCGCAG CCCCGCCACA ACTGTCTCAT GCCACTGCGA CTCCGACCCA GACCGCAAAT ATATCAGAGC TCGACAGCAT AAACTGGAAT TTGATGGACA TAGGGGCGAT GCATCTTGAC GATATGGACA TGGATTTTGC TTCCCTTTTT GACCCCGCTA ACGAAGCGGC AAGTATGGAA ACGGAAGGCA GCGGATGGCC AAATGTAGGA 
AAGTCTGCCG CTTCTACCTC CAGCGATCCA AAGTAACTCA GGACACTTCT ACAGCGTCCG CTTAGACGGT CAACAATTTT GAAAGGTATC TATACGACTT TATATTACTG ATAGCTCATA ACTCTTCAGG GGCACTAGTC AAAGT
|
Protein sequence | MEKALTDAAD LDENEEASST TVMPLAADVH SVSGPVAPPP TPVTTIQQPP STVLPLHQGS KSDTINFSNF NHPALDGSLS PQNPAMGVSG DNHINFPEAV SSTTASSDTG DQQAQLRAMY LAGFRAAQVH NDRLSLKDNF EIAKHDSQPG TLTMEGTNPL AAPAINAGTF LMPVATGEAA GLVAASPTSS NFPPGNGSTM LTRRHSDLPD SGVATRRITR TASSTSSMAA SPALSATASP SGGGSSGSNP FPRKLMDMLR KEDSSVVAWL PSGDSFSVRD SDRFVADILP RYFRHTKLTS FQRQLNLYGF RRMTKGPDAG AYRHDMFRRD DPDLCLQMKR TKQKGSASPQ LRPNGRGGSS SVTSSPLMTP DQSPSLYALD PDALSRSAPS ILSASVMGHL NPPSEHRRAD FRSNPPGHPG INMAQTGLSI LMGDNSVQHQ SSSSVPQGKS LGKLTAEQLT QYQADLIDRE RQASALAAAG MVAESVNKTQ ATHGHSIAQG LAAPPQLSHA TATPTQTANI SELDSINWNL MDIGAMHLDD MDMDFASLFD PANEAASMET EGSGWPNVGK SAASTSSDPK
|
| |
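The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length of 1945 bp. The helper names `span_length` and `min_coding_length` are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(873885, 875829)   # reported Gene Length: 1945 bp
cds_len = min_coding_length(580)         # reported Protein Length: 580 aa

# Bases in the gene span that do not encode the protein or its stop codon.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)
```

A positive difference between the gene span and the coding length fits the record's "Is gene spliced: Yes" flag, since the span can include non-coding bases. The arithmetic alone does not say how those bases are divided between introns and untranslated sequence.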