Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50146 |
Symbol | |
ID | 7198847 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 191297 |
End bp | 193118 |
Gene Length | 1822 bp |
Protein Length | 589 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184984 |
Protein GI | 219129625 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAGT CCAAAGACCA TATTTTGCTC TGTCCGTCGC CCGCTGAATC CATTCGGGAA TGTACGGTAA CTCTGGGACT CGGTTACCCG TGCGCTCCGA ACCGTAGGTT GGCGTTCCAC CATCGTCCAC GTGAATGGAT CGAATCCCGT CGTGGAACTC TCGGATCCGA AATCCAACGC CAAGGACTGG TGGGGCGGGC GAGCGGCTCG GCCGACCCGA CAGTGACAAC GGATGCTACC GAGTGCCTGT CTGACCTTGG GCGGACGTGC CCCAATCGCC TCAACCAACG CCGCTATCAC TTTCTCTTCC CGTTTGTGTG CGTGTGTGTG TGTGTGTCTC TCTCTCTCCC ACGCCCCACT GCGAACAAGA ATGCCGACGC GTACGACCGT TTTCGTCGGA CTCTGGTTGC TGGCGGCTGC CAGTCTGCCA ACGACCCACG CGTGGCAAGC GTCCCCGTGG ACTCGTTCTA CTTTGCGTTC CTCGTCGACG GCGCCACCGT CCGCTCGTTC CGTCACGACG CAACGATCCC GTCTCGCCTC AGCAGTCGCC ACTGCCGACG AACCACCGTT GCAGGGAACG GGGACCGCTT CGATTCCGCA AGAAGTCTTT AATCTCGTCA AGGGCATTGT CGGCGCCGGA GTCCTTTCGC TGCCTTCGGG GATTGCCGCC TTTGGGAACG CTCCCTCCGC CGTACTCCCC GCCGTCCTCC TCATTTCACT CATTGGCGCA CTCTCCGCCT ACGGGTTCGC CTTAATTGGA CGCGTCTGCA GTTTGACCGG CACCACCTCC TACCGGGACG CCTGGAACGA ATCCGTCTCG CCCAAAACGT CGTGGATCAC GGCTTGGTCG GTTACCTTTA TGACCATTAA CGCCACCTTG GCCTACAGCA TGATTCTGGG AGAAACATTC CAATCCCTCC TACTCACCGC CGGATACGCC TGGAGCAAAA CAAAGATTCT CGCCCTGCTC ACCACTACCG TCCTCTTGCC CCTCTGTTTA CTGAAAAACC TCAGTTCGCT GGCACCCTTT TCTCTCCTGG GTTCACTGGG AATGGTCTAC ACCGCCATCG CCATGGGAAT CCGCTACGTG ACCAAGGCCT ACGTCGGCAC GGGAAAGTTC GCCGCGGACT TGCCGCGAGC CTTGCAGCCG TCCTTTGGAG CGATTGGGGC CTCCGGGGTT TACAACGCCA AGGCCAGTAT ACTGCTCGGC ATGCTGTCGA CGGCCTACAT GGCCCACTTC AACGCTCCCA AATTCTATAC CGAACTCCGG AACAACACTG TCCCGCGGTA CGTCAAAGTC GTCGCCACCT CCTTTGGAAT TTCCATCGCC TTGTTCGCCA CCATGGCCTC CCTCGGATTC TTGACCTTTG GAGCCGCGTC CAGTGGACTC ATTCTCAACA ATTATTCCAT CAAGGACAAC TTGATGGGCC TTTCGCGGAT TGCCGTTGCC GTTAGTCTCG TCTTTTCGTA CCCCTTGGCC TTTGTCGGAG CCCGCGACGG GATTCTAGAC GTGGCCAACG TGGCCCCGGA AAAACGGTCC ACCGGATTGT TGAACGCCTT GACGGTCGGA TTGCTGTCCT TGGTTACTGG ACTGGCCTTG GTGATCCCGG ATGTCTCCTT TGTCATGGCC TTTGGGGGTT CCACCTTGGG CAACGCCTTG ATTTACATCT TCCCCGCCTT GATGTTCCGC GGGGCGGTCC GCAAACTCAA GGCACCCACC AAAGGACAGC GCCGTGAAGT CAAACTCGCC ATGACGTCGG CCTTGGTCGG ACTCGGTATG GGAGTAGTCG GTGCCGTCAA GGCTGTACAG TCGGTTCTCT AA
|
Protein sequence | MSESKDHILL CPSPAESIRE CTVTLGLGYP CAPNRRLAFH HRPREWIESR RGTLGSEIQR QGLVGRASGS ADPTVTTDAT ECLSDLGRTC PNRLNQRRYH FLFPFVMPTR TTVFVGLWLL AAASLPTTHA WQASPWTRST LRSSSTAPPS ARSVTTQRSR LASAVATADE PPLQGTGTAS IPQEVFNLVK GIVGAGVLSL PSGIAAFGNA PSAVLPAVLL ISLIGALSAY GFALIGRVCS LTGTTSYRDA WNESVSPKTS WITAWSVTFM TINATLAYSM ILGETFQSLL LTAGYAWSKT KILALLTTTV LLPLCLLKNL SSLAPFSLLG SLGMVYTAIA MGIRYVTKAY VGTGKFAADL PRALQPSFGA IGASGVYNAK ASILLGMLST AYMAHFNAPK FYTELRNNTV PRYVKVVATS FGISIALFAT MASLGFLTFG AASSGLILNN YSIKDNLMGL SRIAVAVSLV FSYPLAFVGA RDGILDVANV APEKRSTGLL NALTVGLLSL VTGLALVIPD VSFVMAFGGS TLGNALIYIF PALMFRGAVR KLKAPTKGQR REVKLAMTSA LVGLGMGVVG AVKAVQSVL
|
| |