Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47923 |
Symbol | |
ID | 7203178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 436475 |
End bp | 438196 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182394 |
Protein GI | 219124193 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCTCA ACAACATTCT TCCGCTCCCC CCTCATCATT TAAACCCGGA AAAATATAAA CGTAAATCGT ATATTCTAGG TCGCAGGGTA TTTTTGTGCG TCGGCCTAGT GGCTGCGGCT CTGGGGTACT CTTTGCAGGC CTCGTCGGTA CGGCTACAGG CTACTGCAGT GCTTGACGCG TTGGAGGCCG TCGATCAGGT TTCTTCGTGG GACTTAACCG CCAACGCAAA GTCACGAGAT CAAAACGACG ATACCACTTG CATATTCTCC GGATCGCCCA TATACCGCAA AGTATTCGTG TACCCTTCGC CCGGTGACAA TGATTGGCAA GGTGATATTC TCTCCTCACA AGGACGAGGT TTTTCGATGC CGTGGCCTTG GCAGCTTGTT GATAACCGTA CCAGAATGTC GGAAGAGAGT CACTATCATC CGTTCAGCAT GCATGCTCAA TTCAGCACGG AGCTTCTTGT CAGAGAGATT TTAACTCATC CCGACTCGTG TTTGCGAACT TACGATCCGG AACAAGCCAG TCTTTTTTAC GTTCCGTACT TGCCATCAAT GGAGTTCCAT GCCGGAGCTC GGGGACGGCC GCCATCCTTT AAAACTTCAA AATATGCAAA CGCCATTCTA CGTGCATTGG AAGGCGACTA TCAACCTTGG ACAGATCACT TCGGTCTCAC ACCAAAGTAT TGGCAACGGA GAAATGGATC GGACCATATA CTGGTTTTTT CCGAGCCTCT CCAAGGTCTT ACTCATCCAA AGAAGAAACG CGGCAACTAC CATTTTGTAC ATACACAAAA GCAGCTTGCA CCTCCAATAG TCGTTTCAGT GGAGCTAAGC ACGACATTTG TGAACATGTA CCCATCTTGC GCACAAAAGA ACATTCTCAT GCCATACCCG ATAACCGACG GTCGCTATTT TAACGGAGAT CTTGACAAAG AGGCTCGTTG GGCGATCCAG AACCGATCGT TAGACAGCAT AGACTCGAAA AGTTCACCCG TACTCGTCGC TGAGAAAGAC CCGGTCGGAA CACTTGCGGA TGCTCGCCCA ATTGCGCAAT GGTACCGAGC AGGTGTACAC GGAGAATGTG TTCCTTTACG CGCTGCGCTA CAGCAAAACT ACAAGTGTAC ACCATCATTC CCTTCTTTTA AGCGCACTCC TACAACGTAC CCGCTGGGTA TGCGCATGGC GACGTTCTGC CCTTGTCCAG GAGGCGACAC TGCCAGTGCC AAACGAATGT TTGATGCGGT TCTTGCAGGC TGTATACCAA TCATTTTGTC TCATGATTTT GTTTGGCCAC TGTCAGATGA GTTCGAACCA GAGATGTTGA TCAAGGTCTC TGATTTCGCT TTGCGCTGGA ATGCTTCAAA TTTCGTCGTA CGCAAATTTG ATAATCAGTG TCGTCCTAGT GTTGCCAATA CAAATTACGC ACTTCCCAGC GTCCAAGAGT TGTTGGAAGC AATACCGGCC TCCGAGATAC GGCGTCTTCG TCGTGGCTTA CGGCATGCTC AACAAGCTTA CTCCTATTAC AAGCCCCGCA AAGGCTTTCC TCGGAATCCT CTGCGGGATC GAGTTTTGCC GGATGGAGGA GCGTCTCAAG CTCTGGTTGC CGCTTTGGCG AAACGTGCCG GTGGTGTTCG ATGGCACGCT TGTCAGAAAG AACTTGGACA GCTGGTCGAA GCGGAGGGTA AAGACGCGGA ACCCGATCGA TTTAAATGCT AG
|
Protein sequence | MTLNNILPLP PHHLNPEKYK RKSYILGRRV FLCVGLVAAA LGYSLQASSV RLQATAVLDA LEAVDQVSSW DLTANAKSRD QNDDTTCIFS GSPIYRKVFV YPSPGDNDWQ GDILSSQGRG FSMPWPWQLV DNRTRMSEES HYHPFSMHAQ FSTELLVREI LTHPDSCLRT YDPEQASLFY VPYLPSMEFH AGARGRPPSF KTSKYANAIL RALEGDYQPW TDHFGLTPKY WQRRNGSDHI LVFSEPLQGL THPKKKRGNY HFVHTQKQLA PPIVVSVELS TTFVNMYPSC AQKNILMPYP ITDGRYFNGD LDKEARWAIQ NRSLDSIDSK SSPVLVAEKD PVGTLADARP IAQWYRAGVH GECVPLRAAL QQNYKCTPSF PSFKRTPTTY PLGMRMATFC PCPGGDTASA KRMFDAVLAG CIPIILSHDF VWPLSDEFEP EMLIKVSDFA LRWNASNFVV RKFDNQCRPS VANTNYALPS VQELLEAIPA SEIRRLRRGL RHAQQAYSYY KPRKGFPRNP LRDRVLPDGG ASQALVAALA KRAGGVRWHA CQKELGQLVE AEGKDAEPDR FKC
|
| |