Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44271 |
Symbol | |
ID | 7197947 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 83579 |
End bp | 85685 |
Gene Length | 2107 bp |
Protein Length | 645 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178161 |
Protein GI | 219114731 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0166766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTGTATCTT TATTGCTGTG GCTCAGGATG GGCAAAGTCT TATCACTCGC AAACCTTGCG ATTGCTCGTT ATATTTCTCT AACCCCTGCG GTCCGTACCT TGATGCTGCT ACAAGTAGTG GCTCTGCTTT TGAACTTGTG CGGCGCTTTT GAGCCACCGA AGACTCTTCG CAATGTTCCT CTGGACCGCT CCGATGTCGG ACGTACAGGG CTTTCCCCAG AACCTACGTG CGGACAGTAT TTGGCACCGT CTATCCTCCC TGGAGCCGGT TTGGGCATGT ATTCGGGTGT GGAAATCGAA GAAGACGATC CTGTGAGCTA CGGAGACATT TGTATTCCAC TAATTGATTT GAAATTGGTA CGTCCATTCC CAGGACAGAA TTCTGTATCC ACTTCCCGCA ATGATTCGCA CTTTTCGTAG CGGCTTGCAA CGAAGCTTGT GAATACTTTT CGTCTCCACG AGATCTAACG GTGTTTTTTC TTCAATACTT TTGCATCAGC ACGTCGGGAA CGATGACAAC GACTTCTACA ATCCCTTTGC CGCCTACGTG TGGCAAGCCA GCGCCTTGGG TATGATTGAT CTCAACCTTT CGAAGGATGT CAGTGCCTTC TGTATGGGTA TCAACAGTAT TGCCAATTGC CACCTCCCCT TAAAGCATGC GTACCAGACT AAGCCGGAAT ACGATCCTCA AGTCCTGCGA CACAGGCACG CCTCGGTTGG ATCTTTTACG CCCTATTTCA ACCTCACATC TTTCGCGGCA CGTCCCATTC CGTCGGGCAG TGAAATTTTT AAGTCGTACG GCAATTACTG GTTTGAAACC CGATCCGACA CGTTCGGTCA ACAGTTTCCT TTGTCGACTT CATACAAGGA AGCTAGCAAC CTTCTGGAAA AATTGTTTGT GTCAATGAAA TTGACCAGCT CTCTAAGTGC GAGCTTGTAC GATGAGGTGG TGTTGAGTAT CAAAGAATTG CTACCTTCGC GTACCATGAA CGCCTTTCCC AATACGGTAC CCCATGCCAT TTTAGGAGCT CACGAAAGCT TGGCTGCTGT TCATCAAACA CTGGTCATCC GAGATTTGGA GTGGTTGCGA TTGAATGGTC GGTGCCTGGA TCATATCCGT CCGGGAACGT CAACTCTTGA AAACGTTGGG CACGGTGCAT TCGCCACACG AGATTTATCA TCCGGAACTA TCGTGTCTGC CTCCCCTGTA CACCACATTG AGCGCGCATT TACCCAAATG TATGAAGTCA GATACGATGA AGCTGCCAAA AAACATGTAC CCGACAAATC AAAAATTGTT GCCTATCAGC TGCTATTAAA TTATTGCTTT GGCCACAATG AATCAACTGT GTTGGTCTGT CCCTATGGAA ATGGTGTAAA TTATATCAAC CACGTCCGGG AACATGCCAA CGTCAAGGTG CGCTGGGCTC AAGACTTCCC AGCTCACCAA GATGATGTCC TGCATCAGGC AACCCCCGAA GACCTCTTCA ATTCCACTGC GGAACCTAAA TTTGTTCTCG AATATATTGC TTTACGCGAC ATTGTAGCAG GCGAGGAAAT ATTTTTGGAC TATGGCGAAT CCTGGGACGC TGCGTGGAAT GCCCACTCCT TGTCCTACGA GCCGTTTGGT GGAGCTACCT CGGCTGAACG CTACCGAGAT GCGTTTTGGG TCAACGAAAA TCACGGTGAC ATGCCACTAC GAACTCGGCA AGAGCAAATC GTGGATCCGT ACCCAGATAA TTGGGTTTTG CGCGTTCACC CCTGGGTCTT GCAGCACCAC ATAAAGTATG GCTACCTTTC TGGCAATTAC GATTGGCAGG AGTCAGATCG TACTCCACTC CGCGATGATT TCGGCTTGCC TTGTCAAATT TTGGAGCGGC ACGAAAACGG AACCATTCCA CACAAGTACA CAATCCTGGC TGATACCACT AGCGTTGATT GGTTCGAGAC AGACAGCGTG GCCATTTCTC AAGTGCCTCG ATCGGCTTTG ACCTGGTTGA ACGCACCCGG AACAATGGAT TTTCATCTCC CCAACGCCTT TCGACAACCC ATAGGATTGT CGGACGAAAT GATGCCAACC CAATGGCGGA ATCTCCTGGC CAAGTAA
|
Protein sequence | MGKVLSLANL AIARYISLTP AVRTLMLLQV VALLLNLCGA FEPPKTLRNV PLDRSDVGRT GLSPEPTCGQ YLAPSILPGA GLGMYSGVEI EEDDPVSYGD ICIPLIDLKL HVGNDDNDFY NPFAAYVWQA SALGMIDLNL SKDVSAFCMG INSIANCHLP LKHAYQTKPE YDPQVLRHRH ASVGSFTPYF NLTSFAARPI PSGSEIFKSY GNYWFETRSD TFGQQFPLST SYKEASNLLE KLFVSMKLTS SLSASLYDEV VLSIKELLPS RTMNAFPNTV PHAILGAHES LAAVHQTLVI RDLEWLRLNG RCLDHIRPGT STLENVGHGA FATRDLSSGT IVSASPVHHI ERAFTQMYEV RYDEAAKKHV PDKSKIVAYQ LLLNYCFGHN ESTVLVCPYG NGVNYINHVR EHANVKVRWA QDFPAHQDDV LHQATPEDLF NSTAEPKFVL EYIALRDIVA GEEIFLDYGE SWDAAWNAHS LSYEPFGGAT SAERYRDAFW VNENHGDMPL RTRQEQIVDP YPDNWVLRVH PWVLQHHIKY GYLSGNYDWQ ESDRTPLRDD FGLPCQILER HENGTIPHKY TILADTTSVD WFETDSVAIS QVPRSALTWL NAPGTMDFHL PNAFRQPIGL SDEMMPTQWR NLLAK
|
| |