Gene Information |
Locus tag | PHATRDRAFT_47901 |
Symbol | |
ID | 7203108 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 389421 |
End bp | 391384 |
Gene Length | 1964 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182384 |
Protein GI | 219124172 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCAC AATCTGCGAT CTCAAAAGTG ACATGACCAT TTACGTTTCG TTTCGCTTTA TTTGTATTTA CAGTTAGTCT TTTACCACTT ACATTATTAC GGTTAGGAAA AATGCTCTCT GGCAGTCAGC TCCGCTTCGA AGCCTCTTGT CCAGCGTTCG CCTTGAATTT CCGAAATGTG GGCGTCTACC TTTGGTCATT CTTTCTCGAC CGCTTAGCCG TGATGGTGAT TCTTTGAGTG GTCGTCCACC AGCGCTACCC CAGAATTCTC GTCATGAGAA GAACATATGC CGACCGTTCC GCTATGCATC GGACAATCCG CACTTCACTG TCGAGAACTC GGCCAAATTT AAAGGCGCCA AAAAAGATGC CTTATTTCGT TTTTGTTATG CTTTTCATGC TCCTATTCTT CTTTCAACAA TGGGCGAGTC GAGCTGCTCG ACGATCGCAC CAAATACCGT TAGAGAACTT CGTCGAAGAA GTCGCTACTC ATGACGGTTT GATCCCCTCA AAAGAAGGAC CATCCCAGAA ATTACCTCAA TGGATTCAGA GTTATCTTCG TTGGCATCAA TCTGTTCGAG CTCAGTTTCC AGGTGACCTT CTTTTTACGG ATCCCGCCGC TCCAAATTTG CTGCTGAGGA CCTGTTTGGG ACTTTGCGGA GGACTGAATG ATCGACTTGG TCAGCTTCCG TGGGATTTGT ACTTGGCAAA CCAAACAAAC CGGATTCTAT TGTTGTATTG GCATCGACCG GTACCCCTCG AATCTTTTCT CATTCCGAAT GAACTAGACT GGACCGTCCC GAAAACGCGC CCGGGATTCT TTCCATCTCC CGGATCCAGA ATCGTTTCTC GCGGAGACAT GGTCTTGGCT CGAGATATTC CTGAGCTATT TGCAGACTTC AACTCGGAAC AGCCAACGGA CCAGTTTTGG AGCACTCATA TGGATGTGGC AATAAATAGA GCTACCGCCG GTCACTACCG GAACCATAAG GTCCTCCGGC ACCGTTTGTT GGGCCATTTG AACGAAGATC AGCTGGAAGT AAGGCTCCGC ACGCTTGGCG AAACTGACAT GATTCACTGG ACTGAATCAT TCGGCAATAT TTTCCGAATG TTTTTCCGTC CCGCTGCCGC TATCCAGGGG GAATTGAACC GGGTATTCAG TGATCTACAA ATCACTGCAG GATCTTATTC TGCCGTTCAC TGCCGAGTGC GACATCCTAA AGCCTCGCCT GCCCATGTTT TCGTGAAAGG AAAGAATGAT GCTTATCCAG CAGACAAAGC AGGACTACCG TGGATAGGAG AAACACAAGC TTTCGCAATT GCCACTGCTA CGAAGGCGCT GAAATGTGCC CGCCAGGCAG CACAAAACCT TTCTGAACCG ACGTACTTTT TATCGGATTC TAATGATTTG GTTCGATATA TAGCGCACGA GTTGACAAGT TCCAAATTCG TTTCCGCCAA TGCTACAATA CTCCACGCTG ACCCTGTTCA CAGTTCGGCG CTCCAAACTG TGGACTCCAT GCGTATCGTA GCCAGGGAAT CATCTCTGGA AAACGCCCAC ATCGACCTCC AGAAAGGACG GGAACCTGCG GCGTACTATG CTACATTTGT GGATCTGTTG CTCGCCGTCA ACGCACGATG TGTGACGTAC GGTATTGGCT ACTATGCAGT CCTCGCGACC AAGATTTCTG GCACGAAATG TAAGAACCTG TACCAAGAAG AAGCGTGGGG AGGCAGCGAA AACAAACGAA ACAATACACA TGTATGTCGC CTCTAAAGAA GAATGCCTCT TACAATTTCC 
ATTGTTGTCC ACACTGATGT GGCTGCTCCA TCTAGGCATC CAAGTCTGCT GAAGTTTGGT TACATATGCT GATGCGGCAG CAGTTTGGGC GTAAAATCAG GAGGGCGACG GATTGGAGTA TGCGGGCTGC TGATACAGAC GCTAGTCTTA AGGTCGGTAT TTGA
|
Protein sequence | MRRTYADRSA MHRTIRTSLS RTRPNLKAPK KMPYFVFVML FMLLFFFQQW ASRAARRSHQ IPLENFVEEV ATHDGLIPSK EGPSQKLPQW IQSYLRWHQS VRAQFPGDLL FTDPAAPNLL LRTCLGLCGG LNDRLGQLPW DLYLANQTNR ILLLYWHRPV PLESFLIPNE LDWTVPKTRP GFFPSPGSRI VSRGDMVLAR DIPELFADFN SEQPTDQFWS THMDVAINRA TAGHYRNHKV LRHRLLGHLN EDQLEVRLRT LGETDMIHWT ESFGNIFRMF FRPAAAIQGE LNRVFSDLQI TAGSYSAVHC RVRHPKASPA HVFVKGKNDA YPADKAGLPW IGETQAFAIA TATKALKCAR QAAQNLSEPT YFLSDSNDLV RYIAHELTSS KFVSANATIL HADPVHSSAL QTVDSMRIVA RESSLENAHI DLQKGREPAA YYATFVDLLL AVNARCVTYG IGYYAVLATK ISGTKCKNLY QEEAWGGSEN KRNNTHASKS AEVWLHMLMR QQFGRKIRRA TDWSMRAADT DASLKVGI