Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35869 |
Symbol | |
ID | 7200864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 933837 |
End bp | 935441 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180350 |
Protein GI | 219119167 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.597228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATC TCGTGGTGCA TCGGTCGGAC GGTGCCCACG GGGATCATCG TCGCCCTCCT CCCGTATATC TCTGCTACGA CGAACGATTA CTCCGTCACC ATCCGGTACA TTGGCAACCT CCCGCAGTCT ATCCAACACA AGACGACGTA CGGATAAAAG CGTTTCCAGC CGAGTACGTC TACGAAAACC CCGAACGCAT CCGTTGCGTC TACGAACATT TGCAAAGCGT CTTTCCGCCC GACACCTTTC TGCCGCTCCC GTGCCGTCTA GCCACACGTG CCGAAATCAC CGCCGTCCAC GACGAGGCGC ACTACGATCG TCTCGCCGCA ACAGCCTGTA TGACTGCGGA AGAACTCGTC GAACAAAGCC AGCTCGACAG TGATTTGTAC TGGAACCAGG AGACCTTTGC TGCCGCACGC TTGGCTTGTG GCGGCCTGTT GAATTGTGTC GACGCCGTGT GCCGATTAAA CGCGACTCCC AACACCGACG CGACACCACC GGCAATTCAC GCCGTGGCCT TGATTCGGCC TCCGGGACAT CACGCTTGTC AACATCGCGA AATGGGCTTT TGCTTTTTGA ATTCCGTCGC CGTGGCCGCG CGGTACGCGA TTGAGCAAGG CCACGCCACC AAAGTTCTCA TTCTCGATTG GGACATACAC CACGGTAACG GCACACAAGA TCTCACCTAT AACGACGAAC GTATACTCTT CGTGTCAATG CACCGCTACA CGGGCAGCAA CGTCGCCAAG CATTTCTTTC CCGCCACCGG AAAACCCACC GAGACCGGAC GGAACGCCAC CAACGTCAAC CTGGCCTGGA CGCAAGGACA CATGGGTGAC GTGGAATACG CCGCCGCCTT CTCGGAACTC GTGTTGCCCA TCGTTGTCGC CTTCCAACCC GAATTGGTGT TGATTTCCTG TGGACTGGAT GCCGCTCGGG GAGATTTGAT TGGGGACAAT TCCGTGTCAC CACTGGGGTT TCGGGCCTTG ACGCACAGTG TTGTCCGCGC CGTAGGCACC CACACCACAC CCGTGGTAGT TGCCCTGGAA GGCGGCTACA GTATGGACGC TTTACCCATT TGTATGGAAC ACGTCGTCAG GGGACTATCG GCCGCCAACG ACACCAGTTT AGACTGGGAC GTGGAGAATC TTCCACAGGC ATGGGCGAGT GATAGCTTGG AGTACGCTCA CCAAGCTTTG GCAATGTACT GGGACTCCAA CCGTCGAGTA GCGGCCGACA ACCAGCCCGC GATACAACCT TCGGCAATCA GTAATATCAA TCAAACCGTT ACAGCCTTGC AAAAGTGTTC CTCACGCTGG AACGAATGTG GTTTAACGAA ACTCCTAAAG CCACCGGGGC CGTCTGCAGT CTCGACTCGC GCGTCACGGC GTTTGGTAAA GTCCTCTCCC AACACTTTTC TACCAAATGG GTCGAATGCC ACACCGCAGC CAAAAGCCTC CAAGGTTCTG GCTGTATCGT TGGCGAAGGA TCCCGTCAAG GAAACATCGC TCCCCGCTGC CAAAGCGGAC GAGAACGTCG ACGGTGGGGA TGGTGATGCG TTGATTGCGG CTTTGCAGTT GCTGTCCTTG TCCAATGGCA AGTAG
|
Protein sequence | MSDLVVHRSD GAHGDHRRPP PVYLCYDERL LRHHPVHWQP PAVYPTQDDV RIKAFPAEYV YENPERIRCV YEHLQSVFPP DTFLPLPCRL ATRAEITAVH DEAHYDRLAA TACMTAEELV EQSQLDSDLY WNQETFAAAR LACGGLLNCV DAVCRLNATP NTDATPPAIH AVALIRPPGH HACQHREMGF CFLNSVAVAA RYAIEQGHAT KVLILDWDIH HGNGTQDLTY NDERILFVSM HRYTGSNVAK HFFPATGKPT ETGRNATNVN LAWTQGHMGD VEYAAAFSEL VLPIVVAFQP ELVLISCGLD AARGDLIGDN SVSPLGFRAL THSVVRAVGT HTTPVVVALE GGYSMDALPI CMEHVVRGLS AANDTSLDWD VENLPQAWAS DSLEYAHQAL AMYWDSNRRV AADNQPAIQP SAISNINQTV TALQKCSSRW NECGLTKLLK PPGPSAVSTR ASRRLVKSSP NTFLPNGSNA TPQPKASKVL AVSLAKDPVK ETSLPAAKAD ENVDGGDGDA LIAALQLLSL SNGK
|
| |