Gene Information |
Locus tag | PHATRDRAFT_47949 |
Symbol | |
ID | 7203198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 529665 |
End bp | 531300 |
Gene Length | 1636 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182410 |
Protein GI | 219124227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCGGAAAGC CATCGTCGTC AGGATGGACT TTGCTCGATG GATGTCCTTC CTCGGTGGTT GGGACTGCGT CGAGGTAGAA GAGAAAGATT CCAGTAAAAC CTGGACCACG CGCTTGCAAG ACGACCTTGA GGCTGTATGT TTCCCCCACA AAAATTTGGA CTCTCCCGAA CGTATAAAAA TTCCAAACCG AACTGTGTCG GAAGAGGAGT CTCCTCGTTT GCGGCCGGCA GAGGTCCGGA TACATTCATC GGAAATAGTG AACGAGGAAT GTCCACGCTA CGGGGACAGT GTACGAAAGC CGGAGACGCA CGTACTCACG GTGGACCCGA AAGTGGCAAC TCACACCAGT TCTCAACCGC AAAGAACCTT TCCCTCCGCG ACACTGCCAA CGCCACCCCA AAACAGGCAA AATATTCCGG ATAACACCGC CAACAACCTG CCTTTGCGCA AAGAGGACCT TGCGACTCCT CCCAGACCAC CTGTATTTTC TCGCGCACAT AGTTTCCAAT CGCACACAAA ACACCAAACG AGAGAACGAC TCGGAAATCG GGATAGAATT CTTTCAGAGA TGCCGGCCGT CAAGAATTTC GACGACGATA TAAACAGGCT TGTCGCTACG AGACATGGCA GCAAATATGG CTCTGGTGTC GAGGCAGCCA ACCTTTCGGA AGATTCTTAC GTGGCCACGG AAACGAGTAT GGACTCGGGA GATCTCCGCA CCCTGGAGAG GGGCGACCGT AGTCCGGATC TTTGCGTTGG TGTAATGACA GGTTCGCCAA ATGTCGCATC AGGTACAGAG CAACAAGATC GATTCAGCGG TGATGCTTTC GAGGGCGATT CCCGATCTAC CGACAGAGCT TTGCCCTTGG GCAAAATAAA CTATACACGA AAAGAAAGGC ACTTTCGCTG GTTCGCTCAT GGAAAAATGT GGACGTCTAT TGCCGTTCTG CTTTCCATTT GTGGGTCGTT GATGTCTGTC CTTTCCAGAC GAAGTACGAG GTTTGTTGTA CTAAGCGAGC CACTCAATAT TGCTCCCGTC TATAACGCTG TGGACAAGAT CGGAATGATC AGAATGGAGC TCTGCTACAA CACCTCTGTA GTCAGTGAAT CTGGCTGTAC GGTCATTCCC TTGACAACTG AGGATGTTGA CGATAACATG TTTGAGTTGG CGCGGATCTT TTTGACGCTG TCGGCACTGT CGGGCGTGTT CTTCACCATC TTCTTGTGCT CCGCTGTTTA CTGGCAATCA ATCAATCTGA AGCCCATTGG AATTGGCTTC ATCGTAACCT ACTTCTTTCA ATCTTTTTCG ATGATTTTCT TTGATTCCAT GATATGCTCT GACAAAAACT GTCGAGTAGG GTCTGGTAGT CTCTTAAGCA TATTTGCCAG TCTCTGCTGG ATTGGAGCTT GCCTTGCGAC GGCAAAAATG GATGCCTTCA AAATCATTGC TCAACGCAGA CGCCGACGTC ACGCTCGGCG TCTCGCAAAA ACGAGGAAGA TGGTGAGAAA GGCTTCTTCG GAAACTGTCA AGACTACCTC ATCAAGGGAG AGCGATGGTA GCAATTCCGT TATCGATCTG GAAGTAAATG GATAGGGGAA AAGTAGCTAC TTAAGTTTTG ATAGTCGACA CTGTGT
|
Protein sequence | MDFARWMSFL GGWDCVEVEE KDSSKTWTTR LQDDLEAVCF PHKNLDSPER IKIPNRTVSE EESPRLRPAE VRIHSSEIVN EECPRYGDSV RKPETHVLTV DPKVATHTSS QPQRTFPSAT LPTPPQNRQN IPDNTANNLP LRKEDLATPP RPPVFSRAHS FQSHTKHQTR ERLGNRDRIL SEMPAVKNFD DDINRLVATR HGSKYGSGVE AANLSEDSYV ATETSMDSGD LRTLERGDRS PDLCVGVMTG SPNVASGTEQ QDRFSGDAFE GDSRSTDRAL PLGKINYTRK ERHFRWFAHG KMWTSIAVLL SICGSLMSVL SRRSTRFVVL SEPLNIAPVY NAVDKIGMIR MELCYNTSVV SESGCTVIPL TTEDVDDNMF ELARIFLTLS ALSGVFFTIF LCSAVYWQSI NLKPIGIGFI VTYFFQSFSM IFFDSMICSD KNCRVGSGSL LSIFASLCWI GACLATAKMD AFKIIAQRRR RRHARRLAKT RKMVRKASSE TVKTTSSRES DGSNSVIDLE VNG
|
| |
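The derived fields in this record (Gene Length, GC content, Protein Length) can be cross-checked against the coordinates and sequences listed above. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported length of 1636 bp, and that the spaces in the sequences are only display grouping. The helper names `clean_sequence`, `span_length`, and `gc_fraction` are hypothetical and chosen for illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into display groups."""
    return "".join(grouped.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp rows of this record.
    print(span_length(529665, 531300))  # 1636, matching the Gene Length row
```

To check the remaining fields, pass the full Gene sequence string to `gc_fraction` and compare the result with the GC content row, and compare `len(clean_sequence(...))` of the Protein sequence string with the Protein Length row.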