Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49551 |
Symbol | |
ID | 7198221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 17522 |
End bp | 19539 |
Gene Length | 2018 bp |
Protein Length | 530 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184285 |
Protein GI | 219128156 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACACTTAATA GTCTTTTCAA ATTGTATCGA TTACAGTTAA TTGTGACATG AGTGACTTTC ACCTTTTTGT CGTCGAGAAT TTTTGTTGGG ATGAAGTACC TTTATAAATC CTAATTTCTA TTCCGATACT CCGAGGCATC ATCTCACATT TTCGATGTCT GCATGGCCAA CATCCATAAT ACCAATTGAA GATATATTTT TACGATGCAA TTCTTGCCGC TTGCTGTTGA TATCGATTGC TCCAGTACAT CCGTCTCAAC AGAGCAGTCA TTTGATGGAC GGGGTTGGAA AAAGGCGGTG ATGAAGCGCT TGGTCGGCAC ACTACCACGC AGGCATTCGT CGTTTCACCC AACTGACTGT GAATTGGAAA GTCGTGTGCA ATATACTGCA GACCCTCAGA AGTCTCTGTG TACAAGATTT GTACCCCCGC TTTTGGATGT CAAAGATCCA ATGATAGAGA TCAACGCCAG CCCGGAAATA GTTGTCCTCG GCAGCAAAAA CCCAAGAAGG AATCATGTGA CTTCTATTTC TCCTATCATA CCGTTTGGTC CAATCTCAAA ACGGAAGCAC AAGCGATGTA ACTCTGCCTG TGAGGAAAGC AAGAACAAGT ACCAGATGGC AAGGGGGCTT GCTTATGACG TACCAACGCA AAAACGCCGT GTATATCTCA TTGGCATTGA TTGCCCAGGG ATGTCGAGTC CAAAGCCGTC GAGGCAAGCC AAACTCTATC CGGAATCAAT TCCTCGAGAA TTTGGTACAG CTTCCTCTTT TCCGTTCCAG CGAGGTACAA GCGAACCGAA AGTTCAGCCG CTGTCAATAG AAGGGGATAC CATGTCCTTT GGATCTGGGA AAAAACTATA CAGAAGAAAT GAGAGTTCCC CATCGCCTAG ATTGAGGAAA AGAGGAAATC GGAACATTTC TATTTCGAGA ACTATGCATG GTATACCTTT TTCTTTGCGA GACCATCAAT CCATGCTGGA ACGGACCGAC AATGTGTACT ATGAAGAAAT GCCCACTCCG GATGCTGTGC CAGATGGTCT ATCGAATGGT CGTCGACCAC CGCTTATTTT AGGAAAAACA TCACGACAAT CGCGAAAGGT ATCTACGCGT TCGCTTATGG CTCACACAAC GAGGGTAAAA CCTCAGGTGC AAAGAACTTT GAAGAGCAAG CGACCTACTC CTGCCACTAG TATCTCTCCG GGCCCAGATC AGATGAGCGA TTGTGGCAGC TCAGATGTCC CTTTGCAATA CTCTTTCAAC AAACCTCTGT ATAACTCGCC GGGCAAGTAC TGCAGAGGTT TGCGTCTTGT ACAGTCGAAG GAGCAAATCG ATCAGGCAGT AAAGCCATCG TCAAAAAATG ACAAGCTTTC CAGCCCCATA CATTGGAAGC ATCGATACAA ACAGTCACAT ATTGAGAAGG CTTCCACTGT GTTGCCTCTA CCAAAACAAA GCAGTCTGGC TTCAAGAAGC TTTGCTTTCA TGTATCCGGG AAAAAATTCA CTTGCAGCAC ATGAGCTTGC TACTTTGCAT GCATTGACTA CCGATTGTCC AGGCAAAACA TATTTACGAA GTTGCGGACA AGCGAACACG AGTCCTTGCA GTATTCTTGA TACAGAGTTC TTTCAGAATG AAGACTTTTA TGCGGCGTGC AGCTATAAAA GGAATTTGAC CGACGCCCTT TGTGAATTCC CCCTTGTTGC TCAAGCAACA AAATGTCGAT GGGATTTCAC AGAGTCCGAT TGCAGATTTC CTGCGCTTCC CCTGCGTGGG TCGAGGAGTG CTTTGACTGC AGATTGATCA AACGAAATCG TAAAAGTAAG AGCAAGTTGC TGCAACCAAT CTGTCGATTT GATAAGCTGA TCAGGCACAG GATCTATGAA ACTGCGATCG TACAAACAAT CCCCAGGTCC TAGGAGAAAG CTCCGTATAT ATAACTTGTC GTTCTTGATA AGAATGGCTC TAGCTAGAGT ATATGGACGC AGTAGACAAC GTGGTGTCTT AGAAACTATG AATGTCAC
|
Protein sequence | MQFLPLAVDI DCSSTSVSTE QSFDGRGWKK AVMKRLVGTL PRRHSSFHPT DCELESRVQY TADPQKSLCT RFVPPLLDVK DPMIEINASP EIVVLGSKNP RRNHVTSISP IIPFGPISKR KHKRCNSACE ESKNKYQMAR GLAYDVPTQK RRVYLIGIDC PGMSSPKPSR QAKLYPESIP REFGTASSFP FQRGTSEPKV QPLSIEGDTM SFGSGKKLYR RNESSPSPRL RKRGNRNISI SRTMHGIPFS LRDHQSMLER TDNVYYEEMP TPDAVPDGLS NGRRPPLILG KTSRQSRKVS TRSLMAHTTR VKPQVQRTLK SKRPTPATSI SPGPDQMSDC GSSDVPLQYS FNKPLYNSPG KYCRGLRLVQ SKEQIDQAVK PSSKNDKLSS PIHWKHRYKQ SHIEKASTVL PLPKQSSLAS RSFAFMYPGK NSLAAHELAT LHALTTDCPG KTYLRSCGQA NTSPCSILDT EFFQNEDFYA ACSYKRNLTD ALCEFPLVAQ ATKCRWDFTE SDCRFPALPL RGSRSALTAD
|
| |