Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47728 |
Symbol | |
ID | 7202906 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 660147 |
End bp | 661389 |
Gene Length | 1243 bp |
Protein Length | 299 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182111 |
Protein GI | 219123601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTGG TCGAAAAGAC CAAGCGCGAG CGACAGGAAA AGGCGGAACA ACAAAAGTTA GACGCGCAAT TAGAGCATGA ACGGCAACAA CAAGAAGTAG AGGACCGGCA GCGGCGAGCG GAAGAAAAGG TGACGCGCGA ACAACAAAGA AACACATACG GAAAGCGCGA CAGAGTCTGC GAAAAAATGG CGTCGGCCTC TTTTGAATCT CTCGAATCGG AACAGAAATC GTCCATTGTA TGGGCGGATA CCTTCGACGT GAATCTGGAT GTCGAAGTGC TGTGTACAAA TTTGGATTTG ACGGGGTTGC AATCGTTGGC TCAAGAATTG GAAAATATCA CTTGTCCGAA AGAGTCGTTG ACGATGATTC ACCAAGAAGT ACTGGTGGCT AAGCAGCGGG AAGCAGACGG GGACTTTATC AATGGCGAAC AATCGTCTCC ATCTCACAAC GGAACTTTTT CGACGAAAGA AACGACAAAT CGCCTGTTGT AACACCGGCG TTGAAGCCGA ACCTCTGGAC CAAGGAGGAA TTGTCAGCCC TGGCTAAGGC AGTCAAGAAG TATCCACCCG GCGGTTCCTC ACGATGGGAA CAGATTGCGT TGTTTGTGAA TAATTTGTGT AAACAGGACG AACCCCGATC CACGGAGGAA TGTATCGAGA AATACAACAA CGTGGCGAAG ACGCACAGCA TACCAACCGA TAGCACGAAC GGCGTCGCGG CAGCATCAGA ACCCGAAGAC TCTTCGCAAT CCAACGAAGA CGTGTGGACG GCCGAACAGG ATCAGAAGCT ACAAGATGGA CAAGCTGCGA ATCCAGCGAG CATGGGCAAA AAGAGCGGTG GACCGCAATT ACAGAGTGTG TCCGGGAAAT CCAAGAAGCA GTGCGCACAA CGGTTTAAAG TGATTCGGGA TTCATTCACA ACAGTCAATG GCTAATTGGC CTCCATCGTT CACTGGTTGA CATTTGCGAT CGGCCCAACA CTACAGAGTT GACCCTTTTT GATGGATTCC ATGTAAAGAG GACATTCGAG TCTCTCGCCG TCACTTTCGC GCCCACCATC CATTGTGGTC GCGGTCTTTG TCATTTCTAA ATTCAAGTTA CGTTTTTGGT GCGACGCAAA GATGAGTCTG GCTCCCTTTA CGCCCTGCGA TTGTGCGGCA CTTAAAGAAT ATTGGTTTGG CAAAAAAAAC TGCCGCATAT GTGACAGGAT GCGTAAAGGT GAGCAATCTT TTCCTTTGAC GTG
|
Protein sequence | MFLVEKTKRE RQEKAEQQKL DAQLEHERQQ QEVEDRQRRA EEKVTREQQR NTYGKRDRVC EKMASASFES LESEQKSSIV WADTFDVNLD VEVLCTNLDL TGLQSLAQEL ENITCPKESL TMIHQEVLVA KQREADGDFI NGEQSNDKSP VVTPALKPNL WTKEELSALA KAVKKYPPGG SSRWEQIALF VNNLCKQDEP RSTEECIEKY NNVAKTHSIP TDSTNGVAAA SEPEDSSQSN EDVWTAEQDQ KLQDGQAANP ASMGKKSGGP QLQSVSGKSK KQCAQRFKVI RDSFTTVNG
|
| |