Gene Information |
Locus tag | PHATRDRAFT_50522 |
Symbol | |
ID | 7199296 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 303560 |
End bp | 305344 |
Gene Length | 1785 bp |
Protein Length | 525 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185466 |
Protein GI | 219130634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTAGATAGTC TTTTCGGTGA CGAAAGTGTG GAAGGGCCGG CCCCCTTCTC GACGCGATCA TCCAACATCT TATTTCCTTC TTTTTATGTT TTCATCGCAG TGCAGCTTAT CCTCGTCTGC CTTGTCGTAC ACACACACTG CAATTGAATG AAACCAGAAG CGAATGCCGC TCCGCGTTGG CGGAACGCTC CCGCGACAGA CCATTCTTTT TTCAACAATC TTTCCCATCC GCCGTCCACT TCCTTGGATT CAAACAGCAA TTGGACTCCC ATGCACACCG CCGACTTTGT TCCAAATGCT TCGCTAGCGT CTACCGAAAC TGCGAACCGA TCGTTTGTCT CGACACCAAC TCGACCGATT TCTCCGCGCA GCAGTAGGAC GGTCGACCAC GGCGACGGCA CGGAACTACG GCCTGCAACA ACAACAACAA CTTCAACCCC ACCCAATCAG CCACAAACAC AAATGTACTT CCCTATGCTA CCGCGTAACG TCGAAAGCTC CCAAATTGAC CCCGCAAGGA TTACACAGTA CCCGTTAGGC CAAAAGCAGG TCTCCACCAA ACAAAGAAAT CATCAGCAAC TGGAATCGGA ACAGTTTGAT TCCTTGCAAA ACAAGTGGGC GGTGAAAGAC GCAGCGAACG AGAAGAGCGG TATCTACGTC TCGGGCTCCG ATACTTCCTT GTTGCATTCA AGCTTTCCGT CTCGCACCAC AAATACTATT CGACCGTCTC CACTCGTCAC CGAAATCATG AGGCTTCCAG CGAAGCGAAA CTTTGATACG ACAAAAGATC ACAAGACGAA CGCGCTGTCG AGAAAAATTG AATTTTCCGG AGCTCCGGTG GCTGTGACGA CTAGTGGCTC TGCTGGGAAT GCGCCTTTTG CGACAGGGCA TTCGCCCATA CTGCAACCCG GTGGGCTTCA ATCCTTGCAG CCTCCACCTC GATTCCGATC CATATCGGAT ACTGTCGCAA TAGGTTCAGG GGTTTCCCCA TCAACAGGCA GTGGCTCTGC ACAAGCCGAT GCACAGCTGC GCATCTCTAA TCCCTACGCT ACGACAGAAT TTGGATCTCG ATCCAATTCG CCGCTAGCTA CAGCGTCAGA CACCATGGAT TATAATTCCC TGCTTCACAA TTCCTGCAAG CTGTACCCTA CAACAATCAC GATAGTGGAA AGCGCTTTGC GCTTCGATCC CGAAGGTATT CGCAGAAAGG TTTCGATTGT ATGTGAGAGA AACATGGGTG GTCAAACGAG CAAACTGCAG GCGGTGGAGC GATACGTTTA TCCGATAAAC ATTGCGCTCA GATTCAATGC CGCCTTGGAC GTACTGCAAC TACTTGCATC CAAGGGACCA GAAGTATTGA TGGAATCAGA TGGTTTGGAT CATATGAGTT CGCTGGGTAT CGCGCTAGCT CTTGGGCACC AAACGAAGGT TATCTACTTG TTACTCTCGA CGAATCCCCG CAGTGCACGA ACCAGAGATC GTTACTCCAA TTTGCCCCTC CATGTTGCTG TGCGACAACC TAGCATTACT CTCGAGATTG TCGAAATGGT ACACATGGCC TTTCCCGAAG CAATCAAAGC CCGCAACTTT CACGGCCAAA CTCCGTTGGA CGTAGCTGTC CGAACGGTAG CTTGCCCAGA CGCCGTGCTA AATTATTTGC AAATATCTGC TTTCGGGCAG CTCGAAGCGG CTGCCGACCA TTTTGACGAC TCCGATTTCA TCTAGCTAAA AAAGGACTTC AGAGAGCGAA AAGTTTCTTA TGCTAGAACT TGAGTACGTT TTGCT
|
Protein sequence | MKPEANAAPR WRNAPATDHS FFNNLSHPPS TSLDSNSNWT PMHTADFVPN ASLASTETAN RSFVSTPTRP ISPRSSRTVD HGDGTELRPA TTTTTSTPPN QPQTQMYFPM LPRNVESSQI DPARITQYPL GQKQVSTKQR NHQQLESEQF DSLQNKWAVK DAANEKSGIY VSGSDTSLLH SSFPSRTTNT IRPSPLVTEI MRLPAKRNFD TTKDHKTNAL SRKIEFSGAP VAVTTSGSAG NAPFATGHSP ILQPGGLQSL QPPPRFRSIS DTVAIGSGVS PSTGSGSAQA DAQLRISNPY ATTEFGSRSN SPLATASDTM DYNSLLHNSC KLYPTTITIV ESALRFDPEG IRRKVSIVCE RNMGGQTSKL QAVERYVYPI NIALRFNAAL DVLQLLASKG PEVLMESDGL DHMSSLGIAL ALGHQTKVIY LLLSTNPRSA RTRDRYSNLP LHVAVRQPSI TLEIVEMVHM AFPEAIKARN FHGQTPLDVA VRTVACPDAV LNYLQISAFG QLEAAADHFD DSDFI
|
| |
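The length fields in this record can be cross-checked against the coordinates with a few lines of Python. This is a minimal sketch, not part of the source database: the function names `feature_length`, `cds_length_bp`, and `gc_percent` are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def cds_length_bp(protein_length_aa: int) -> int:
    """Bases needed to encode a protein plus one stop codon (assumed layout)."""
    return 3 * (protein_length_aa + 1)


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are ignored, so a sequence printed
    in space-separated blocks of ten can be passed in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates and protein length taken from the record above.
    print(feature_length(303560, 305344))  # 1785, matching "Gene Length"
    print(cds_length_bp(525))              # 1578 bp of coding sequence incl. stop
```

Under these assumptions the coordinates reproduce the listed gene length of 1785 bp, and a 525 aa protein accounts for 1578 bp of it. The remaining bases would lie outside the coding region. `gc_percent` can be applied to the gene sequence string to compare against the listed GC content.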