Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40749 |
Symbol | |
ID | 7198621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 299077 |
End bp | 300597 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184775 |
Protein GI | 219129183 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCCA TTGAAGTCAT TCGCTTTACA CGGAATCAAA GAATTTTGAG TGCATTCGTG GTACTTCTCA ATATTGCTGT CGCTCTGTAT CTCCTATTTG TCGACTCCCA GAGGAGTCCA ACGCATGACC GCTCGTTTTC ATTTCCGCAC GCCCTAGCAA CATTGCCTCC CAACGCTTCT CAGTCGGAAT CTATTGAGGT TGGGTCGTCG AGTAGGACGA GGGAGATAGC TACATTAGCA TCCGCTATGG CTGAGATCAT ACGTTTACTT ATACCCGAGC CATCAACAGC CAACGAGTAT CTGGCACAGC ATGAACCAAA AACTTTTCGA TTTTATGTAT ACGACAATTT ATCTCATGAG TACACGTGGC AATATAGCGC TAGTTGTATG AAGGCGAAGC GCCGTCTGAG TGATACATGT GATTGGGGTG AATCAGTCTG TGGGGAAAAG CGGCTTACTC GCAGCCCGTA TTCCAAGCGC AGATTAAACC GCAACGGCGA TTTGGTTTTG AGCAAGGCTT TTTCGTCCTA TCAAGGAATC CTACGAACTT ACGATCCGAT TGATGCGGAT TTGTTCGTGG TCCCGTATCC AAGTCAGGCA CACTTTCACT GCAATCAAAC GTCCCACGAG GACGTGGAAA CGCGTTTACT GGATCGACTT GCTTACTTCA ACAAAAAGAC CCGTAGAAAA CATTTGTTTT TTTCTTCCGC GGTACGGTCT GCCTCCAACA AATTTATGGG TTCGCTGCCG CTCTTGGTAA CAATCGGGCC AGTTGACCGA CAGTGCAGAA TAGGTCGGAA TTGCGGTCAA ATTGTGATGC CGTACGTAAA CACCAATCCG GAGTATCAAC CAATGGTCGT TCAAAAGAAC CTCCGTTCCC TAAAGGACCG AAAGTTCGCC ATGGTGGCAA AATTCAACGC CTATATATCC GGTAACAGCA TGCCTCGTAG CGATTTTCTC AAGGTTGTCG GTAACGTGAC GGCGATCGCT GGATTCCCTG TGCTGATTTC GGCGCTGGGT CGGCGCCGAA CCATGCCCAA CGAGCGCAGC GTACTAGAAG ACTACCGCAA CGCAATCTTT TGCCCTTGTT TGCGAGGCGA CGAACCTCCG CAGAAAAGGC TGTTCGACGT CATGATGTCG GGATGTATTC CGGTAGTATT GGACTTTCCA TCGAAAGACC CAGGCTACCG GTCACATTTT GCGTCTATGG CAACGTCAAC GCGCGGGGCC TATCCTTTTG CCAAGGGTTC TTTCCACGGT TGGCCAGAAA TGGGTTTGGA CTACAACGAG TTCATGGTTA CTGTGAATGG TACTTGTGGT GTATCATGCA TTGTTCCGAC TCTGGAAGAT TTGCTCTTGA ACCATCGTGA TCGATTGGTA AACATGCAGG AGCGGCTAGC AAAAGTTATC AAAGTGTTCA GTTATGGGAT GGAGCACAAT ACATTACAAC ACGCAGACGC GATATCAGCG ATTCTCGTGC AAGTAAAGCA CTACGTCGAT AGTCTCGGTC AAGTTTCATA G
|
Protein sequence | MLSIEVIRFT RNQRILSAFV VLLNIAVALY LLFVDSQRSP THDRSFSFPH ALATLPPNAS QSESIEVGSS SRTREIATLA SAMAEIIRLL IPEPSTANEY LAQHEPKTFR FYVYDNLSHE YTWQYSASCM KAKRRLSDTC DWGESVCGEK RLTRSPYSKR RLNRNGDLVL SKAFSSYQGI LRTYDPIDAD LFVVPYPSQA HFHCNQTSHE DVETRLLDRL AYFNKKTRRK HLFFSSAVRS ASNKFMGSLP LLVTIGPVDR QCRIGRNCGQ IVMPYVNTNP EYQPMVVQKN LRSLKDRKFA MVAKFNAYIS GNSMPRSDFL KVVGNVTAIA GFPVLISALG RRRTMPNERS VLEDYRNAIF CPCLRGDEPP QKRLFDVMMS GCIPVVLDFP SKDPGYRSHF ASMATSTRGA YPFAKGSFHG WPEMGLDYNE FMVTVNGTCG VSCIVPTLED LLLNHRDRLV NMQERLAKVI KVFSYGMEHN TLQHADAISA ILVQVKHYVD SLGQVS
|
| |