Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31533 |
Symbol | |
ID | 7196074 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 304048 |
End bp | 305768 |
Gene Length | 1721 bp |
Protein Length | 516 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177063 |
Protein GI | 219110623 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0011992 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCTGC TGGACGGACG CACCATCAAG TCCACTCACG TCACGGAACT CGACGTTCCC GATCTCCCGT TAGCCGCCAG AACCGCTCAC ATTTTCCCAG GGCTCACAAA CGGTTCACTA ATTTCAATCG GACAGCTTTG CGACCATGGC TGCATAGCCA CATTCACTTC TGACGCCGTC ACCATAACTC TCGACAAAAA AGTCATCCTC CGGGGCGATC GCTCGGCCCC TAATCGACTG TGGACTCTCC ATGCACCCAG CTCGACCCCT CCTACCAAGC CGCTTCCTTC ACCAATTTTT CCTGTCGCCA ACAACGTCAA ACACTCTTCT TTGCTTGCCG ATCGGATTGC CTTCCTACAT GCATCCCTAT TCTCGCCTCA GTTGTCAACA TGGTGTAAGG CCATCAACGA AGGCCGCCTC ACCACTTTCC CTCAAATTTC TTCGGCCCAG ATAAAACGCC ACCCCCCTTG ATCCGCTCCC ATGCACAAAG GCCACTTGGA CCAACAGCGA GCCAACCTTC GCTCTACTCA ACCTACAGCT GTCGCTTTCT CGGCCACAGA CTCAATCGTC GACAACCTCG ACGAAAATCC AGTTCCAGAC GACCCTCCAG CTCTCAAATC CAATTTTTTG TACGCCGATT GCTACGAAAC AACAGGAAAA ATCTTTTCGG ACCTCACAGG CCGTTTCGTC ACCTCTTCCA GCACCGGCAA CGCGTACATG CTGGTCGTAT ATGATTACGA TAGCAATTTC ATCCATGTCG AACCGATGAA AAATCGTACC GGAACCGAAA TCCTCGCGGC TTATCGGCGC GCTTTCGACC TCTTTTCATC TCGAGGCCTG CGACCCCAGC TCCAACGGTT GGACAACGAG GCATCTACTG CCCTTCAGCA ATTCATGGAT GACTCTAGAG TTGACTTTCA GTTAGTACCT CCTCATTTGC ACCGTCGCAA CGTTGCCGAA CGAGCGATCC GCACATTCAA AAATCATTTC ATTGCCGGGC TTTGCAGCAC TGACAAAGAT TTTCCTCTTC ACCTTTGGGA CAGGCTACTC CCCCAAGCAA TCATGACCTT GAACCTTCTG CGTGGCTCGC AAATTAATCC TAGACTCTCG GCGTGGGCAC AGGTCCACGG CGCCTTTGAC TTTAATCGCA CTCCGTTGGC GCCACCCGGC GTAAAAGTTC TCGTTCACGA GAAGCCTACT GTACGCAAAT CGTGGTCTCC CCATGCCGTC GACGGCTGGT ACATCGGACC TGCCATGCAT CACTACCGAT GCTACCGTGT TTGGATCAAT AGCACCACCA GCGAACGCAT CGCCGACACT TTAACTTGGT TTCCAAGCAA AGTACAGATG CCGACCACTT CGTCACGAGA CACCGTGGTA GCTGCCGCCC GCAACCTCGC CACAGCCCTA TCAAATCCGA CTCCAGCTTC TCCACTTGCA CCGCTTGCCA CTCAGGAACG CGTTGCTTTG CAGCAATTGT CGACTATTTT TTCGAATTTT TCTGATCCCA CAAGCCCACC GGCAGCAATT TCCCCGTCTG TTACCTCCGT ACCCCGCGCA GCCCCTGCTG CCGTGCCACC GCGAGTCCAA TTCAAGGATT TGCCCACCGC GCCACTTCCG AGGGTGCCAC CAAGGTCCAC CGGCCCAGCC AATTCCCAAT CACTTCCGAG GGTGCCCGTT TTTGCTCCCG CGACGGAAAC ATACAAGTTA GTCACCTGTA A
|
Protein sequence | MVLLDGRTIK STHVTELDVP DLPLAARTAH IFPGLTNGSL ISIGQLCDHG CIATFTSDAV TITLDKKVIL RGDRSAPNRL WTLHAPSSTP PTKPLPSPIF PVANNVKHSS LLADRIAFLH ASLFSPQLST WSVAFSATDS IVDNLDENPV PDDPPALKSN FLYADCYETT GKIFSDLTGR FVTSSSTGNA YMLVVYDYDS NFIHVEPMKN RTGTEILAAY RRAFDLFSSR GLRPQLQRLD NEASTALQQF MDDSRVDFQL VPPHLHRRNV AERAIRTFKN HFIAGLCSTD KDFPLHLWDR LLPQAIMTLN LLRGSQINPR LSAWAQVHGA FDFNRTPLAP PGVKVLVHEK PTVRKSWSPH AVDGWYIGPA MHHYRCYRVW INSTTSERIA DTLTWFPSKV QMPTTSSRDT VVAAARNLAT ALSNPTPASP LAPLATQERV ALQQLSTIFS NFSDPTSPPA AISPPCCRAT ASPIQGFAHR ATSEGATKVH RPSQFPITSE GARFCSRDGN IQVSHL
|
| |