Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46559 |
Symbol | |
ID | 7201699 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 702068 |
End bp | 705505 |
Gene Length | 3438 bp |
Protein Length | 1009 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181058 |
Protein GI | 219120648 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTG CCTACATTGG ATGTCTTCTG TTGGCGCTCT TTGCGGATGA AGGCGAAGGC GCCTCTGTCC GTGGATCCAA AGACGACGTT GTCGCCGTCG AAGCACCCCC TCGAGTCCGT CACTCTTCCA GCACCGCGCT TGTATCGGTT CCTCAACGTC GACTCGCGGA AGATGCTTTT GAACCACTCA CTTGCAATTC GAGCTTTTCC AGTTGCATTC CATGGACATC GCGTTGGGGG CGGAGTGCCG TCAAGACCAC CCTCATCGTC ATCCCATGTG GCCAATGTAT TGTGATGAAT CTCGACAGTC CCAAGCTCAC CCTGCAAGAA GGTATCGATA TCCGGGGTAA GCTCGTATTT CCCGACCGTT ATGAGCTTAC TGTGGAGACA CCAGAGGTTG TGGTTCAAGG CGAGCTCGAG ATGAGAAGCT CCAAGCTTGT CGACGGGAGC CCCGCAATTA AGTTTGTCTT GTATGGTAAC GGTGATCGAC ATTTTGATCC AGTCAACAAC AACAGGAACG CTTGCGGCGG GTCCAGCTGC AACGGAGGGG CTCGCCCCAT TACAGTAGCT GGTGGAAAAG TCAACCGTAA GTTTATGTCA ATGAAAAACG AATCACGCTG GTTTCTGCAC ATCGATTCGT GACGATCGGC ACTCATAAAA ATCTCTCTAC GCTATTTCCA GTTAATGGCC TTCCTACAAA CACTCCAACC TGGCTACACG TTTACGACGT TATCGGAAGC TCGGCCATTG TCGTTTCCAA TTCGGTCCGT AATAAATGGG GTGCCGGCGC AACTATCGTC ATCACCTCTG AGCACCAAGG TTACTTTGGC GAGCAAGTTC GTAAAATCAC CAGCATTTCG AACGTCGGCT CCAATAGCGT CCGTCTCAAT CTCGACCAAC CCATCAACCG TCCGGTCACG CTTCGCGACA GCCCAGACTT TGCCACCGAG GTAGCCTTGC TGTCACGCAA CATTGTCTTT GAAGGGGCCC CCGGAACGAA GGGTGGCCAC TTTTGGATCA TGCACACTCC CCGAGTGAGG CAGCGTATCG AAGGAGTGGA ACTCGTTAAC TTCGGACAAG AAGGCCTCTT GGGTAGATAC CCAATCCACT TTCACATGTG CGGGGATGTC TCAGGCTCGG TAGTAGTCAA GAATACCATT CGCAATTCCA ACCAGCGCTG CGTTGTCGTC CACGGTACCA ACAACCTCCT TGTTCAAGAA AATGTAGCCT ATTTCACCAA AGGACACTGT TACATGTTAG AAGATGGCAT CGAGACGGGC AATCAGTTTA TACGAAACAT TGGCATACGC ACTATTAAAG CAAAGGTCAC TATTCCGAAC ATGGGTAGCA ACGGCAGGGA ATCTGATAGG TCTGCCAGTA CCTTCTGGAT CACGAACGCT GACAACTCGT GGATCGGAAA TGTAGTAGCC GGATCCGAAG CCCTGGGCTT TTGGTTTGAG CTGTTGGTCC GTGGAAACCT GGCCAACGAG CACCAAGACT TTGATCCTAT GATGGTTCCG ACCCGCAAGT TCGAAGACAA CGTCGTTCAT AGCGTATTTG GGGTAGGGAT GACCTACTAC TTGAGCGGTT ACATCCCGGA AACACTGCAG TACTTCAAAA ACAACAAGTT CTTCCGCAAC CATCACCTTG CGCTCCGTAT CCACCGGACA CGAAACATCG TTCTGACTGG CAATAAGTTC TCGGACAACA GATATGCCAT TCAAATCGAT CGTGACGAAG AAATTCATGT CACCGACACA ACCATTGTTG GCTATTCCGA TCTATTCAAG GACGTGGTCA GAAGGAATCG CTTTGCCCAA GCACCTTGCG CCCAAGGGAT ATCTTTCCAA TCGACTAATC CATGGAAAGA CAAGATGGAT TCGGAGCTCA ACGGTGTCAT TTTGGACAAA GTCAGATTTT CTGGATTTTC CAACGCGGTG TGTTCGTCTT CAACCGCAAT CGAGCTGGAT TCCCGACTCG ACGGGTACAA ATCGTTCGAG ATGTTCTCAC AGTACTCCGG CGCCACCGTA AGTGACGCGA ACTCGATTGA CTTTTGTCGT GGAAAGTCTG CCGGTGCACG TGATGTGTAC GTGTCCGATA CCACAGGTTC TCTTCTCGAC GGCGTTGCCT CTGCTCCTTC TACTCTGATG GTCAACTCGC CGGAGCTGGC AAGCTTTGTC AATCCCAGTG CATGTACCGA AAACGCAGCT CGATGCTACA CCTACTGTAG CAACACTTGC TTCCGCACCG TTCACTACTA CGTCCCAGTG GGCCAAAGTC GAGACTACAA GCTCAAAGTA TGCGACCGCA AGGACGCGTC CGACTGTACC GTACTCTCCG GTTACGTACA CTTTAACCAC CCCTGGCCTC GCCGATTTGC CGTGCATGTC CCGTCAGGAC GGGAGTACGA CACTTACTTC CTCGACAAGG GAGTGCCTGT GTACCCAACG AATGTCGAGA TTGTGTTCCA GGAAAAGCTT TGCCCAACGG CACCGGATGA TGATGATATC GCTCTTCTCT ACAAGGCTCG AGGTACCACC TTTCCCCCCA CGCCTAGTCC GACGACAGCT CTCGCCGCAT GTGGAAACTT GATCGCAAAC TCGGACTTTG AGCGTGGTTA CAACGGATAT TGGGATGCAC AAGGAGCCGG TACATTGTCT ACCACAGCTG GCTATAAATC TGCCACAGCT ATGTACTACG CCTCTGGTAA TCGCAACCGG TACTGGGTGG GACCATCACA CCAATGGCGA GAAGGTCTGG ACTTGAAATG TCTAAAGCAG GGTACCACAT GGGAGTTTTC TGCTCGCTTG AAGCTTGTCG ACTCAACGAC TGGAAGGGGA GCCTCATGTA ACACAGGCTC GTCCTCGGAA GGCGAAATGT GCCCTCAGGT GCAGCTCATT GTGCGCGACC AATCTTGGAC TCAGCATTCT TTCCGGATCA GCGGCTTCGA CGGTGGGGAC ACTTGGGTAG CCAATGGATT TAACGAGTTC AAAGGCTATT GGACAATCCC AGCGAATGGT TCAGGATGGC GAGGGGGTGT CGCAAACATG CGAGTGATAC TTTCCGAATT CCCACTTGGT ATGGATTTGG TTGTTGACGA TTTTGAGTTG ACCCAATTTG TTTAAAGCGA CGTACACGTT ACCGTTGGAT CGATTTTGCT GGAAGGGACC ATCCGTGCAT GCCAATGTTG GTTTCTTGGA ATTTTAATTT GAGGTACAGT TAAATCTGGC GATCGATGAT TTCGGTTGTG GCCTGAGCAA GACAACATTT TCGGATTCCG CCCACAGCCG TACCGTGGTA GAACTACTTG CTTGAAGCTC ACTTGCGTGC AATAGCATTA ATCCAAGTAT AGCAAATTCC TTATTTGTAA GTTGGTTCAA GATAATGCTC TACCAAAAAA AGTGCAGGGG CTCCTTCCGG ATCGTCAGTC TCCTCATG
|
Protein sequence | MKVAYIGCLL LALFADEGEG ASVRGSKDDV VAVEAPPRVR HSSSTALVSV PQRRLAEDAF EPLTCNSSFS SCIPWTSRWG RSAVKTTLIV IPCGQCIVMN LDSPKLTLQE GIDIRGKLVF PDRYELTVET PEVVVQGELE MRSSKLVDGS PAIKFVLYGN GDRHFDPVNN NRNACGGSSC NGGARPITVA GGKVNLNGLP TNTPTWLHVY DVIGSSAIVV SNSVRNKWGA GATIVITSEH QGYFGEQVRK ITSISNVGSN SVRLNLDQPI NRPVTLRDSP DFATEVALLS RNIVFEGAPG TKGGHFWIMH TPRVRQRIEG VELVNFGQEG LLGRYPIHFH MCGDVSGSVV VKNTIRNSNQ RCVVVHGTNN LLVQENVAYF TKGHCYMLED GIETGNQFIR NIGIRTIKAK VTIPNMGSNG RESDRSASTF WITNADNSWI GNVVAGSEAL GFWFELLVRG NLANEHQDFD PMMVPTRKFE DNVVHSVFGV GMTYYLSGYI PETLQYFKNN KFFRNHHLAL RIHRTRNIVL TGNKFSDNRY AIQIDRDEEI HVTDTTIVGY SDLFKDVVRR NRFAQAPCAQ GISFQSTNPW KDKMDSELNG VILDKVRFSG FSNAVCSSST AIELDSRLDG YKSFEMFSQY SGATVSDANS IDFCRGKSAG ARDVYVSDTT GSLLDGVASA PSTLMVNSPE LASFVNPSAC TENAARCYTY CSNTCFRTVH YYVPVGQSRD YKLKVCDRKD ASDCTVLSGY VHFNHPWPRR FAVHVPSGRE YDTYFLDKGV PVYPTNVEIV FQEKLCPTAP DDDDIALLYK ARGTTFPPTP SPTTALAACG NLIANSDFER GYNGYWDAQG AGTLSTTAGY KSATAMYYAS GNRNRYWVGP SHQWREGLDL KCLKQGTTWE FSARLKLVDS TTGRGASCNT GSSSEGEMCP QVQLIVRDQS WTQHSFRISG FDGGDTWVAN GFNEFKGYWT IPANGSGWRG GVANMRVILS EFPLGMDLVV DDFELTQFV
|
| |