Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41383 |
Symbol | |
ID | 7199174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 324127 |
End bp | 325911 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185310 |
Protein GI | 219130309 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGCTA CCAGGAATGA CAAGACGGCG GGAAGGCATG CCAATGCCTA CCGACGAGCG ACGCCCTATC GGACCCTTGC TTTGTGTGCC ACGTTTGGAG TTGTTTGCTT TTATCTGGGT GTCTTGCTGG GTAGTGTAAC TTTCGCATCA CACTCTTCTT GTTCGTCGGC GGAAGAGCTC AACACACGAG TAGAAGAGCG AGTGAACGAA ATATCTTCCG CCTGGAAGTA CAAACACAAG CTGGAGCAGG AAGATACCGC CGCACGGATA CCTGCCAATC TACACGATAT AGTTCAAGGC ATGACCCGCG TGGATCGCAA CGAGTTTGCG GCTCTGTTTC CCATGGGGGT ACCGTTGGAT CCGTCGTCGC CACAGAACGA TCAAGTCGTC ATTCTGCACA ACTCGCCGCG ATCGTTGCCC ACGGATCCGT TTGCGGCCGC CGAAGCCTCC TCACAAACCA CCATACCTTT ACTCAGCGCT GCCGATGCTA CCGAAAATTG TGATAATCTA CACGTGGTGC TCACGGACCA CAATTTGAAG CGTCGTCAAT GCGTGGCCCT TATGGGTCAA TACGAAGCGT TCCATTTGCA AAAGTTTATG CGGTTGCCAC AGTCGGGCAA GCTCGACCCA CACCTGCCAC TGCGGCTCGT CAATCGAGGA GCCCATCAAT CGGGACGGAA GTCAACCAAA ACGCCAACCA TGGAACAGAC AATGCAAGCG TGGTCGACCT TGACGCCGTA CTTGCAGAAT ATAACCCAGA CGTTGGACAA ATTGAGACCG ATCGCGGCGT CCGTGGCCGT CGATAACACG ATTGTCGTCA TGGTCTGTAA TCACGGACAG TCGGAGTTGC TGCTAAACTT CGCCTGTGCC GCCCGAGCGC GGGGGCTCGA CACGGCTTTA GAGGCCGTCT TGGTGTTCGC CACGGACGAG GAAACTCGGG ATTTGGCGAT CGGATTGGGC TTGTCGGTTT TCTATGATCC AGTTGTGTTT GGCGAAATGC CCAAGGAAGC TGCGAGGGCA TATGCAGACG TCAAGTTTCG GGCCATGATG ATGGCCAAGG TATACTGTGT ACAGCTAGTC AGCATGTTGG GGTATGATTT ATTGTTTCAA GACGTAGACA TAGTATGGTT GCGCAATCCG CTCGAATACT TTCACAACGA CACATCCAGT GCGAACGACG AGGTCAGCCC AGACTATTAC GACGTTTATT TTCAGGATGA TGGGAACCAC GCGATATACT ACGCGCCGTA TTCAGCCAAT ACGGGCTTTT ACTTTGTCCG CCACAACGAC AAGACTCGCT ATTTTTTCAA TTCGCTACTC CTCGCGGGCG ATTTGATTTT GACGACTAAA TCCCACCAAA TCCCGCTCGT CGCTTTGCTG CAGGAGCATG CCTCCATGTA CGGACTCAAG GTAAAAATAT TCTCGCGCCT TGAAAACGAC TTTCCTGGTG GTCACGCCTA TCACCGACGC AAGGACTTTA TGAAGGATTA TTTTGCCGGA CACGTTAACC CGTATTTGTT CCATATGAGT TGGACCAAGA GCAAAATCAA CAAAGGCAAG TTCTTTGAGC AAATGGGGGA ATGGTATTTG AGGGACACTT GTGCGCAAAA AACGGCGCAA CATATTCTTG ACCTGCCGGA TGGTACCAGC GTGAACCAAC AATCTTTAGT AGAACCGTGC TGCATGGCCA CATCGGTCGT CAAATGTCAT TTTCGAGACA AGGCAAGCAA AATCCCGTGC AACGACAGTC CAGCTATCGA CAGGAATGGC CGATCTTTTT GGTAA
|
Protein sequence | MVATRNDKTA GRHANAYRRA TPYRTLALCA TFGVVCFYLG VLLGSVTFAS HSSCSSAEEL NTRVEERVNE ISSAWKYKHK LEQEDTAARI PANLHDIVQG MTRVDRNEFA ALFPMGVPLD PSSPQNDQVV ILHNSPRSLP TDPFAAAEAS SQTTIPLLSA ADATENCDNL HVVLTDHNLK RRQCVALMGQ YEAFHLQKFM RLPQSGKLDP HLPLRLVNRG AHQSGRKSTK TPTMEQTMQA WSTLTPYLQN ITQTLDKLRP IAASVAVDNT IVVMVCNHGQ SELLLNFACA ARARGLDTAL EAVLVFATDE ETRDLAIGLG LSVFYDPVVF GEMPKEAARA YADVKFRAMM MAKVYCVQLV SMLGYDLLFQ DVDIVWLRNP LEYFHNDTSS ANDEVSPDYY DVYFQDDGNH AIYYAPYSAN TGFYFVRHND KTRYFFNSLL LAGDLILTTK SHQIPLVALL QEHASMYGLK VKIFSRLEND FPGGHAYHRR KDFMKDYFAG HVNPYLFHMS WTKSKINKGK FFEQMGEWYL RDTCAQKTAQ HILDLPDGTS VNQQSLVEPC CMATSVVKCH FRDKASKIPC NDSPAIDRNG RSFW
|
| |