Gene Information |
Locus tag | PHATRDRAFT_44433 |
Symbol | |
ID | 7197674 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 534872 |
End bp | 536795 |
Gene Length | 1924 bp |
Protein Length | 449 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178246 |
Protein GI | 219114901 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
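The coordinate fields above are consistent with 1-based, inclusive positions: End bp minus Start bp plus one gives the stated Gene Length. The sketch below checks that arithmetic and estimates how much of the span is not protein-coding. The function names are hypothetical, and counting a stop codon in the coding length is an assumption rather than something the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(534872, 536795)  # Start bp and End bp from the record
cds_len = coding_length(449)            # Protein Length from the record; stop codon assumed
non_coding = gene_len - cds_len         # introns and/or untranslated sequence
print(gene_len, cds_len, non_coding)    # prints: 1924 1350 574
```

The surplus of 574 bp fits the "Is gene spliced | Yes" entry: the genomic span is longer than the coding sequence alone, which leaves room for introns and untranslated sequence.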
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.580101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGGAACGCG TAAGGTCGAG ACGGACTATC GCGTCAAGAG CGTACCTTAC GTTTCGTTGC GGCAAGTGTC AATTGACACT TTGCAACGTT TTGAACTAAA CTTTGTTGTA CTGTTTTGCC GGTCTTTAAT TGATCGTGTG GGGAAGAGCA AAAGTGCCTG TGCAGAGTGT ACAGCCAAGC TACCAACAAC CTGAAGGTTG CAACAATCCG AACTCTGGCT TCGTGCTGAG CACTTTGTAG TCAGTGACTG TACCAAGGCG CCATTCCCTC TTGCACCTCA AGGTTGCCGA AACGCTCTAG TCTTTTTCCG CTTTGATCAT CAGTAGTGAT AGTATGGGTT TAAATTGTTC CAAATGCGCT GCGTTAGAAC CCGCCGACAG CCAAATTTCC AGTACGCGAC AAGAATCACC AGACCCGGCC ATGTACGACT TGATGCGCCG TTCGGTCGAG TACACGTCAC GGCCTCATCC TGGTACTGTG GAAGTAGCCA CACCGGAAAA GACGACTGCC TCCACGGAAA CAGCGAGCGC CGCCGCGGCT GGACAGTTCC ATCTGCTGTA CAAGCAGCAA CACAAGACAG CCAAACTTCC GCATGAACTA GAATCCCTAC AAAGTCTCAA GAAGGCTACC AGGAAAACCT CTTCTGTGTA CTGTCCTACC CGCTTGCTGG TTCGACGACA TTCTGTACGT CATTACGAAG AAGGCATGAG TGGAACCCAT ATTTACGCGA TTCGCTATCC CATCGATAAG CCTTTGGAGT TGCGGGGGTA CAAGGGAGAA ACAACGACTG TCACTACGGG ACAGGAGCAA GCACCCCATC GCCACAAGAG CCCAGCCTTG CAGCAGCAGC AGAAGCAGCT CTACAACTCA ACAGCAACGT CAGCCTCCGT CACTACGGCT ACGAACAAAG CAAGTATCGC ACAATCAAGC TCTTCAAACG CAGCATCAAG CGGGAAAAAA ACCACGCCGG TGGGACCGGT AGAACCAGCA AGCGACAAAT GGCCAGCTTT GAAAGATAAT CCGGAACAAG GAGCGGTGGA TCAGTTGTTT AACCCCATTA CAATGCTAGC ACCACCGAAA CGACGAGGTA CGTTTATTGG TCGAGCACTG AAGTCTCTAG TTTCTCGTTC AGCTGTTTCT CTTTTTTCAT GTACCATTGA CTAATTCGTT GTATGTACTT ACACCTGAAA CCTTCAGTTG CTGAATGGGT GACTATCAGC TCGGAGCACA ACATTTACGT TGCCGACGTT GGTTTAACTA TTGCCGAGTG CGATCGCCTG GTCCAAGTGA CGGAACAGGT TTGTCGTGGC CAGTACGCTG CCTACACGTA CGCGAAGCAA ACGTTGGGAT GTCGGGAATT CCCACCGTTG GCACACGCGT GTCAGGATGC GGTCCACGCC GTCACGCACG CAATTTTGGA GTATGGGAAA AATTCGGCGC TCGCCTTGGA TGATCGGGAA CCTCACATGG TCAAATACGA TGTCACCAAG AAAGAACGTC AAAAATTGGA TATGCATACA GACAAAAGTG AATGGACCTT TTTGATTGCC TTGTCGAACG GATCCGGTTT GGATTACGAA GGTGGCGGGA CGTTCTTTGA GTGCCTTGAT TCAACGGTGC ACGTACAGCG CGGTCACGCC TTGATTTTTC CGGGCAAGCT TCGGCATTGT GGGCAGCGCA TTACATCTGG ATTACGTTTT TTGCTGGTAG GATTCTTGGT GGACAAGTCG ACGCCTTCAT CAAAAGAATC AAATGCGACG CAGACTACAA CATCGACGAA AGAGGATATT TAGATTGCCC 
CCCATGTGGA CGGCTGAGGT GACGTGCATA CACTGTGAAA AGAGCAACGT GATGAATTTG GTAGAGAGGT GCGGCTGCGT CTCTTGAGGT ATTCGTGTGA TCATTAGAAC ACAAAATCTT TTGT
|
Protein sequence | MGLNCSKCAA LEPADSQISS TRQESPDPAM YDLMRRSVEY TSRPHPGTVE VATPEKTTAS TETASAAAAG QFHLLYKQQH KTAKLPHELE SLQSLKKATR KTSSVYCPTR LLVRRHSVRH YEEGMSGTHI YAIRYPIDKP LELRGYKGET TTVTTGQEQA PHRHKSPALQ QQQKQLYNST ATSASVTTAT NKASIAQSSS SNAASSGKKT TPVGPVEPAS DKWPALKDNP EQGAVDQLFN PITMLAPPKR RVAEWVTISS EHNIYVADVG LTIAECDRLV QVTEQVCRGQ YAAYTYAKQT LGCREFPPLA HACQDAVHAV THAILEYGKN SALALDDREP HMVKYDVTKK ERQKLDMHTD KSEWTFLIAL SNGSGLDYEG GGTFFECLDS TVHVQRGHAL IFPGKLRHCG QRITSGLRFL LVGFLVDKST PSSKESNATQ TTTSTKEDI
|
| |
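The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way such a field might be normalized and sanity-checked: it strips the whitespace, computes a GC fraction, and translates a short stretch of the gene sequence starting at the ATG that matches the protein's N-terminus. The helper names are hypothetical, and the use of the standard genetic code is an assumption, since the Translation table field in this record is empty.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumption), codons ordered T, C, A, G at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; a trailing partial codon is ignored.

    Raises KeyError on characters outside A, C, G, T.
    """
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Fragment copied from the Gene sequence field, beginning at the first ATG
# that matches the start of the Protein sequence field.
fragment = clean_sequence("ATGGGTT TAAATTGTTC CAAATGCGCT GCGTTAGAAC")
print(translate(fragment))  # prints: MGLNCSKCAALE
```

The translated fragment reproduces the first twelve residues of the protein. Because the gene is spliced, translating the full genomic sequence in a single frame would not be expected to reproduce the whole protein. Applying `gc_fraction` to the cleaned Gene sequence field is one way to check the GC content entry.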