Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38186 |
Symbol | |
ID | 7203085 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 279673 |
End bp | 282242 |
Gene Length | 2570 bp |
Protein Length | 682 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182192 |
Protein GI | 219123772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.434067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAACC GGAACCAAGT CTGGGTAGGG GAAAACCCTT GTTTGATTTC GGGCACTGTG GAGTCCGCTC GTCGGGAGTC GTTCGGAACG TCGGGTAGAG TCAAAGACTC GCAACGCTCC GTCGGCAATC TATGGATAGA TAATGTGACT AAAAGACTGC CTGCCTGACC GACTGCCTAC GAGAGGGAGT TATTACCAGT GAGCAAGGAA GAAGTAGGAA CAAGGGCGTT GCCTTTCGTT GTGGCACCAT TGTCGGCGTA TGCTCGGTCT TTCGAAAAAA ACGCACACCA CGGACCATGG CGAATTTATA TATGTGTAGC ATGTATGTAC GGTGTGTACG ATACTTATGA ATATATATAT ATTTGTGTGA TAGGTACGAC GTCGACACTA CCTACTAGTA CTGCTAGCTA ATTGGAAAGA GGCTCTCAAG AACGACGGAC GAGCTTTGTG TGAACCCGAC CGTAACGGTC GGGCGTGGCT CTCGTGCCAC CCACCCACAC GACGCTCTCC CGTGCCTCGC CGATGCTTGA CAGTTACATC GGTCGGTCGG TCGTAGGTAG TATGTGACTG GCGTGAGTCG GAAACATGAC CGAATCCCAG AAACGCACGG TTCCCGTCCT TCGCTGTTGC TGCCCCATCC GTACTCGCAA TCCAATCCCG TAGTCACCGT CGATCCGTCA GTCAGTCAGT CAGTCATTCG TTCACACGGT GTCCCGCCCA AACTTGTCTC GTGTGTCGGT CGGATCCGCG CCCGAATGGT GCGTTTCGGG ACCCATGGGG GGAGGGGAGA AACCGATGGG CGTGTCCAGT GGGAAGAATC GTCGCGTCGC GTTGCCAGAA CCATCGACAT AACCAGCACT ATCAGCAAGC AATGCAGACA ATGCTCTCCA CCAAAGACAG TAGCGTGAGT CCCACCTCGT CGCAATGGCT GGAATTGCAA CGGGGTGTGG ATCAAGTCGT CCGGTTCCTC CATCCACAAT CCCCGGCTTC CAAACCGGCC GCACGAGTGT CCCCGCACGA ACTCCGTCGC GCCTTGCGAC GCATCGATCG ACTCTTGACT CGGACTTTGG AACAAGTCTC GTTGATGGCA TCCGTATCCA CGGTACCACA CCATCCACAT CGGCACGGTG CCGCGTACGC GTACGCTCCG CACGTCGTGC GACTCGTGAC GTCCCTCCAC TCCCACGACA CAAAAAACGA CGTCCGTCTT GACGACACGC GGATCGGATC TACAAGGACT CTACACGTCG ATTGGAACCA CCAGTGCGCA ACTCTTGATC ATGGTCCCGG AAGAACGTGA CCACGAGGAG GACAATGCCG GGGCTGCGGA TGAAGGCAAC GAAGCGTATC ACGTGTCGTC CCACGACTTG ACTCTCCTAC TCGACGCACT CCTTCGCACC GTAGCGACGT CCGCGTACAC GGAGGACTCG GTCTTGCTCG CCACCAGTCC GTGCGGATGG TTGGTAGATC TCGTCAACCA CGTGTGGCTA GCGGAGGAAA CCGTGCCGGC CGTGGCGGAC GACACCCAGT GGCACCGGTG GTTGGAACTC ATGCTCCCTC TCGCTCTCCG TTCCTGGCCT ACCGCCAATG ACCCGTCCTG GGAAGCCGTG GTAGGAATCC AACCCGACGG CCACAACCCA CACTCGGAAT CCGCCGCCAC CGAAGCCGCC TTTCAGTGGC GTGTCTTGAC CGGTCTCGTT CTGGAACAAG TCGTCTGGTC CCACCACACA CCACACGTTC CGGGTACGCG CATCGGAGTG CATCACATCC AACGTCACGT CCAATTAGCC GTCGAATACG CGTTGGACGT GTTGGACAGT CTGGGTGTTT GGTTGGATGC CGGTACGGCC GTCACGTCGA ATCACGCACG CGATCGGCAG CAGGCCGCCG TCGTCGTCCG CGACGGCTTG CGGCACGTCA CGTACGCGAC CAATCTACTC CGCACCTGGG AAGCCTACGA ACCCGGTACC GAATGGCCGG AACGCACCGT GTCGAACAAG ACCCTGACGT GGGGACCCTT GGTTCAGACC ATGGCCACGA GTTTGGCCCA ATCATTGAGT CCCGGTCTCG TTGGTGGGTA CGCAAACACG GACGACCCGA CACTACGGGA ATTGGAGGAA TTGGAAACCT GGATTGTTCA ACATTTGTTG CGCTTGACGT CGTTTGCCTT GGAGGGAGTG CGGGACCATG GCCACGTTTT GGAAGAGGAG GTTTGCTTAC TGGTCTTCTT GCGTTCCTTG TCACGCCCCG CGTTGCTGTT GTCTTCGGAC TTGTGGACGA CGCTGACGAG TCGGGTGGAC GATGCGGAAG TCCGTCGGGT ACTCCAAACG ACCTTGCTCG TCCTGCAGTC CTCCACGAGT TTGACCGAAC CTTTCGCCAC CGCGAATAGT ACCCACAATA CCATGGCCGT GGAAACGACG TCGAGCGCCG GACACGGGTT GCAGCACCAA TTGTGGCAGC TTTTGCAAAA CGATAACAAT TATTCCGCTG CGGGGGCAAC CCCCAAGGCA GTCTCGGATC CGTGGGACAC GCTATTTTTA CGGTACCATA CGGACGATCC GGTCGTCTAG
|
Protein sequence | MANRNQVWVG ENPCLISGTV ESARRESFGT SGRVRRRHYL LVLLANWKEA LKNDGRALCE PDRNGRAWLS CHPPTRRSPV PRRCLTVTSV ETHGSRPSLL LPHPYSQSNP VVTVDPSVSQ SVIRSHGVPP KLVSCVGRIR ARMHYQQAMQ TMLSTKDSSV SPTSSQWLEL QRGVDQVVRF LHPQSPASKP AARVSPHELR RALRRIDRLL TRTLEQVSLM ASVSTVPHHP HRHGAAYAYA PHVVRLVTAQ LLIMVPEERD HEEDNAGAAD EGNEAYHVSS HDLTLLLDAL LRTVATSAYT EDSVLLATSP CGWLVDLVNH VWLAEETVPA VADDTQWHRW LELMLPLALR SWPTANDPSW EAVVGIQPDG HNPHSESAAT EAAFQWRVLT GLVLEQVVWS HHTPHVPGTR IGVHHIQRHV QLAVEYALDV LDSLGVWLDA GTAVTSNHAR DRQQAAVVVR DGLRHVTYAT NLLRTWEAYE PGTEWPERTV SNKTLTWGPL VQTMATSLAQ SLSPGLVGGY ANTDDPTLRE LEELETWIVQ HLLRLTSFAL EGVRDHGHVL EEEVCLLVFL RSLSRPALLL SSDLWTTLTS RVDDAEVRRV LQTTLLVLQS STSLTEPFAT ANSTHNTMAV ETTSSAGHGL QHQLWQLLQN DNNYSAAGAT PKAVSDPWDT LFLRYHTDDP VV
|
| |