Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39425 |
Symbol | |
ID | 7194943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 493277 |
End bp | 495181 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183365 |
Protein GI | 219126231 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.292382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGC GTCCTTTATG GGTTGTCCAG ATTCGACGAA GAACGATAAG GCTCGTCTCG TTTGCCGTAT CGTGTGCACT TTTAGCAGTC GCCATCGGCT CAATGGCACT CATGTTCCGT TGGGATGGGA ACGACCTTTC GCTTGGGGCA GAAGTAGATC CATTCAAGAT TCAAATCAAT GCAAGAAAGA GTGAAAGCGT GACCTTCCTT CAGAAAGGTG TCCAAATGCC CGTCAAAGAC TTTGCGGTAC ACGGAACAGT CATCAACAAA ATTACAGATG CTGGTATCGC TACCACCCAA TTCGAGAAGT GTTCTGAGGG CAACACCAAG ACTTGGCTGG ACGGCTCCAG GATTGGAAAC ATTAACGAGT CGTTGCTGAC ACTCGACTTT GTTCGCCAAA ATATTGTGGA TTCGCCAACT GCTCTCGGGT CTTATGATGC TGTGTCCATA ATCCTAAATC AGACCATGAG TCATACATCT TCGCGATTCC GCAACTATCA CGATGCCTCC ATCACTTCGG CCAATATTCG AACATGGGCT GTCCGTCTCG TGTACTTATC CCTACATTAT CATCAACACC GGCATGCTTT ACGGGAAGCA CGCATACGAG CCTTAGACGA CGCATGTTAT CCAGTTATGC TGGACAGAGG CGTCGGTCTG TACGACTACG AGTGTCCCTC AGCAAAATAT CTAGTGGTGG GTCTTGCCAA ATACGGATTG GGTGCCAACG TTCGCGCGGG CGCGGTCAAG GCACTCGTAG CCGGGCTAGC GACCGACCGA GTCGTTCTCT TTGTCAACAA TGTAGCGGAT CTCGAGAGTG CTACCAAAGA CATGGGTCCT TGGTCGTTGG CGTCGTGTCC ACGTAGAGAT CATCAATGCT TCTTTTTGCC CCTAAGTCCG TGTGTGCTGC TGCGTGAGGA ACTGGCTGCC GCACACATTT TGGACAAGGG CGAAATGAGA GCTTTGTTCA AAACTGGACG TGTTCCTCTT GGACGAATTG ACGATCGAGT GCTGGTGCTG CATCTCCATT TTATGCCTCA GCTAAAGTTG CCGGGGAATA CCGTTGAGCT ACTTCAAAAC TACTCATATT CCCTGATCGA GGGCGTTTTA TCAACCGATT CTAGACGGCC AGTCATGGAG CAAGCAATTG CAAGTTTTAC TCTACCAGAC GAGCCTCGCC CCGCGGGATA TAATTACGCT TTAGCGAACA GCCGCATTCA GCATGCGCTT ATGTTCTACA CGATGCGACC CAATCTGCAT AGTCGCAAAC GAATAGACGA GATTTTAAAG GAAATCATTC CCGCAAAGAC TCTCCCCGAG AGATCGGTTG GCATTCCGAT TCGAGCTTCG GATAAGTGCC GGCAGGAAAG CCAATGCTTA TCCTTTTCCC AGCACATAGA AGTTTCCAGT CTGCTTTGGA ACGACGTTTA CCCCAACACA TCCCACTCGG ATCAACCCAC ACTTATTTTT ACAACGGAAT CGAAAGACAT GATGCATGAA GAACACGCAT ACATTTCCAA CACCACAATC CCCAAACCCT TTGCGAATAT GAACTTGCTC ATGAATCCTC ACGATGTGCT ACCCAATACC GGGTTTGTTG AGGAAATCGT CGCGAAAACC ACCAGCTTTT CGGCTGACGA TGCAATGCTT TCTGCCGTCA CGTCATTACA ACTCCAACTT TTGGCCAGAT ACACCATTGC CAACTGCTGC TCTAACTTTC ATACACTTCT GGGGGACTTT CTAGTGGAAG GTATGGGAGC CGCACATCAC AACGATTTTT CCTGCTTGCA GGAGCACTCC GATCCAGTTT ACGCCATTTG CTGTGGCTGG CACAAGGATT GCAAACAAAA GCGAGCGGCT ATCCTTCTCG CCCGAAACTC TACAGATGAA ACGATGAAGG ATTGA
|
Protein sequence | MKKRPLWVVQ IRRRTIRLVS FAVSCALLAV AIGSMALMFR WDGNDLSLGA EVDPFKIQIN ARKSESVTFL QKGVQMPVKD FAVHGTVINK ITDAGIATTQ FEKCSEGNTK TWLDGSRIGN INESLLTLDF VRQNIVDSPT ALGSYDAVSI ILNQTMSHTS SRFRNYHDAS ITSANIRTWA VRLVYLSLHY HQHRHALREA RIRALDDACY PVMLDRGVGL YDYECPSAKY LVVGLAKYGL GANVRAGAVK ALVAGLATDR VVLFVNNVAD LESATKDMGP WSLASCPRRD HQCFFLPLSP CVLLREELAA AHILDKGEMR ALFKTGRVPL GRIDDRVLVL HLHFMPQLKL PGNTVELLQN YSYSLIEGVL STDSRRPVME QAIASFTLPD EPRPAGYNYA LANSRIQHAL MFYTMRPNLH SRKRIDEILK EIIPAKTLPE RSVGIPIRAS DKCRQESQCL SFSQHIEVSS LLWNDVYPNT SHSDQPTLIF TTESKDMMHE EHAYISNTTI PKPFANMNLL MNPHDVLPNT GFVEEIVAKT TSFSADDAML SAVTSLQLQL LARYTIANCC SNFHTLLGDF LVEGMGAAHH NDFSCLQEHS DPVYAICCGW HKDCKQKRAA ILLARNSTDE TMKD
|
| |