Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37061 |
Symbol | |
ID | 7202087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 76247 |
End bp | 77827 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181132 |
Protein GI | 219121560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.38015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGACT TTATCGCTCC CGATAACTTT CCTCCCGACT ACCCTATTTT GGATACTACC AACCCCACTG CAACCATTGC AGCCATTCCA GACCCACCTG ATAACGTGAA TATCAGCTTG GATATTCCAG ACTCGTTGAA AAACCTTCTT TCGAACGTGT CAAATCCTGA GGCCGCCTTT ACTGGAGCTT ACTATACGCG GTACACCCAT GAGTTTCGTG TCTCTCTCAC TTCCTCTGAT TACTATACCT ACCAAACCCA GGCATTCAAG GGAGTCCTCA AGGAGTTCAA GTTTGACTCC GACAACCCCA TGGATACACT CACTAAAGCA AAGTCTAACA TGGAAGAAGC TGCCTTTGCC ATTACCGCAA AAGGGATCAT CAATCGAAAG AATAAACTTG AAACCTTTCT TACCGAATTT GGACTCAGAG CCCCTTTTGA CACAATCTAC ACCCAGTGGA AGTCAACCTC CCAGGGACTT ATTCCTGTGT TCTCTTCCCA TAAGAATCTC TTTCAAGATT TCCATTCCAT TGTGCTCTCT GATGTAGTGA ATACCGTTGA CTTCATGCAA CGTTACACCA ATATTGCTCA TCCCACCCTT GGAAAAATCA ACGAGGAACA TTCTCGCGAC TACTCAATGT CTGGCACGGC AATTTATAAT TCCTGTGATC TCTCGTTACA GTCCTGGTTG GACACTCAAA TTGGCATTAG CACCGACACT ACCCTGAAAC GTCATGGCAG CTCCGGTCCA GTTCGATTCT ATCTCATTTG GTCGCGATAT GCCAATGTTG ATGGGGCCGT TGCCACATCA ATCCAAGCCG CTCTCACCAA GTTACGTGTT CGCGACTTAC CTGGAGAAAA TGTTTCCCTC TATTTTGACA CGGTCACCAT CATTGAGGAA TATCTCAGAT CAATGGGACG TACAATTCCA GATTTTGTAT CCCATGTTAT TGACATTCTT GTCGATGTTT CCGTTGACGA TTACTCCCTG TTCATAAAGA CCCAACAGTT TGTTCGCAAT CCCTCTCTTC ACAATATGCA TTCACTTCGC CAATTGGCCT GCGATCAATA CCAGCTTCTT TTGAACTCTG GCAAGTGGCA TCCAACTGCA AAGACAGGCG CGGCCTTCCA TGCTGCCCAC CACATGTCTA CCCATGCACC GGATACCACT CCAAGTGCTC TCGTGAATTC TGGTTCTGGT ACTCCCAAAC CACGTCTTTC TCGAGAAGAG TGGGAAAAGA CCATTGACCG TTCCCCTCCA CCCGCCGGAT CATCAGATTG TCGCAAGTCC ACAAAGGGTG ACTTTAACGA ATATTGGTGT GCCACCTGTA ATTGGTGGGG CAACCACCCT ACCAACAAGC GCCACCATCC CACCGCTCCT ATTGACCACG CCGGTTTTCT CGAGAAACGG AAGAAACGCT TTGCTAAACG TGACCCCTCG GACTCCACTC CCTCTGTGAC CGTCAATAAC AATTCAACCA CACCACCATC AGGAGTGAAC TCTTCTGGCG CCCTTCAGCT CTTATGTTCC TCCGCCTTGA CCCAGTTTCA CTCTTTTGGT GCACCCCCTT CAAATTTTTA G
|
Protein sequence | MSDFIAPDNF PPDYPILDTT NPTATIAAIP DPPDNVNISL DIPDSLKNLL SNVSNPEAAF TGAYYTRYTH EFRVSLTSSD YYTYQTQAFK GVLKEFKFDS DNPMDTLTKA KSNMEEAAFA ITAKGIINRK NKLETFLTEF GLRAPFDTIY TQWKSTSQGL IPVFSSHKNL FQDFHSIVLS DVVNTVDFMQ RYTNIAHPTL GKINEEHSRD YSMSGTAIYN SCDLSLQSWL DTQIGISTDT TLKRHGSSGP VRFYLIWSRY ANVDGAVATS IQAALTKLRV RDLPGENVSL YFDTVTIIEE YLRSMGRTIP DFVSHVIDIL VDVSVDDYSL FIKTQQFVRN PSLHNMHSLR QLACDQYQLL LNSGKWHPTA KTGAAFHAAH HMSTHAPDTT PSALVNSGSG TPKPRLSREE WEKTIDRSPP PAGSSDCRKS TKGDFNEYWC ATCNWWGNHP TNKRHHPTAP IDHAGFLEKR KKRFAKRDPS DSTPSVTVNN NSTTPPSGVN SSGALQLLCS SALTQFHSFG APPSNF
|
| |