Gene Information |
Locus tag | PHATRDRAFT_50416 |
Symbol | |
ID | 7199222 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 305040 |
End bp | 306926 |
Gene Length | 1887 bp |
Protein Length | 483 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185360 |
Protein GI | 219130412 |
COG category | |
COG ID | |
TIGRFAM ID | |
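
The coordinate fields above agree with the reported gene length if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


length = gene_length(305040, 306926)   # Start bp / End bp from the record
coding = min_cds_length(483)           # Protein Length from the record
print(length)           # 1887
print(coding)           # 1452
print(length - coding)  # 435
```

The 435 bp that do not encode the 483 aa protein are consistent with the gene being marked as spliced. The record lists no exon boundaries, so this arithmetic cannot say where the non-coding sequence lies.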
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0136502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATTTACAG TTATTGCGTT TCTTACAGTT AAGACCCCTA TTGCTGATGC TTTCGTTGCG TACACTCTCG GTGTATTCCT TCCCTGCACT GTCACCGACG AGCTTGCCAT CCAAAACACT CGTGTTGCCT CGTAACAACC ACGCGAAACG GTGGCTCGTG TTGCTCGGGA GGAGAGCCAA AGCATTATTG GTAGTGGTGT GTCGGGGAGT GCTTGACCTG GGAGTCTTCG ATCGCCTCCG TACTCCTCCT CGCACGCACA CCTCTTTGGG AGCGACTCTC TCTCTCTCTC TGTGAAGCTG TAGTGGAGAA ATGCTACCGT CCAATCACAA TCACACGGGG GTACCGGTGG TAGGGACCGG ACCCCCACGG AGGTCATCGT CGGCCAAGCA CAACCTACAC GGCAACGCCA GTCACCATCG CAACGGATCG TCTCTGCTTT CCTCATCCAC CATCCTCCTC GTATTGGCGA CGGCTATTCC TAGTTTCTTT CTAGGCACAC TTACTTGCTT ATTCGCCGGT ATTGATTGTC AACATCAGTC GTCGGCGACT GTGAACCTCC TGGAAGCCTG GCGCAGTGCC GATCGCGAAT CCACAAGCTG CACTGCCAGC AGTATCGAGC ATCAGCTACA GGCACGGATA CAGACACTAC AAGCTCAGTG GGAGGCCGAT CTAGAGACCA AAATACAACA ACGCACTCGG CAAATGGTCA AACCAGACGG CAGCGATACG TCCACGTCGG GGTACCATCG CGATTGGCTC TTTCCCTCGG ACCGGACCGG CCGCTTCGTC ACCGCCTTGG TGCGGACGCC CAAACTCGGC TTGGTCGATA ACCTCGATCT CGGAGTGCCT GTGGACCCAC CCGGCAAGGG CTACGAAGAT GTCCTGCTAC TCTACAGTCG CGAGTCCACC TTGCCACAGC CCGTCCGGGA CGATCCCACC CTGACCTTTA TGGCCAACAC CACCGCCGCA CTCGAGCACT GCGATTTCGT CAACGTCGTA CTTACGGAAC ACAGCGCCGG TCGGAAACAG TGTATCGCCG TCGTGCCGCA GTACGAATCC TACCACTTGC AAAAATGGAT GCGGGCCCAT CCGCAATCGG GAAAATTGGA CACTGCCGTC CCGCTGAGAC TCGTCAGTCG CGGACACGCT GATAACGGCA GACAAAACTT TATACCACCG GCGCTCGATG ACGCCCGACA GGCCTGGGAA GTGCTGAAAC AGTATCTGGA TAGTGTCGAT GCGGTCTTGG AGGAACTGAA ACCGCTACTG GAAAATATTG CGATTGAGAA CACCGTTATC GTCATGGTGG TGAACTTTGG TCAAACCGAG TTACTCATGA ACTTTGTATG CGCCGCCAAA TCCCGATCAT TGGATTTGTC CAACGTCATT GTCTTTACTA CCGATCAAGA GTCGACCGAT CTGGCCACGT CTTTGGGACT GACCGCTTAC TATGACCAAC GGGTACGTGT TTTGCTCACG GGAGGACCGG TTGACGGAGG GTAGGGCGGT GGTGGATACT GATCAAATAA GTCTCACACT CATTTTTAAA ATATTGATGT AGAACTTTGG AGAAATTCCC TCCGAAGCCG CCCGGCGGTA CGGTGACCGC CGCTTCACCG CCATGATGAT GGCTAAAGTC ATCTGCGTCC AGCTCGTCTC CATGCTCGGT TACGACTTGC TCTTCCAAGA CGTCGACATT GTTTGGTTTT CCAACCCTCT CGAATACTTT GCCCACGCCG ACCCTGGCAT GGATATGTTT TTTCAAGACG ACGGCGCCCA TTCTACTCGC TACGCTCCGT 
ATTCCGCCAA TTCTGGACTG TACTTTGTCC GCCACAACCG TCGCACCCGA CACTTTCTCA CCAGCCTACT CATGGTGGGC GACCTGA
|
Protein sequence | MLPSNHNHTG VPVVGTGPPR RSSSAKHNLH GNASHHRNGS SLLSSSTILL VLATAIPSFF LGTLTCLFAG IDCQHQSSAT VNLLEAWRSA DRESTSCTAS SIEHQLQARI QTLQAQWEAD LETKIQQRTR QMVKPDGSDT STSGYHRDWL FPSDRTGRFV TALVRTPKLG LVDNLDLGVP VDPPGKGYED VLLLYSREST LPQPVRDDPT LTFMANTTAA LEHCDFVNVV LTEHSAGRKQ CIAVVPQYES YHLQKWMRAH PQSGKLDTAV PLRLVSRGHA DNGRQNFIPP ALDDARQAWE VLKQYLDSVD AVLEELKPLL ENIAIENTVI VMVVNFGQTE LLMNFVCAAK SRSLDLSNVI VFTTDQESTD LATSLGLTAY YDQRNFGEIP SEAARRYGDR RFTAMMMAKV ICVQLVSMLG YDLLFQDVDI VWFSNPLEYF AHADPGMDMF FQDDGAHSTR YAPYSANSGL YFVRHNPYSW WAT
|
| |
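
The GC content field can in principle be recomputed from the gene sequence, once the space-separated 10-base blocks are joined into one string. The sketch below assumes a plain A/C/G/T alphabet and uses hypothetical helper names. It runs on a toy input, not on the record's sequence, so it makes no claim about the value it would give here.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input in the same 10-base block layout (not the record's sequence).
toy = clean_sequence("GGCCAATTGG CCAATT\nGGCC")
print(len(toy), round(gc_fraction(toy), 2))  # 20 0.6
```

To apply this to the record, pass the full text of the Gene sequence field (including its continuation line) to `clean_sequence` and compare `gc_fraction` with the reported percentage, allowing for rounding.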