Gene Information |
Locus tag | PHATRDRAFT_45119 |
Symbol | |
ID | 7200313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 257247 |
End bp | 259321 |
Gene Length | 2075 bp |
Protein Length | 591 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179167 |
Protein GI | 219116745 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.170034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTGGGAAA GCAAAGGCCA TTGAGGTCCA CACCACAAAA ACCGCTCGCG ATGCCTTCGA TATCAGCTAG TGTAAAGCTG TCGATTCTCT TAGCATGTAT AGCTTCGTTG ACATCACCGG CTCTGAGCTT TACACCGCTG CTGTCGAGCC TTGCTGCTTC GGTTTCCAAG CTACAGGCTC CCCGAAGCTC CGTCTGCTTG AGCAAGCGTT CGTCTCCTTC TCGATCCAAC GGAGCGCTGA CCATGCACAT GGGTCACAGC CATAGTCATC ATATTCACAC TCATACCGAA GAGCTCAAGA ACAAGCAACT ACAAACGAAA AAACGCCGAC GCCGTGCGGC ACTCTTTCTC TTCACGGCCC TCGCTATTCT GGGACCGCCG CTATTTCGTC AACGCACGCT AACACGGACT GATGTGGGGA CGTTTCTGTT GACATCCACT ATCCTTACCA TGAGTGACAC CGTACGTCGT AAAATTCAAC ACGCCATTCG GAAAATAGGA CAATTGCGTG AAGGCATTGT CAAGCATTCG TCACCCGTAT CTCCGGGAAA ATTTCTCAGT TACCTTTTCC GGAACGACAA CGCGGCTGAT CGGGTAACGC TTTTGGGCAG TGTGATCAAC TTATTTTTGA GTGCGGGAAA ATTTGCCGTT GGCGTGTCGT GTCATTCGAG CGCTTTGATA GCGGATGCCG GGCATTCGCT GTCGGACCTC TTTAGTGACT TTATCACCCT GTGGGCGGTG CAAATCGGTC GTTTGCCTCC GGATGATGAT CATCCGTACG GACACGGAAA ATTCGAAGCC ATCGGCTCCC TGTTCCTCTC ATTGACCTTG CTGGCGACTG GAATATCCGT CGGCGCGGCT TCCAATCGGA AACTCATTGA AATCATCACT ATCCAACGAG CGTCCGGCTG GGGTACCGCG GCATCCCTAG CTGGACAAGT GCCTACTTCG CCCGCTCTGT TCATGGCTGC CCTCAGCATT TTTTCCAAAG AATGGTTGTA TCGTATTACA AAGCAGGTTG GAGATAGACT GAATTCGCAA GTGATTCTAG CGAATGCCTG GCATCATCGA TCCGATGCGT ACTCGTCTGT TCTTGCACTC TTGGCGATTG GGCTAGCGAT GTATTTTCCC GCGATGATTG CTGCGGACTC GGCTGCTGGT ATACTGGTGG CCGGGATGAT TTGCATGACT GGTGCAGAAA TTATGGGCGA ATCCATTAAG CAGTTGACGG ACACGAGCAA CGAAGCCCTC GTGAAAAAGA TAAAGCGTCT CGCGAAGGAT TACTCCGATA ATGTCTTTGA AGTAACACGA GTGCGGGCGC GACAAGTTGG CTCATCTGCA ATTGTCGATG TTGCCATTGC GACTCCGGGA GAGTTGTCGT CGTCTGCTTC TCGCGTCATG GCGGAAGGTT TGCGACGCAA GATTATGCAG TCCGTTGATG TAGTTGACGC TGAAGTCCAT TCGACTACGC ACGACACTCC ACTTTTGCAT GCCACCAAGA CACGAGAAGC CATGGCGAGT AATGGTGTGG TTGTCCAACC CAACATGAAC GACATCGAAG AGAAGGTGCG ACAAAACATC AATTCTCAGC ATCCTAAAGT TCGGTCGGTG CAGGGAGTCA CGGTTAAGCT TGCCGAGTCG TCGGCCCGCA GGAATAGCGT CGACGTGGTT ATCCGTGTGG ATCCCGAAGC AACAGTTGCG GCGGCGCACG CAGTCGCGGA AGATCTGAGA AAATCTCTCG AAAATATCGA TCACATCCAT CAAGCTAGTA TTTTTTTGGA CTTGAACGCT GAACTCATAT 
CGGTTTCTAA TGCCTTGAAA CCTTGAATGC GGAACAAATT GTCAAAGCCC AAACCCAATA TACTACCTAG CTAACAGGGA TACGGTTGGA CCACAAAGCA GATGGGAGGG AACTGCAATC AACTTGCGCA CTCGATAACA CTACACAGCA TTCACAGCTA TACCATCGCA TCCATATCAA CTGCATCCGT TGCATTGATG AAAGACTGTA GCGTCACAAT CTTCCCTTTA GTGAATATTT CAATTTTGTA CTATAAATTA CTGGTTGCCG CCTTC
|
Protein sequence | MPSISASVKL SILLACIASL TSPALSFTPL LSSLAASVSK LQAPRSSVCL SKRSSPSRSN GALTMHMGHS HSHHIHTHTE ELKNKQLQTK KRRRRAALFL FTALAILGPP LFRQRTLTRT DVGTFLLTST ILTMSDTVRR KIQHAIRKIG QLREGIVKHS SPVSPGKFLS YLFRNDNAAD RVTLLGSVIN LFLSAGKFAV GVSCHSSALI ADAGHSLSDL FSDFITLWAV QIGRLPPDDD HPYGHGKFEA IGSLFLSLTL LATGISVGAA SNRKLIEIIT IQRASGWGTA ASLAGQVPTS PALFMAALSI FSKEWLYRIT KQVGDRLNSQ VILANAWHHR SDAYSSVLAL LAIGLAMYFP AMIAADSAAG ILVAGMICMT GAEIMGESIK QLTDTSNEAL VKKIKRLAKD YSDNVFEVTR VRARQVGSSA IVDVAIATPG ELSSSASRVM AEGLRRKIMQ SVDVVDAEVH STTHDTPLLH ATKTREAMAS NGVVVQPNMN DIEEKVRQNI NSQHPKVRSV QGVTVKLAES SARRNSVDVV IRVDPEATVA AAHAVAEDLR KSLENIDHIH QASIFLDLNA ELISVSNALK P
|
| |
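The coordinate and sequence fields above can be cross-checked with a short script. The following is a minimal sketch, not part of the source record: the helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive positions. That assumption is consistent with the reported Gene Length of 2075 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Start bp and End bp as listed in the Gene Information table.
    print(span_length(257247, 259321))  # 2075
```

To check the full record, paste the Gene sequence field into `clean_sequence` and compare its length and `gc_percent` value with the Gene Length and GC content rows.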