Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45612 |
Symbol | |
ID | 7200653 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 711815 |
End bp | 713854 |
Gene Length | 2040 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179698 |
Protein GI | 219117821 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTGC GGCTCGGACT CGACTTCAAA GAACTGCATT CCTTCTTCCA AGAGGCGTTT GGTCCATTTC GTGAATACGC AAGAACTACG AATGAAATTG ACACAGACCA TGAAGGCGAG GTAGATAGAG GTTACTGTCA AAAGAACAAG ATACATTTGA CGACGGTAAA TAAGCGTTCT TACAATTTTA TTGATTTATT CATTTATTAT ATAGATAAAA TACACTCCCG ATAGCACGAT GACACACCTA GGGCTGGTAC GGATGGACTT ATAGTGAAGC GGTTATCACC CAGCACTTTG AATGCTGTAT CCCGGGTTCG AATCCCGGTA GGTCCTTGTT ATTTTTGCCA TTTTTAGCGA AGCTGACACA GTCAACTTTC AGAGTAAACT GTTTTTGTCT GTGATTGCTT TCTTGATGCC TCATCCTTTG ACAGTGAGTC GTCAAAGAAG GCCAGTAGCA ACAAGCTTCT TACACCATTT TTGGAAGATG AAACGAGCTC TAGTTATCTT TACTTTAGCT ACCGCTCGAA ATCCTTTACG TGTAAGGGGG TTTCAAGTCG CTTCCTTATT GCCAAAAACG AGCTCTGCAC GTCTCGATTG CTCCCACGTA GCCGCTTATC GAGCTTCTCC TCATTTGTCA CGATCAAGCT TTGTTTCCGG TTCGTTCACT GAACAGGACT TCCTATCGTT TCCTGATAGA AGACCCCCGC ACTTCATTTT TCATCGAGTT TTCACAAGAT TTTTTCACCT AATTTCTCAA CTGCGGAAAT CCTTTGCAAA GGTGGCTTTG GCGCTCATTA TTGCGTTTTC GCTAGTGTCA AATCCGGCCT TAGCCGTGAC AGGTGGGCGA ATGGGCGGAT CCTTTGGTAA ATCGAGCAGT GCTTCCATGT CTCGTCCCAT GACCCGTCAA TCGTCGCCAC GATCCGGCAG TCGAGGAGCT CCTCGAGTTG CTATTCACAA CCACAATTCG CGGCCCCGCT ACCGCTACCG CTCGACGAGT ATGTACGGGA GTTTCGATGA TGCACTCTTA GCGCCACGTT GCCACCGTCC CGTGGCTGCG AGATTTTCCG CTTCTGATGT TGTCTTGTTG ACGGGGACGT CCGCATTGAT TTTCTACGGT GTAACCAATA ATTTTAGAGG CAAACGTGAC GGTGACGAAA GTGCTCTGGG ACCGGGAGCA TCGGTTGCAT CCGTCACAAT TGCGTTGGAT GTTCCCAGTC GAAAAAATCC GAACTCAGTT TTACACAGAC TGAAGCGTAT TTCAGAACGA GCCAATATGG CATCCCGAAA AGGTGTGCAG GATTTGGTGT CGGAAGGTAA TTGGAATGTG TTTTTTGGCG GTCTTTCCAG TACTCGTTCC TTACTATTCG TCCATTTCTA TGCTCCCTTT CCTGGTTTTT GTCCTAATGT AAATAGTCTC CTTGGAATTG TTGCGGCAAG AAAAGGCAGT TTCATCAGGC AAGACGTACT CGCAGCATTT CTCGAGTGTC ACTCCAGCCG AGCGTGATTT CCAGCGACAA TCAATCCAAA GTCGGAGCAA ATTTGACCGT GAATCAGGTA AGTTGCCGCA AGCATTGCTA CGCGCCAGAA AGTATCACTA GTAGATGCAA AACTCATGAA GGCTTTTCTC GTTCGAAAAT GTACAGTGAA TAAATTTGGA TCCGAGGACC GTTCGGAACG TTCACAGGGC GCACCAACTT CGTTGTCCCT ATCGAAAGAC CGTTCTCAAG CCACGAAGGC AGTTGTGACA TTGCATTTGG CCATTGAGGG TGATTCAACT CTTCTACCTC CCGTGCGTAC CCGTACGGAT TTGCTCTTGG CGCTGAAGAA AATAGCGGCT GATGTGGTCG TGTCTGACTG CCTATTGTCA GCCGAAGTCT TGTGGGCGCC CGAGGAAGAC TGGGACATGC TTACGGAACG AGATATTTAT GCGGACTATC CCGATCTGAT TTCAGTGTAA TGGACAACTC GCAACAGATT ACTGAAAATC AATAGATATT GATACAACAA TCCATCCATG GGCGTTGGTA TACTAGGTTA
|
Protein sequence | MSVRLGLDFK ELHSFFQEAF GPFREYARTT NEIDTDHEGE VDRGYCQKNK IHLTTGWYGW TYSEAVITQH FECCIPGSNP ATARNPLRVR GFQVASLLPK TSSARLDCSH VAAYRASPHL SRSSFVSGSF TEQDFLSFPD RRPPHFIFHR VFTRFFHLIS QLRKSFAKVA LALIIAFSLV SNPALAVTGG RMGGSFGKSS SASMSRPMTR QSSPRSGSRG APRVAIHNHN SRPRYRYRST SMYGSFDDAL LAPRCHRPVA ARFSASDVVL LTGTSALIFY GVTNNFRGKR DGDESALGPG ASVASVTIAL DVPSRKNPNS VLHRLKRISE RANMASRKGV QDLVSEVSLE LLRQEKAVSS GKTYSQHFSS VTPAERDFQR QSIQSRSKFD RESVNKFGSE DRSERSQGAP TSLSLSKDRS QATKAVVTLH LAIEGDSTLL PPVRTRTDLL LALKKIAADV VVSDCLLSAE VLWAPEEDWD MLTERDIYAD YPDLISV
|
| |