Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45587 |
Symbol | |
ID | 7200369 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 625398 |
End bp | 627282 |
Gene Length | 1885 bp |
Protein Length | 548 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179882 |
Protein GI | 219118205 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTACGTCCAC CTCTCAGTTC AGTTTACTGT TAGCTCTGCC TGCACCATTC TAACCAATGG ACGCTCCGAA GCCAAAAAGT CAAGAAGTAG TGCCGCAGGA GAGCTCTATG CCTGCATCAC GTTCACTCTC TAAGGATCAA GAGAAAACGG TGGTTGAGTG CGCCAGGCTG CTGGGAACAA CGACATCGTC GTTGGCTCGC ACTGCCGTCC GGCCTAGGGA GCTGGCCCTC TTATTTTCCG AGGCTGTCAC AGACCATGAT GTGGTATTGA CACTCCCTTC GCTCGATTGT AGAGTAGGCA TAGAGAACGA TCAGGTAGGC AATCCTTTGC CGGACGTTGG TGAACCCGGC CCTAGGGTTC CGGAGCATTC CTCGGGAGAA GCGTCCACAG ATGCAGTCGA AAACGTCCCC AACAAGCCGC TCATGGCTGC TTCTGTTCTC GTTAAAGCGG TAGACGACGC TTCGATTTTA GAGCATCAGG CAGAAAAGCA TTTAGGGAAG TTGACTTCAT GGACAAAGTC CACACTAGTA TACGCGCCGG AGGCAATGGC GACCAATGTA TCCGACTCAT TTTCGTTTCT AATGGATTCC CGACTGCGGT CTTGGACGCT TTTGCTTCTC CGACACTCTC TTTCTACGGG TGACTCTGAA AGCCGGACAC GGCTGCTTCG GATACTTTCG GCGAATGTTG CTGTCGACGC CGCTGAAACA ACGTTGCAGA CCCTCCCTAT GCCCGATTCC GCTGCATGCC ATCCAAAGGA AGCCGATGTG ATTCTTCCCC TTCTTTTTGA GACCAAGATT TCTCTTACTT TACAGGGCAA ACAAGAAACT GTTGTTCTTC GGGCTCCCGG GACTATGTCG GGTAGGTTGA TGTTCGAATC GATAAGAAGC TGTTGTATTG TCTTTACTCA TTGCTTGAAC TTTTGTTTAT GATAGCTTAT TTTGGGAATA ACAGCATAAC CGACATTGAA GACCTACAAA AGGTCGATGT ACGGCTAGAT ACTGATATCA TGCTCAAGGC TATGGTTGAT CAAGCAAGAC TGATTATTTT TAAGACTGTT GCCAATGCAA CTCACACGCC TCTACCTCTC GGTGTCGTGT CGAAAGCATC TGGGTCCCCG AAGGACGTCG TTTCCAAAGG AACAAGAACC GAACCCGCAT CGGAAGGCTC TAACCCTTCC ATTCTGACTT CAACTAGTCT CGCTGGCTTC CGTAGCGCGT TAAATCTTGG TAGTTCCACG ACTTCACAGT ACGAGGACCA AACCATGCGA CTACAGAAAG CGCGTTCATC GGCTTTGCGC TTAAACGGTG TACTTCATGG AAAGAAGGCT GAACCTACAC GGCCACCACA GCCGCAAAGG GTCCGCTCGG TCAAATGGGA CACGCCTGCT ACATTACCGA AGTTAAATTC AAGCCTAGAT CCTAGTCCAA AGAAGGCTCG ACTGTCGCAA GCCGCAACCG TTTCGAAGCT GAAGAGCTTC CGATCGTTTG GACGACCACA TGCTGGAGAC TTTGGATCTG GTCCCCGCAA TGCAACGTTT GGGGAGTATG GAGGACGCCA GGGTATGTGG GGACGAGATG GGCGGATGAT TCATCACCCC ACGCCGATGC AAGAGGGCGG AGCGGGTGGA TTCGTGGAGG TACCTGTTCC GGAAAAGAAT GCCACTTTTA ATTTGAATTC TGCAATGAAG ACGAGCCAAT CGGCGGTTGG TGTCGAAGCT GGCATGCCCC GAACGGCAAC GGCGCTGGAG AACTGGTTCC TCAAATCCGC TACATAACGT CGCAGATCTA TACGCTATTG ATCACGATGA GGAACAGTTT TTACGATTGT ATATGATGGG CTCTACTTTT AATGTTATTT ACGTTAGAGT TACTGTTAAT GGTAT
|
Protein sequence | MDAPKPKSQE VVPQESSMPA SRSLSKDQEK TVVECARLLG TTTSSLARTA VRPRELALLF SEAVTDHDVV LTLPSLDCRV GIENDQVGNP LPDVGEPGPR VPEHSSGEAS TDAVENVPNK PLMAASVLVK AVDDASILEH QAEKHLGKLT SWTKSTLVYA PEAMATNVSD SFSFLMDSRL RSWTLLLLRH SLSTGDSESR TRLLRILSAN VAVDAAETTL QTLPMPDSAA CHPKEADVIL PLLFETKISL TLQGKQETVV LRAPGTMSAY FGNNSITDIE DLQKVDVRLD TDIMLKAMVD QARLIIFKTV ANATHTPLPL GVVSKASGSP KDVVSKGTRT EPASEGSNPS ILTSTSLAGF RSALNLGSST TSQYEDQTMR LQKARSSALR LNGVLHGKKA EPTRPPQPQR VRSVKWDTPA TLPKLNSSLD PSPKKARLSQ AATVSKLKSF RSFGRPHAGD FGSGPRNATF GEYGGRQGMW GRDGRMIHHP TPMQEGGAGG FVEVPVPEKN ATFNLNSAMK TSQSAVGVEA GMPRTATALE NWFLKSAT
|
| |