Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40783 |
Symbol | |
ID | 7198640 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 386319 |
End bp | 388405 |
Gene Length | 2087 bp |
Protein Length | 663 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184794 |
Protein GI | 219129223 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.26017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTGG CTTCTGGGGG AAGTATGGGT ATAAAGAAAG GACCCGTTAT CCCATTAGAA CGGTACGTAC CGAACGAGTA CACACAGCAT TGGTTTTGCT ACTATACACT GCGAAATTAT TTCCACGCTC ACGTTTCTTT ATATCTTGCA CTTGTAGCAA AGAATTGATT TCATTAGGCT CCTTATGGTT TGGCCTTTTC GTAATCCTCG CCTTACTCCA CGCCGCCGAG ATTGCCATTA CCACCCTTTA CCCCTGGAAA GTCCGCGAAA TTGCCGAAGA AGAGGAAAAG CAAGGCAACA TGCGTGGCAC GTTCAAAGTT CTCAACGAAG ACATTACCCG CGTCCTTACC ACAATCCTGG TTGCCTCGAC GGCCTGCTCC ATTTTTGCCA CGACTTTGTT TACGCATTTG GTGGCGAGCC TGTTTGGATT GCAAGGCGAA CGATACGGTG CCATAGCACT AACCGGATTG ACGTTGTTCT TCGTGGAACT TTTGCCCAAA AGTTTGGGTG TCACCAATGC CGAAACAGTC GCACGAATAA TGGTACCACC CGTTAATGTC GCTTCGGCTA TTGTGAGTCC GCTCGGTATT TCTCTCTCTT GGCTAGCCAA GCGCACCTTG TCCATGCTAG GTGTCAAGGA CAAAAACAGT GGCTCGGGTG TATCCGACAG CCAGCTGCGC TTGATTGTAA CGGGCGCCTT GGATTCGGGT ACCATTGATC ATGGTGAACA AGAAATGATT CAGGGTGTTT TAAAGTTACA AGATCAGCGG GTGAAGGAAA TCATGCGCCC CCGCGTCGAA ATGGTAGCAG TTCCAGTAGA CATGTCGGTC GCTAGCGTAC TAGGCGTCGT TCGAGAGTCC GGATACTCAC GAATTCCCGT GTACGATGGC GAGATCGACA ATATCGTGGG CATTGTACTG GCCAAGTCCG TGTTGGATTT TTTCGTAAAT GGAGTGCTGG TCGACGAAGA TTTGAGCAAA AAGTTGGGTA AGAATACCGA AGAAATCAAG GCTGCAGTAG AAGACCTCAA GGCCGCTGAC GAAGCTCGTC GCGGCGAACT ACCTGACGAT ATACAAGCCG ATGCGGACAA GGTCATGGAA CGCGTGGATC TTAAAATTGA TGCACTAGTC GATCAGCGTA TCGATGCGAA CATTGACGCG TCCCTTCCAC CCAGTCTATA CACTCCGGAG CGTATTCCGA TTCGTAGTAA GGGCAACAGC AATGAACCAC AAGGATATGT TCGATCACTG ACAGCAACCG AGTTGGCTTC TCGGATGGAG AAATCTATCC AGGAAGCTGG CTTGATCGAG TCTTGTTATT TTGTGCCGGA CACCGCCAAC GGGTGGTCAG TTCTACAAGA AATGCGTCGT CGTCGGGTGC ATTTGGCAAT TGTCGTGGAC GAATTCGGCG GAACAGGAGG ACTGGTGTCG CTGGAAGATA TTGTGGAAGA AGTGGTTGGT GAAATCTACG ACGAAGACGA CGATGAGGAT TTTCAGGTTT CGGAGGACTC GATTGCCATG CAGGACGACG GAACCTTTTT GATTCGCGGC GACGCCGATT TGGAAGACTG CGATACTATT TTGGAACTGA ACCTGGATGA GGAAGAAGCT CTGAAAGAAT TTTCGACCCT ATCTGGTTTT CTGTGCATGT GTGCAGGGGA GATTCCTTCG GTTGGGGACT TTATTATGAG CCGAGGTTGG TCGTTTGAAA TTTTGAGCGC AGATGACAAG AAAGTGTTAC TCGTCAAGGT GGAGCGCTTG GTTGGTGCAT TCGATAATGA AGAGGAAGCA GCGAGTGAGA ATGTTCTCAA GAACCTGCTA AAGTTAAATT CCAACAAAGA GCACAACAGC AATCATAACA GTAATGACGG CGACTCCGAC AGCGAAAATC GAGACGGCCG GGATTCCGAG CAGGATCGCG AACAGCAGGC CGAGGGCGAA CTCCAGAGTA CCGTCGCTGC GAATATGGCT GAAGCCAGAG AGATCGAACG TATGGTGGAA GCCGGGGAAC GTAAACGAGC AGTACTGGAA GCAATCAAGT TCGCATCGTT GGCCAACAAT ACGTCGCCCG ACAGAAACGA GTTGTGA
|
Protein sequence | MAVASGGSMG IKKGPVIPLE RKELISLGSL WFGLFVILAL LHAAEIAITT LYPWKVREIA EEEEKQGNMR GTFKVLNEDI TRVLTTILVA STACSIFATT LFTHLVASLF GLQGERYGAI ALTGLTLFFV ELLPKSLGVT NAETVARIMV PPVNVASAIV SPLGISLSWL AKRTLSMLGV KDKNSGSGVS DSQLRLIVTG ALDSGTIDHG EQEMIQGVLK LQDQRVKEIM RPRVEMVAVP VDMSVASVLG VVRESGYSRI PVYDGEIDNI VGIVLAKSVL DFFVNGVLVD EDLSKKLGKN TEEIKAAVED LKAADEARRG ELPDDIQADA DKVMERVDLK IDALVDQRID ANIDASLPPS LYTPERIPIR SKGNSNEPQG YVRSLTATEL ASRMEKSIQE AGLIESCYFV PDTANGWSVL QEMRRRRVHL AIVVDEFGGT GGLVSLEDIV EEVVGEIYDE DDDEDFQVSE DSIAMQDDGT FLIRGDADLE DCDTILELNL DEEEALKEFS TLSGFLCMCA GEIPSVGDFI MSRGWSFEIL SADDKKVLLV KVERLVGAFD NEEEAASENV LKNLLKLNSN KEHNSNHNSN DGDSDSENRD GRDSEQDREQ QAEGELQSTV AANMAEAREI ERMVEAGERK RAVLEAIKFA SLANNTSPDR NEL
|
| |