Gene Information |
Locus tag | PHATRDRAFT_37425 |
Symbol | |
ID | 7202352 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 116467 |
End bp | 121392 |
Gene Length | 4926 bp |
Protein Length | 1569 aa |
Translation table | |
GC content | 64% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181662 |
Protein GI | 219122666 |
COG category | |
COG ID | |
TIGRFAM ID | |
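The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record. The helper names `span_length` and `coding_length` are hypothetical, introduced only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 116467
end_bp = 121392
gene_length_bp = 4926
protein_length_aa = 1569


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def coding_length(protein_aa, include_stop=True):
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


span = span_length(start_bp, end_bp)
coding = coding_length(protein_length_aa)
non_coding = span - coding

# True when the listed gene length matches the coordinate span.
print("span matches listed gene length:", span == gene_length_bp)
print("coding nucleotides (with stop):", coding)
print("remaining non-coding nucleotides:", non_coding)
```

Under these assumptions the coordinate span equals the listed gene length, and the span is longer than the protein alone would require. That difference is what one would expect for a gene marked as spliced, although the record itself does not list intron positions.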
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.61446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAG GTCCTGATGA TGGATTACCA GTAGGAGCTG TGGTTGGACC GCGACTCGGT TCCCTGGACG GAAGGGCCGA GGATAGTGCA GTAGGTGTTG CACTGGAGAA TATACTAGGC CCGGCACTCG GTACCATAGT CGGCGAAAAG CTGGGAAGCC CCGATGGAAC CAGCGTGGGG TGTTGGGTAG GTGCTTGACT TGGACCCGAA CTTGGTGCCC CGGATGGTAC ACTGGACGGA GGTGCACTGG GTGCCGCACT TGGGATTGCG CTGGGTCTCG CACTCGGTGT TGTACTCGGT AAAGAACTAG GAAGCATCGA AGGTGCCGCA CTTGGGATTG TGCTGGGTCG CGCACTCGGT GCTGTACTCG GTAAAGAACT AGGAAGCATT GAAGGTGCCG CACTTGGGAT TGCGCTGGGT CTCGCACTCG GTGTTGTACT CGGTAAAGAA CTAGGAAGCA TCGAAGGTGC CGCACTTGGG ATTGCGCTGG GTCTCGCACT CGGTGTTGTA CTCGGTAAAG AACTAGGAAG CATCGAAGGT GCCGCACTTG GGATCGCGCT GGGTCTCGCA CTCGGTAAAG AACTAGGAAG CATCGAAGGT GCCGCACTTG GGATCGCGCT GGGTCTCGCA CTCGGTAAAG AACTAGGAAG CATCGAAGGT GCCGCACTTG GGATCGCGCT GGGTCTCGCA CTCGGTGTTG TACTCGGTAA AGAACTAGGA AGCATCGAAG GTGCCGCACT TGGGATCGCG CTGGGTCTCG CACTCGGTGT TGTACTCGGT ATAGAACTAG GAAGTGTCGA AGGTGTTGCA CTTGGGATTG CGCTGGGTCT CGCACTCGGT GTTGTACTCG GTAGAGAGCT AGGCAGCGTC GAGGGTCCCA CGCTTGGTAG TGTGCTGGGT CTAGGACTAG GAAAAGCGCT TGGAAGTATC GACGGTGCCG AAGAGGGGTC TATACTAGGT GCCTGGCTTG GACTCATACT TGGTGCCTGA CTTGGTTTCA TATTTAATGC ATGGCTTGGA CCCATACTTG GAACTTGGCT TCGACCAATA CTCGGTACCT GGCTAACAAC AACACTCGGT GCCTGGCTTG GACCCATACT TCGCAACGCG ATTGGTACCG TAGACGGCTG GTCACTCGGT TGTGTACTGG GAACCAAAGT CGGGACGGTG GCTGCCGGAG TGGTCGAGGG CTGTGCGCTG GGAAGCGCAC TAGGGGTCGA TCCCGGAGCT ACGGTCGTGG GCGCTGACGG CTGGGCACTC GGTTGTGTAC TGGGAACCAA AGTCGGGACG GTGGCTGCCG GAGTGGTCGA GGGCTGTGCG CTGGGAAGCG CACTAGGGGT CGATCCCGGA GCTACGGTCG TGGGCGCTGA CGGCTGGGCA CTCGGTTGTG TACTGGGAAC CAAAGTCGGG ACGGTGGCTG CCGGAGTGGT CGAGGGCTGT GCGCTGGGAA GCGCACTAGG GGTCGATCCC GGAGCTACGG TCGTGGGCGC TGACGGCTGG GCACTCGGTT GTGTACTGGG AACCAAAGTC GGGACGGTGG CTGCCGGAGT GGTCGAGGGC TGTGCGCTGG GAAGCGCACT AGGGGTCGAT CCCGGAGCTA CGGTCGTGGG CGCTGACGGC TGGGCACTCG GTTGTGTACT GGGAACCAAA GTCGGGACGG TGGCTGCCGG AGTGGTCGAG GGCTGTGCGC TGGGAAGCGC ACTAGGGGTC GATCCCGGAG CTACGGTCGT GGGCGCTGAC GGCTGGGCAC TCGGTTGTGT ACTGGGAACC AAAGTCGGGA CGGTGGCTGC CGGAGTGGTC 
GAGGGCTGTG CGCTGGGAAG CGCACTAGGG GTCGATCCCG GAGCTACGGT CGTGGGCGCT GACGGCTGGG CACTCGGTTG TGTACTGGGA ACCAAAGTCG GGACGGTGGC TGCCGGAGTG GTCGAGGGCT GTGCGCTGGG AAGCGCACTA GGGGTCGATC CCGGAGCTAC GGTCGTGGGC GCTGACGGCT GGGCACTCGG TTGTGTACTG GGAACCAAAG TCGGGACGGT GGCTGCCGGA GTGGTCGAGG GCTGTGCGCT GGGAAGCGCA CTAGGGGTCG ATCCCGGAGC TACGGTCGTG GGCGCTGACG GCTGGGCACT CGGTTGTGTA CTGGGAACCA AAGTCGGGAC GGTGGCTGCC GGAGTGGTCG AGGGCTGTGC GCTGGGAAGC GCACTAGGGG TCGATCCCGG AGCTACGGTC GTGGGCGCTG ACGGCTGGGC ACTCGGTTGT GTACTGGGAA CCAAAGTCGG GACGGTGGCT GCCGGAGTGG TCGAGGGCTG TGCGCTGGGA AGCGCACTAG GGGTCGATCC CGGAGCTACG GTCGTGGGCG CTGACGGCTG GGCACTCGGT TGTGTACTGG GAACCAAAGT CGGGACGGTG GCTGCCGGAG TGGTCGAGGG CTGTGCGCTG GGAAGCGTAC TAGGGGTCGA TCCCGGAGCT ACGGTCGTGG GCGCTGACGG CTGGGCACTC GGTTGTGTAC TGGGAACCAA AGTCGGGACG GTGGCTGCCG GAGTGGTCGA GGGCTGTGCG CTGGGAAGCG CACTAGGGGT CGATCCCGGA GCTACGGTCG TGGGCGCTGA CGGCTGGGCA CTCGGTTGTG TACTGGGAAC CAAAGTCGGG ACGGTGGCTG CCGGAGTGGT CGAGGGCTGT GCGCTGGGAA GCGCACTAGG GGTCGATCCC GGAGCTACGG TCGTGGGCGC TGACGGCTGG GCACTCGGTT GTGTACTGGG AACCAAAGTC GGGACGGTGG CTGCCGGAGT GGTCGAGGGC TGTGCGCTGG GAAGCGCACT AGGGGTCGAT CCCGGAGCTA CGGTCGTGGG CGCTGACGGC TGGGCACTCG GTTGTGTACT GGGAACCAAA GTCGGGACGG TGGCTGCCGG AGTGGTTGAG GGCTGTGCGC TGGGAAGCGC ACTAGGGGTC GATCCCGGAG CTACGGTCGT GGGCGCTGAC GGCTGGGCAC TCGGTTGTGT ACTGGGAACC AAAGTCGGGA CGGTGGCTGC CGGAGTGGTC GAGGGCTGTG CGCTGGGAAG CGCACTAGGG GTCGATCCCG GAGCTACGGT CGTGGGCGCT GACGGCTGGG CACTCGGTTG TGTACTGGGA ACCAAAGTCG GGACGGTGGC TGCCGGAGTG GTCGAGGGCT GTGCGCTGGG AAGCGTACTA GGGGTCGATC CCGGAGCTAC GGTCGTGGGC GCTGACGGCT GGGCACTCGG TTGTGTACTG GGAACCAAAG TCGGGACGGT GGCTGCCGGA GTGGTTGAGG GCTGTGCGCT GGGAAGCGCA CTAGGGGTCG ATCCCGGAGC TACGGTCGTG GGCGCTGACG GCTGGGCACT CGGTTGTGTA CTGGGAACCA AAGTCGGGAC GGTGGCTGCC GGAGTGGTCG AGGGCTGTGC GCTGGGAAGC GCACTAGGGG TCGATCCCGG AGCTACGGTC GTGGGCGCTG ACGGCTGGGC ACTCGGTTGT GTACTGGGAA CCAAAGTCGG GACGGTGGCT GCCGGAGTGG TCGAGGGCTG TGCGCTGGGA AGCGTACTAG GGGTCGATCC CGGAGCTACG GTCGTGGGCG CTGACGGCTG GGCACTCGGT TGTGTACTGG 
GAACCAAAGT CGGGACGGTG GCTGCCGGAG TGGTCGAGGG CTGTGCGCTG GGAAGCGCAC TAGGGGTCGA TCCCGGAGCT ACGGTCGTGG GCGCTGACGG CTGGGCACTC GGTTGTGTAC TGGGAACCAA AGTCGGGACG GTGGCTGCCG GAGTGGTTGA GGGCTGTGCG CTGGGAAGCG CACTAGGGGT CGATCCCGGA GCTACGGTCG TGGGCGCTGA CGGCTGGGCA CTCGGTTGTG TACTGGGAAC CAAAGTCGGG ACGGTGGCTG CCGGAGTGGT CGAGGGCTGT GCGCTGGGAA GCGCACTAGG GGTCGATCCC GGAGCTACGG TCGTGGGCGC TGACGGCTGG GCACTCGGTT GTGTACTGGG AACCAAAGTC GGGACGGTGG CTGCCGGAGT GGTCGAGGGC TGTGCGCTGG GAAGCGTACT AGGGGTCGAT CCCGGAGCTA CGGTCGTGGG CGCTGACGGC TGGGCACTCG GTTGTGTACT GGGAACCAAA GTCGGGACGG TGGCTGCCGG AGTGGTCGAG GGCTGTGCGC TGGGAAGCGC ACTAGGGGTC GATCCCGGAG CTACGGTCGT GGGCGCTGAC GGCTGGGCAC TCGGTTGTGT ACTGGGAACC AAAGTCGGGA CGGTGGCTGC CGGAGTGGTC GAGGGCTGTG CGCTGGGAAG CGCACTAGGG GTCGATCCCG GAGCTACGGT CGTGGGCGCT GACGGCTGGG CACTCGGTTG TGTACTGGGA ACCAAAGTCG GGACGGTGGC TGCCGGAGTG GTCGAGGGCT GTGCGCTGGG AAGCGCACTA GGGGTCGATC CCGGAGCTAC GGTCGTGGGC GCTGACGGCT GGGCACTCGG TTGTGTACTG GGAACCAAAG TCGGGACGGT GGCTGCCGGA GTGGTCGAGG GCTGTGCGCT GGGAAGCGCA CTAGGGGTCG ATCCCGGAGC AAACGATGGA GGAAGACTTG GTGTGTCACT GGGTCCTGCA CTCGGTCTAT TCGTCGGGAC CGGGGATGGC TCTTCCCTTG GTGTGTTGCT CGGTCTTTCA CTGGGACGCT TGCTTGGTAC TAGACTTGGG ATAGTAGAGG GAGCTTTTGT TGGCTGTGTA CTAGGTTTCT CGCTAGGAGC TGTACTAGGC TTCTCGCTCG GGACGACGCT GGGTGTATTC GATGGAGTGG GCGAGGGCTG CGAACTAGGT CTTTTACTGG GGAACCCAGT AGGTACCGAG GTCGGATCAC CGGACGGTGC AGACGATGGC CTGTCGCTTG GATTTCGGCT TGGGAGTACA CTCGGTATAC TCGATGGAGC AGCGGATGGT TGCTGA
|
Protein sequence | MIEGPDDGLP VGAVVGPRLG SLDGRAEDSA VGVALENILG PALGTIVGEK LGSPDGTSVG CWVGALGAAL GIALGLALGV VLGKELGSIE GAALGIVLGR ALGAVLGKEL GSIEGAALGI ALGLALGVVL GKELGSIEGA ALGIALGLAL GVVLGKELGS IEGAALGIAL GLALGKELGS IEGAALGIAL GLALGKELGS IEGAALGIAL GLALGVVLGK ELGSIEGAAL GIALGLALGV VLGIELGSVE GVALGIALGL ALGVVLGREL GSVEGPTLGS VLGLGLGKAL GSIDGAEEGS ILDGWSLGCV LGTKVGTVAA GVVEGCALGS ALGVDPGATV VGADGWALGC VLGTKVGTVA AGVVEGCALG SALGVDPGAT VVGADGWALG CVLGTKVGTV AAGVVEGCAL GSALGVDPGA TVVGADGWAL GCVLGTKVGT VAAGVVEGCA LGSALGVDPG ATVVGADGWA LGCVLGTKVG TVAAGVVEGC ALGSALGVDP GATVVGADGW ALGCVLGTKV GTVAAGVVEG CALGSALGVD PGATVVGADG WALGCVLGTK VGTVAAGVVE GCALGSALGV DPGATVVGAD GWALGCVLGT KVGTVAAGVV EGCALGSALG VDPGATVVGA DGWALGCVLG TKVGTVAAGV VEGCALGSAL GVDPGATVVG ADGWALGCVL GTKVGTVAAG VVEGCALGSA LGVDPGATVV GADGWALGCV LGTKVGTVAA GVVEGCALGS VLGVDPGATV VGADGWALGC VLGTKVGTVA AGVVEGCALG SALGVDPGAT VVGADGWALG CVLGTKVGTV AAGVVEGCAL GSALGVDPGA TVVGADGWAL GCVLGTKVGT VAAGVVEGCA LGSALGVDPG ATVVGADGWA LGCVLGTKVG TVAAGVVEGC ALGSALGVDP GATVVGADGW ALGCVLGTKV GTVAAGVVEG CALGSALGVD PGATVVGADG WALGCVLGTK VGTVAAGVVE GCALGSVLGV DPGATVVGAD GWALGCVLGT KVGTVAAGVV EGCALGSALG VDPGATVVGA DGWALGCVLG TKVGTVAAGV VEGCALGSAL GVDPGATVVG ADGWALGCVL GTKVGTVAAG VVEGCALGSV LGVDPGATVV GADGWALGCV LGTKVGTVAA GVVEGCALGS ALGVDPGATV VGADGWALGC VLGTKVGTVA AGVVEGCALG SALGVDPGAT VVGADGWALG CVLGTKVGTV AAGVVEGCAL GSALGVDPGA TVVGADGWAL GCVLGTKVGT VAAGVVEGCA LGSVLGVDPG ATVVGADGWA LGCVLGTKVG TVAAGVVEGC ALGSALGVDP GATVVGADGW ALGCVLGTKV GTVAAGVVEG CALGSALGVD PGATVVGADG WALGCVLGTK VGTVAAGVVE GCALGSALGV DPGATVVGAD GWALGCVLGT KVGTVAAGVV EGCALGSALG VDPGANDGGR LGVSLGPALG LFVGTGDGSS LGVLLGLSLG RLLGTRLGIV EGAFVGCVLG FSLGAVLGFS LGTTLGVFDG VGEGCELGLL LGNPVGTEVG SPDGADDGLS LGFRLGSTLG ILDGAADGC