Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44930 |
Symbol | |
ID | 7199833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 668146 |
End bp | 670121 |
Gene Length | 1976 bp |
Protein Length | 575 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178823 |
Protein GI | 219116056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.541672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTATA ACAAAAACAG TCTTTTAGTC AAATTTAAAT TTGCATTTTA CGTCGTGATT GTGAATTTGA AAGATTCGGA GACCGATACT CCGCACTACC ACGTGCTGTC CGGAATGTTT TCTCGCCCAT TCTGTTGCTG TTTGAGCGAG GCGTCAATCT ATTGGCTGCG AGAGACGATA TTCACTGTTG GTGTCAATGT ACGTCTCGTT CCTAACAGCA ACCAGAAAAT GCGTTCACGC AATTGGCTGG CTCTGGTTCT AGGAACGAGT GCTATTTTGC TTGGTCCAGT TACTGCCCTT TTTGGACGCG CTGCGCTTTT CCGATCCCAT CCCCGAAACG CCACAGTTGA TCTGCGGAAA ACGAGCGGGT TTACGCTGCG AGCTGGATCT TCTAATTTAC CTTGTGTGGA GTATAATGGG GAGGAACCTC CCAAACGGGC TGTCGTTCTT ATGGACGCTT TCTGTCCGTA CCATGGTCTT TATTTAGCGA ACGCCGTTCG GGAACGCTTC CCCGATACCG CCGTCGTGAC CGTTCTGTCC GACTATTTAT ACGCGTTTTT GTCAGCTACG GAACCAGAGT CGCAGGCACA GTGGGATTCC ATGCGAGTTC CGCAAGACGA GACCACTCTT GCTATCTGGC AGTCATTACC CGTTGCATGG CAGGCAGTGT ATTGCGAATC GGATTCGGGC TTGGAAAGTG CGGAAGCTTT ACGAGAGGTT TTAGGTGTAG CCTGCCGGGA CAAGCCGGTC ATGTTATCAG CTCGAAGACA CAAATACAAG ATGAATGATC GCGTTAAATC AGTAGGTTTG GAGTCTGTTA AACAAAAGAT GTGTGATTCG CTGGACGAAG CCCAAGCCTT TGCGCAAGAG CTGGGCCTAT CAACAGGGGA ATCCCTGCAA GCAGTAATCG TCAAGCCGTT TCGAGGGGTC GCGTCGGAAT CCGTCCATTT ATGCCATGAC AATGCTGAAA TAGAATCTGC GTGGAATTCA ATAACGTCCA CGGCGGTTTT TGGGTGCAAA GGTCGGCACG ATTCTGTTCT TGTTCAGGAG TTCCTGCAAG GTTGCGAGTA CGCCGTAGAT GTCGTACTGC GCGATGGTGT TCCGAAAGTT GCGGCAGTGT GGAGGTACGA CAAGACGCAA GCAAATGGTG CTCCCTTTTG CTATGTTTGC ACGCGTCTGG TTGATTCGCA TAGTGATCCG CAAGTACCAG TGGTTTGTGA CTATGTGCTA GACGTCTTAC GAGCTTTGGG TGTAAAATGG GGTTTGAGCC ACAATGAAGT GATTGTCACA TCCGACCGTG GGCCGGTCTT GGTCGAAGTG AATTGCCGCC AGCATAATAT GGACTTTTGT CCGCTCACCA TGGCTTGCAT CGGATACAAC GCATTGGATA TGACGGTCGA CGCTTTATTA GGGGATGAAG AAAGCTGGAA AGTGACCTAT CCCGCTTTTC CCGCATTGCG AGCGCAGGGG TGCATGGTGC ACTTAATCAA TTACGCGTCG GGCGTGCTGA ACGAAGTAAA GCATCTGGAA GAAATAGATG CGCTACCAAG TGTCCTGAAC TGGGAGGTCT ACGATCACTT CCGGGAGCCC GGAATCCTTA TCGAACCAAC TGTGGATATT CGCTCAGATG CGGGATGGGT ACAGCTTGTC CACGAGGATC TGAATACGCT GGAGCACAAT TATGAGCAAT TGGTTGCCTG GATGCCAACA ATGTTTCAGG CACATTAACA TAAGTGTAGT ATTGAGCATT TCCGTCAAAA GGAATTCCAA TTTACAGTTA GAAAAGAAAT AGAAGCGAGT GATAGGAAAA ATAAATCAGG TGGATTGATA GTTTTTAAAG CCGAAGCGCA CTCACTGGTT TCCATCTCCT CGGGACATTG CCGTAGAGAC GTCAATTCTT CGAGAAAGTT CCTGTTTCGG TCTATTATCT GACACAGTGC TGCCTTTCAA CGTGGTCCAT TTCCGACATT TTTTAA
|
Protein sequence | MDYNKNSLLV KFKFAFYVVI VNLKDSETDT PHYHVLSGMF SRPFCCCLSE ASIYWLRETI FTVGVNVRLV PNSNQKMRSR NWLALVLGTS AILLGPVTAL FGRAALFRSH PRNATVDLRK TSGFTLRAGS SNLPCVEYNG EEPPKRAVVL MDAFCPYHGL YLANAVRERF PDTAVVTVLS DYLYAFLSAT EPESQAQWDS MRVPQDETTL AIWQSLPVAW QAVYCESDSG LESAEALREV LGVACRDKPV MLSARRHKYK MNDRVKSVGL ESVKQKMCDS LDEAQAFAQE LGLSTGESLQ AVIVKPFRGV ASESVHLCHD NAEIESAWNS ITSTAVFGCK GRHDSVLVQE FLQGCEYAVD VVLRDGVPKV AAVWRYDKTQ ANGAPFCYVC TRLVDSHSDP QVPVVCDYVL DVLRALGVKW GLSHNEVIVT SDRGPVLVEV NCRQHNMDFC PLTMACIGYN ALDMTVDALL GDEESWKVTY PAFPALRAQG CMVHLINYAS GVLNEVKHLE EIDALPSVLN WEVYDHFREP GILIEPTVDI RSDAGWVQLV HEDLNTLEHN YEQLVAWMPT MFQAH
|
| |