Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50461 |
Symbol | |
ID | 7199312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 133490 |
End bp | 135609 |
Gene Length | 2120 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185385 |
Protein GI | 219130465 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.450436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAGTCAAT AAACACCAAA TTCATTCCGA CGGATCTTTG CCAAATTTCA AGACCAACGA ATTCGTTACA ACAACCTTTT TTTAAACTAG AGAATAGTTT GGAGCTGCGG TGAGTGGATC CTGGGAGGTA AGCGAGCCTT GTTCATATAG ACCATATTCG AGCTCTTCAG GGATATCGTA TCAATCGCGA CTGAGTTTCT GCTGCTTGAA AAGACCTGTT CCACTAGTGT AATTATTGGT TCGGTCACTT TGGAACCGAT ATACCTTCAA TCCTCGTTTG ATTTGGGAGT GAGAGGTTGC AAAGGGGTCT TGCTTTCAAG ACGGAAGTTT CCGAACACCC GATCTCCATC GTCTAGCGTT CCCGACCGTT CCTTGTCGAG CGTGCCCAGT ATCACAATCG TGAATGTCTG AGTATGCGAT ACTTATTGAA ATGCACAAAA CTATATATGC AATCCTGGTT TGATTTTATC CACCTTTTAG ATTGTTTAGA TAAGGACAAC AAATTGCGAT TTGTCTCAGA AAGTCATTCT CTGGGCGTCC GTACCACATC TCCAGTTGTC TCGCACGCTT CTCACCACGA TGAATTTTCC TCCTAATTTC TCTAGTCCTT TGTTTCTACG ACTCTTAGAA TTTCCTGGCG AGCGCTTCCA TTTTGGTAGT GCTTTTGGTA TTGCACGCTG AGCAGTAAAT CGTTTCGGTG TGTTCATGGA TCAAAGAGAG GTCGACGAAG CCGAAGCGCA GCTTGCTCGA GCTCTGGGCC TGTCGAATTT CGATTCCTCC TCGAACGATT CAAGGGATCC GGAACAGCAA TTCATCCCCG ACCCCTACGC CGAACAGGTC GATCTATCAA AGCTGCGTGG CGTGGCCAGT GGTGACAATA AGAAAGTGGA CGACGATTCC GAGACCGATC TGGAAGCCCT CGCCGAACTG GTTCCCGACA TCAAGTCAAG ACGCAAGAAT ACGGATAGCA GCGGTATTGA AGTTGGATCC CATTCACACA TTCCCATGCC TGCACAGGAT CAGTACAAGT CTTCTCCGGC ATTAGACGAA GTGATCGCAA GGGCAGATTG CGCGCCTCCT CCCTCGGACG AGGATTCGAC CGACGGCCGT AGACTTAGCA TGCCATCGGC CACCGATTAC TTGAAACATA CGGTATCCAC GCAGGCGAAC GGGTCCCGTA ATGTGGATGA CGTTTCGAAA CTAAGTTCGA TGAAAGGCGG TAACAAGGAC AGGACAGGGC ATGTTTATAA GCCAAGCAAA GGACCACGTG TACCTGCTGA TGAGGATGAA GAATACGACG AGGACACTCT TGGAGAATAT CCGAGGTCGG AGCACGTGCT GGGGGAGGCC GAAAATGTAT CTAAGGACTT TATGAATCCG TCTTCAGGGC AGTCTATCTC CACCGCCACA CAGACCCACG AAATGCGCAT GCCCATGTTC CTCCCGACCT TCAAACCTGC TACGGGCTGC ACCAATGCCT CAGACTTTGT GGTGCGCTGC TTCGTTGCCC GTCTGCGATC GGGCATTACA GTGGTAAAAC ATGGGCGATC CAGGTGGTGC AAGTCGCGAT TGCGAGTCCT GCACATTCAT CCGGACGGTC GATCCCTCAG TTGGAGGCCT GCTGAAGGAG AGCCCACCAC AAATAGACGC CCTCCCAAGC TCGACTTGAG TACTTGCCTC GAAGTCCGGC ACGCCTGGAG CCCTGATCCT CACAATCCTG TCTACACAGG CACACCTATT CTGCGACAAA AATGCGAAGC TGCGAATGCA CACAAGTCCT TTGCGTTGAT TTTCAAAGGT CGCACCGTCG ACATCACGGC CGTCACTGCG GATCAGTGCA AGGTGTTGAT GGAAGGCTTT TCGGCTTTGT GTTTTCGGTT GCAGGTGGCT AATCTTGCTG GTCGCAAAAA AACTCGGCCC ATGCCGGAAG AAGATGGTAT CAGCACAACG GCCAGCAACA CTCTGACCAA CAATTCCTCG GCTCCCCGTA GATAATATGC CGCACTTATA TTTGCACCAA TAGAGAGGCG AGAGGCAGTG TCCTTCTCGC TACTCTCTAT GTATGGAAGC TTACTGTTAA TTCAACTCGC ATATTTTAAA AAAACTTTCT ACCATGTCCT
|
Protein sequence | MDQREVDEAE AQLARALGLS NFDSSSNDSR DPEQQFIPDP YAEQVDLSKL RGVASGDNKK VDDDSETDLE ALAELVPDIK SRRKNTDSSG IEVGSHSHIP MPAQDQYKSS PALDEVIARA DCAPPPSDED STDGRRLSMP SATDYLKHTV STQANGSRNV DDVSKLSSMK GGNKDRTGHV YKPSKGPRVP ADEDEEYDED TLGEYPRSEH VLGEAENVSK DFMNPSSGQS ISTATQTHEM RMPMFLPTFK PATGCTNASD FVVRCFVARL RSGITVVKHG RSRWCKSRLR VLHIHPDGRS LSWRPAEGEP TTNRRPPKLD LSTCLEVRHA WSPDPHNPVY TGTPILRQKC EAANAHKSFA LIFKGRTVDI TAVTADQCKV LMEGFSALCF RLQVANLAGR KKTRPMPEED GISTTASNTL TNNSSAPRR
|
| |