Gene Information |
Locus tag | PHATRDRAFT_47998 |
Symbol | |
ID | 7202998 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 655767 |
End bp | 658434 |
Gene Length | 2668 bp |
Protein Length | 782 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182272 |
Protein GI | 219123938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0260964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGTGGACGG GAGCCGTTTT CCGTGCCCTT GAATTGCAGT CGCATCACAC CATGGTGGAT AGAGACACCC CAACCGAATA CACTCTTCTT ACCCATCTAA ACACAAGTCT CCACCAACGT ACCATCGTTG TGGAAACAAG GAAAGTCGTA CTGTAGAGAC CAGCGCCGCG CCGTACCGCA CCGCACCAAC AGTACACTAC TCTTGAGAAC ACAAGCGACG GCCTACACAA CAAAAGAACA AACCGACCTA TATAATGGGA AACTCGGCTT CCTCGTTGCC GTACGCCATT GGGAAACAAA CGGCGATCGT CAATGACGGT TGGGCGTTGC ATGAAGGCAC ACAAAAGTCG GACGGTTCGG ACGTGTCGGT CTTTGTGGCG AAGAAAGCCG TCTTGAACAA GACCGCCATC GATCGGGCAC GCGATCCGTC CCGGACGCAG CTCGAACCCG CCCTCCACCA CTTCTCCTAT TGCAAGAAAC TGCGTCATCC GCACATTCTA CAGGTACTGG CGACGCTCGA CACCGATCAC CCTAACGACG CCAATAACGC GACTGCGGTA TCGTCTACTG CCGCATCGCA AGCGACCAAA GAAACGGGGG ATCTCATCAT CGTTACCGAA CGCTGTGTGC CACTCGACGT CTGGTTGCAG CAAGAAAATC CTCCCCCGGA ACAATTGGCC TGGGGACTAG AAGTTGTCGT TTGTGCCCTA CATTTTCTAC ACGCTTCCGC CAATCTCGCG CACGGAAACA TCTCGCCGGC CTCTTTTTTC GTGACCCGCG CCGGAGACGT CAAACTCTGG AACTTTGTCC TCGTCACCCC AACACACCAG GCTTCCGGAG GACTCTCGAA CCATTTTCAA ACCTACGAGG ATCTCCTCAC CCCACAGCCC TACCGTTCGC CCGAACGACA ACAACGCAAT TGGGCCGCGC TAGCGGCGGA GGGGACGCAC GCCATGGACA GTTTCGGCTT GGCTTTGTTG ATTGCACACT TTTACGGAGG CACGGTCCCC CCGCCTCTGC AGAAGGCTGT CCAACGACTG CAAACGCCCA ACGTCAAAAT GCGCCCCCGT CTTCAACCAC TCCTCAAATG CCCGCTCTTC GATACACCCT ACCAAAAATT ACAACTCCAA CTCGAAGAAT TCATGGTCAA ACCGGTGGAA GAGAAAATCG CATTTTGGCA AAACTTGACG CCCCAGTTAC AGGCCGCCTT GATTCCGGAA AATCTAGCCG TGTACAAGTT AATGCGCATC ATGAAATCCA CCATTGACAC CATCTGCCAA TCTGACTCGA TGCGATCACA AGATATGTAT CGACGAGAAT GTACGTGCAG TTGTGTCTGT ATGACCATAC ATTAACATGT TCACCGTGGT ATCTGATTAA CACAGATTTG TTGTGCGAAT CTCACCCCAT TCTGGTTCCG TCGCACCACG CAGTATCCTC TATACTAAAA CCGCTCTTTT TTGTTGGCGA GCATTACTTG GACAACGATT TTGGCAAAGA ACTGACGTCG ACCGTTTCTA TTTTGTTCAC GGTGAACGAC CGAGGCATTC GAGGCGCTCT CTTGCAAAAG GCTTCCCTTT TTTCCAAACA TCTCGACAAG AATACCCTCA ATCAAGCTGT TTTTGAGCCC GTGTGCAGTG GATTCTCGGA TTCCAGTTCC GCCCTGCGGG AACTCACCCT CAAAGCCACG CTCGGCATGG TCCCCTGCTT GACGCCGCCT AGTTTAGAAA AGCTTTCACG GTATCTCGTA CGCTTGCAGA GCGATCCCGA AGCCTCTATC CGGACCAACA CTGTCATTTT 
CTTAAGAAAA CTCGCACCAC ATTTGACCGA TACGACGCGC CACAAAATGC TGCTGCCAGC TTACGTAAGG GCCCTGAAAG ATCCTTTTAC GCCATGTCGA TTGGCAGCTT TGCAATCGGT GCTGACTTCC AAGGAGTTTT TCGAACCCAA CATTTTGGCG GGTAAGGTAT TGCCTGCGGT TACACCCTCA TTGTTGGACG GAGCAGCCGA TGTCCGCAAA GAGGCCTTCA CTGTAGTGGA CGATTTATTG TTCGCGTTGC GGCAAGAAAG CGAACGCATG AACTCCCAAC CGGATGCATC CGCAAAGACC GCGGTGACGG CGCCGGGAGT ACCTCCATCT GTCCCCCAGA CGTCGACTAC TCCAGTTCCG TCGGCGCCGT CGTCTGGAGG ATACTTGACC GGACTGTCCT CCTGGATGAC ATCCTCGGCC AAGCCGACTG AACCTGTTCC GTCGACGGCA AGAGCTGGAC CGCCGCGTAG TGCCCCAGCT CCCGCCATAT CTGCGCCAGC CGCAGGAGCA GGGTATACAC CTGCTGCACC CCTCGCCATG GCGCACCTGA CGCTCGATGA TGATGCCAAT GACGACGACG GTGGATGGGG GGACGACGAT CTCGATGTAA GTGAACCGCC GAGGGCACAA CGACCAACGG GGGTGACCAC GACAAAGAGC TCCTTGTTTG CACCGGCACC TGCAGAGGAC GACTTCTTTG GGTCATTTGA TGCTAAGCCT GTGAAGCAAG CTTCTTTGCG TGTGAGCAGT GCGGGCAAAC TCAGCATTCC TGCGAAAAAG ACGAAACCAG TTACCAAAGC AGCTATTACA AAACTCGCTG CAGACGATAA CATGGACGAC GGATGGGACG AATTTTAA
|
Protein sequence | MGNSASSLPY AIGKQTAIVN DGWALHEGTQ KSDGSDVSVF VAKKAVLNKT AIDRARDPSR TQLEPALHHF SYCKKLRHPH ILQVLATLDT DHPNDANNAT AVSSTAASQA TKETGDLIIV TERCVPLDVW LQQENPPPEQ LAWGLEVVVC ALHFLHASAN LAHGNISPAS FFVTRAGDVK LWNFVLVTPT HQASGGLSNH FQTYEDLLTP QPYRSPERQQ RNWAALAAEG THAMDSFGLA LLIAHFYGGT VPPPLQKAVQ RLQTPNVKMR PRLQPLLKCP LFDTPYQKLQ LQLEEFMVKP VEEKIAFWQN LTPQLQAALI PENLAVYKLM RIMKSTIDTI CQSDSMRSQD MYRREYLLCE SHPILVPSHH AVSSILKPLF FVGEHYLDND FGKELTSTVS ILFTVNDRGI RGALLQKASL FSKHLDKNTL NQAVFEPVCS GFSDSSSALR ELTLKATLGM VPCLTPPSLE KLSRYLVRLQ SDPEASIRTN TVIFLRKLAP HLTDTTRHKM LLPAYVRALK DPFTPCRLAA LQSVLTSKEF FEPNILAGKV LPAVTPSLLD GAADVRKEAF TVVDDLLFAL RQESERMNSQ PDASAKTAVT APGVPPSVPQ TSTTPVPSAP SSGGYLTGLS SWMTSSAKPT EPVPSTARAG PPRSAPAPAI SAPAAGAGYT PAAPLAMAHL TLDDDANDDD GGWGDDDLDV SEPPRAQRPT GVTTTKSSLF APAPAEDDFF GSFDAKPVKQ ASLRVSSAGK LSIPAKKTKP VTKAAITKLA ADDNMDDGWD EF
|
| |
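The coordinate and length fields in this record can be cross-checked with a little arithmetic. The Python sketch below recomputes the gene length from the start and end positions, assuming 1-based inclusive coordinates (an assumption: the record does not state its convention, but this one reproduces the listed 2668 bp). It also estimates how many bases the 782 aa protein accounts for, assuming a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus one stop codon if requested."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(655767, 658434)   # 2668, matches the "Gene Length" field
cds_len = coding_bases(782)              # 2349 bases for 782 aa plus a stop codon
non_coding = gene_len - cds_len          # 319 bases not explained by the coding sequence

print(gene_len, cds_len, non_coding)
```

The 319-base difference is consistent with the gene being marked as spliced: the listed gene span covers more than the coding sequence alone (for example, intronic or untranslated bases), though this check by itself does not say how those bases are distributed.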