Gene Information |
Locus tag | PHATRDRAFT_49738 |
Symbol | |
ID | 7198430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 112249 |
End bp | 114086 |
Gene Length | 1838 bp |
Protein Length | 544 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184488 |
Protein GI | 219128582 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCGGCACCG TCGTTTTCCA GTCCTTGGCG TTGCCAACAC ACTCTCGAAC AACCACCAAG CATCATTACA TATATATACA TATACCAATC AGTGAGGAGA TACTCGTTGT TTTGCCAGTG CACATCAACG TGTTCGTCGT CGATAGTTTT CGTACTCGTT TCGCGACAGC ATGAGCACCG TCCAGGCCAA CGGAATCCGG GACGACGCTG CGCTGTCGGA CATTGCGATT CCATTCTACC GCAGTACGCC AATCCTCGGA TCGTTGGCGG TGATACTGCT CACTACGGTG GTCTACTTCG TCTACCATCG GTACTATCGT TCCGCCCCGC CCTCCAAACG CGGAGCCCAC GTCCAGGTCT CTTCCATCGA CGCACAAAAC TCCGTCAGTT TGGCAGACAT GGCCTACCTG GCGCACGTGC TATCGCCCCA GTCGACCCAT ATGGACGTAC TGTGGGCTGC TATTTCGACA CCCGAAATGC TGCAAACCAG TGAGGCCGAA GTGCACAAGG TGGAGCGGAT CCGACGAGAC CGACAAAGCC GACGCGCGAC GCAAAAACCC AACATTCCCG ACGAATTGGA AGCCCTCGTG GAAGACGACG ATGGATGGGG GGAAGACGAA GACGATGTGG ACGAAAGCGA CCAAGCGGCG GCACAAGTCC GTGCCAAAGC CAAACAGGCC GAACAGGAAC GCCAGCAAGA TATGGAACGG CTTAAAGCCG CCACCGGACA AACCAACGAA CCGCTCGAGG GAATCGATCC AGGAGTTATT GGACAAACTT GGGTCGAAGC GACACTGGCC CAACACCAGT CCTGGCCACC GCCATTGACG GACGTTATCA CGGCACATAC GTATACTTAT GAAGACCAGC CCGTTGCGAA CCCACTCGAC CACAAAGGTC TGCGCCGTTA TCTCTGTATG ACCATGGGTC GTTTGAACGC ACAAATACTC AACACCAAAC CCGAACTACT CCAAGCCGGA GCGCAAAAGC TCATCGACCA GACGTATTTT CGGGGCAGTT TGGAATTCCG TGGTCGCGTC GGGACCTTGT TGGAAGCTAT CTTGCGACTC GGTACCGTGC TGAAATCCCG GGCACTCGTT GCAACCACCA TTGAAGCCGT CGCGTCGTTC AAAGTCGGGT GTTTGCCGGG AAAATCGACG ACTTGGTTCC TGCAAACCAT GCAACGCCAG TACGGATGTC AACCCCATCT CGCGATACAC GAAAAGAAAG TCGAAGTCCC TCTTTACGAA GAAAGCAGCA TCCTGGCCAC GGGAGACATG GCGGAATTTT TCCTCGACTT GGAACGCACC CACGCAGAAA ATTTTCTGAA ACAAAAGATC GCCATGTGTC AAAAACAAGG CATCCCGCCG CAAGTCGCGC TGCAAGCCTA CCGGGAAGGA TGGTGGTTCC TATTGCGGGC GGAAAACGTG AACGACCCAA ATATTCGAGC GGAACCGATC ATGCGCGAGT CGCCCATTCT TTCCAAACTC GACAGTCAAG ATCTGGACAA GTTTGAAGCC GCGACACCGG CCGCACAGCG GCTCATTACC GCATGGCCCA TGATTGTGCA AAACTGTGCC CAAAAAGCGG GCAAGGTGCG GATTCAGTTT CCGGTACCGT CCATTCCGGG CAAGTACCGG TTGGTGCTCG ACATCAAATC GCAAGATTTT TTGGGCGCCG ATCAACAAAT CGTCATCGAA AAGGATGTTG TGGATGCCCA GACGATCCAG CGAACACTGA AACCCAAGGA AGAACAAGCC ATCAAAAATG AAGAGTCCAA GGGGGAAACC AAAAAGGAAG 
CGTAATTGCG TCTTATAATA TACTTACCGA TTCAAAGT
|
Protein sequence | MSTVQANGIR DDAALSDIAI PFYRSTPILG SLAVILLTTV VYFVYHRYYR SAPPSKRGAH VQVSSIDAQN SVSLADMAYL AHVLSPQSTH MDVLWAAIST PEMLQTSEAE VHKVERIRRD RQSRRATQKP NIPDELEALV EDDDGWGEDE DDVDESDQAA AQVRAKAKQA EQERQQDMER LKAATGQTNE PLEGIDPGVI GQTWVEATLA QHQSWPPPLT DVITAHTYTY EDQPVANPLD HKGLRRYLCM TMGRLNAQIL NTKPELLQAG AQKLIDQTYF RGSLEFRGRV GTLLEAILRL GTVLKSRALV ATTIEAVASF KVGCLPGKST TWFLQTMQRQ YGCQPHLAIH EKKVEVPLYE ESSILATGDM AEFFLDLERT HAENFLKQKI AMCQKQGIPP QVALQAYREG WWFLLRAENV NDPNIRAEPI MRESPILSKL DSQDLDKFEA ATPAAQRLIT AWPMIVQNCA QKAGKVRIQF PVPSIPGKYR LVLDIKSQDF LGADQQIVIE KDVVDAQTIQ RTLKPKEEQA IKNEESKGET KKEA
|
| |
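The coordinates and lengths reported under Gene Information can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the reported 1838 bp is consistent with that) and that the coding region ends in a single stop codon. The function names are illustrative and are not part of the source record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
gene_len = span_length(112249, 114086)
cds_len = coding_length(544)
flanking = gene_len - cds_len

print(gene_len, cds_len, flanking)  # prints: 1838 1635 203
```

Under these assumptions, the 203 bp difference is the sequence in the Gene sequence field that lies outside the coding region, before the ATG start codon and after the stop codon.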