Gene Information |
Locus tag | PHATRDRAFT_35977 |
Symbol | |
ID | 7201435 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 249711 |
End bp | 251592 |
Gene Length | 1882 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180601 |
Protein GI | 219119693 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.586118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCCC ACAGACAGGT GCAGAGTATA GGAACTCGCA ACGGAGTGAA ACTCGGGAGA TTCGGCATCG TCTCGGCCTT ATGTCTAATC ATCACTCCCA GCGCTGCCTT TACAACGACG TCGTCTTTTC CGGTAATATG GCGAGTCGAG CCGAATTTAT TGCAAGGGCG GAACCCAATC TCAAGGTTTC ACTTCAGAAC GCCAGATTAT GCTACTCGTC CTCGACGTCG CGGTCCTCTT ACCACGACAT CCGCGGACGA AAATGACAAG AAGTCGGGTC CCCTCGACAA GGCTGTCGCG AAATTCAAAG CTCGCCCTGG CACTTATCTA CTCATCCCAT GCGTTGCCGC CATCGTCGGA TGGTTTACCA ACTGGTTGGC GGTTCAAATG ATCTTCTACC CCGTCCGATT CCGGGGTATC CCAATTTATC GACGACCCGA GATACCTCTA GGCTTCTTGG GATGGCAAGG GATCGTTCCC TGCAAAACCC GTCCCATGAG CGAAGCCATG GTGGAAATGG TCACATCGCA GTTGTTGACT GTAGAAGAGG TATTTGCGCG CCTGGATCCC AAAAAAGTGG CCAGTTTGCT TGCTCCGGAA GTACCGAAAC TAACCAACAG TATCCTCCAG GATCTATTTC CGAAATGGGT AACGGCAATG CCGTCGGCAG TTTTTCAGGG GCTGGATTCG GTATCACAGG GCGTAATGAT GCACTTTAAT CGAAAGTTCT TGGAAGGTCT TACGAAAAAC ATGCAGTCCA ACATTGATTC TATTTTTAGT CTACGGAACT GTGTGGTAGA TCAGATGCTC CGCGATCGGA GTAAACTCGG AGAGCTCTTC AAAATCTGTG GACAAAAGGA ACTCGACTTT TTAACCAACA GTGGCCTGTG GTTTGGTTTC TTGCTTGGGC TGATTCAGAT GGCGGTAGCA TTGTTTTGGG ATAATCCTTG GAGTTTGAGT ATTGGGGGTG GTATCGTTGG TCTCGCAACT AATTGGCTGG CCCTGAAATG GATCTTTGAA CCTGTCGACC CTACCCGCAT TGGACCTTTT GTTTTGCAAG GACAGTTCTT ACGACGCCAA CCCGAAGTGG CCAAAGAATT TTCGGCCTTT TTTGCCAATC AAATTTTGAC GGCCGAACAA CTGTGGTATT CCGTTCTGAA CGATCCTGCC ACAAAACCCG CCTTTGCGAC CATGTTTGCC TCGCATTTTA CCAATTTTGT GCACAAAATT ACCCATGGTT TTCGCGTCAC TCTCGAGCCA GAGACGATGA AGCTCGCTGC CGCCAAGGCT TTGGAGAAAT TGCCTAACCA TGTACCAGTG CTGTTTCCTT ACATGGACAA AGCTTTACAG TTGGAATCAA CATTGCGCGT CAAGATGGAG CAAATGACGT CGAGACAATT CGAACGAGGT ACGCCTGGCG AGTGAAAGAT GTGAGAGGCC CAAGCAGGAA GACCGTCAGC TCACCGACGC ATCTGTTTTT GGATTTTTTG TCCTCGTAGT CTTGCACCCC ATCTTTGAGG TAAATTGAAC CTGCACGAAC TCGTTCCTTC GTTTGATTAC GTTAGCGTAT TGCTGACTCG CTGTCTGTGC TTTGGGGTAA ACAACGTAGG AGGACGAATT GACGTTAATT CTAGCCGGTG CAGTGCTGGG TTTTGCCGCC GGTTTGGTGC AACAAGGACT GGAGACGGGT GCCATTCGCA TGCCCAACGT TTGGCAGGTT GTCCGAGACT TCTCCAAGGC TCCGAAAGCG CAAAGTAAAC TCGTTTTCGA ACGAAGTAGA TCGGCACTTG CTCGTTCGAC 
ACGTCGGATA CGACAGGCGT TGGTGGGGCC ATTCCGGAAG CGCAGGGACA AATTGCCCGG GGACGAAAGT GAAGCGAGCT AG
|
Protein sequence | MPSHRQVQSI GTRNGVKLGR FGIVSALCLI ITPSAAFTTT SSFPVIWRVE PNLLQGRNPI SRFHFRTPDY ATRPRRRGPL TTTSADENDK KSGPLDKAVA KFKARPGTYL LIPCVAAIVG WFTNWLAVQM IFYPVRFRGI PIYRRPEIPL GFLGWQGIVP CKTRPMSEAM VEMVTSQLLT VEEVFARLDP KKVASLLAPE VPKLTNSILQ DLFPKWVTAM PSAVFQGLDS VSQGVMMHFN RKFLEGLTKN MQSNIDSIFS LRNCVVDQML RDRSKLGELF KICGQKELDF LTNSGLWFGF LLGLIQMAVA LFWDNPWSLS IGGGIVGLAT NWLALKWIFE PVDPTRIGPF VLQGQFLRRQ PEVAKEFSAF FANQILTAEQ LWYSVLNDPA TKPAFATMFA SHFTNFVHKI THGFRVTLEP ETMKLAAAKA LEKLPNHVPV LFPYMDKALQ LESTLRVKME QMTSRQFERA GAVLGFAAGL VQQGLETGAI RMPNVWQVVR DFSKAPKAQS KLVFERSRSA LARSTRRIRQ ALVGPFRKRR DKLPGDESEA S
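The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates (which agrees with 251592 - 249711 + 1 = 1882) and that the sequence fields are whitespace-separated blocks of bases. All function names are illustrative.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


if __name__ == "__main__":
    span = gene_length(249711, 251592)   # Start bp / End bp from the record
    cds = coding_length(551)             # Protein Length from the record
    print(span, cds, span - cds)         # prints: 1882 1656 226
```

The 226 bp difference between the genomic span and the coding length fits the "Is gene spliced | Yes" field, assuming the span covers only coding exons and intervening introns. Passing the Gene sequence field to `gc_percent` gives a value to compare against the reported GC content.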