Gene Information |
Locus tag | PHATRDRAFT_44341 |
Symbol | |
ID | 7198031 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 273020 |
End bp | 274260 |
Gene Length | 1241 bp |
Protein Length | 368 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178466 |
Protein GI | 219115341 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
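The gene length listed above is consistent with the start and end coordinates when both are treated as 1-based and inclusive. The sketch below shows that arithmetic. The function name `gene_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the listed values, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(273020, 274260))  # 1241
```

The strand does not affect the length; it only determines which end of the interval the gene starts from.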
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.718391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAAATTGCCA CCATGTCGGA AAGCCAATTT GGTTCTACCA GCGTGGACGT GGCCAAAGTT CTACATATTT CGTTTGCGGC TGATCCCAAC GGAAGTCTGG GCTGTACCTT AGTCCATTGC GACAAGGCTG CGGATAACGA GATGTTCGTC CCAGGCTATG CTACCATCGG ACGTCTGCTC GACGGAGATA CGGTGGCTCG AAAGTTTGAT GTGCAGGTCG GTGACTGTAT CGTGGCCGTA AACGGTGAGG GATTTCGTCG CTTTGCTCCG GACTACGACA CGGACAAGGT GGAAGTGCTG AACAGGGAAG GCGAAGAAGT CGAGGTAGAG CTGGACCACA AAGTTATTTC GCCTGGAGAC GCGTACGATT GTCTTCTGCT AAAGATTAAG ACGGTCAAGT CTGCTGCGCC GGACCCACCT TTAATTCTGA CTCTGGAACG GTACAGTTGG GATGCTCGGC CAAACTCGTG GGGACGTTTC TTGGATGCAC GCGACGGCAA CGTCCCGGCC GCGATGCAAC TGATGCAGGA TCATGAGGCT TGGAAGGCAG CCCGATTTCC GATTGATTTG AAAACGAGCG GATTACAGAA AATTCTGCGA GAAAAGGCCG TTTCCGAAAT CGATGTTGAG TTCCTGCACG ACTTTCCGCC AACGGTGTAC GTGGAGTATG GGAAACTCTT GAATATGCAG ACAGCGGGGG AAATTACTGC GGACGACGTG GTCGCCGCCT TTGTCATTTT CACCGAACGC ATGTTGGCAA AGGCCAAGAA TCCACGCCAC CCCCAAACCT GCCAATTCAT AGATTTGTCT GGTATTGGCA TCACTTCTGG TCTTCGAGCC GAAACTCTGA AAAAGGTATA CAAAGTTTTC GAGCCCAATT ATCCCGAGAC ACTGTTCAAG ATGGTCATGT TTCCCGTTTC CACCATGTTT GTAAGTATTG TCAAAGTCGA CGTCTTGTTT GGCACGGGAT GTCTCGCAAA AATATTGATT CCTTACCGAA TTGCTTCGAT TCTTTGTTAT CCTCAAGGCA ACAACGGCAC GCACGCTGCT CAGTTTTGTG AACGAAAAAA CGCAAAAGAA GTTTGTGATT ACGAACAGCC TTGACAAGGT CTGTGCGGAA CTAGGATGGA ATAGACAAGA AGTCGAAGAT TGTGGTGGGG TAACCGAATT CATGCGCAAA CACGAAAAGG TCGGCGATTC GTTGCACTTT GAATAACGCA ATAAAGACAG TACACGAAGA T
|
Protein sequence | MSESQFGSTS VDVAKVLHIS FAADPNGSLG CTLVHCDKAA DNEMFVPGYA TIGRLLDGDT VARKFDVQVG DCIVAVNGEG FRRFAPDYDT DKVEVLNREG EEVEVELDHK VISPGDAYDC LLLKIKTVKS AAPDPPLILT LERYSWDARP NSWGRFLDAR DGNVPAAMQL MQDHEAWKAA RFPIDLKTSG LQKILREKAV SEIDVEFLHD FPPTVYVEYG KLLNMQTAGE ITADDVVAAF VIFTERMLAK AKNPRHPQTC QFIDLSGIGI TSGLRAETLK KVYKVFEPNY PETLFKMVMF PVSTMFATTA RTLLSFVNEK TQKKFVITNS LDKVCAELGW NRQEVEDCGG VTEFMRKHEK VGDSLHFE
|
| |
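The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before measuring length or GC content. The following is a minimal sketch of that clean-up. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database itself computes or rounds its GC figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence shown above, used only as a small demo.
first_blocks = "TAAATTGCCA CCATGTCGGA"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 45.0
```

Passing the full gene sequence string to the same functions would give the values to compare against the listed gene length and GC content.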