Gene Information |
Locus tag | PHATRDRAFT_14607 |
Symbol | |
ID | 7203237 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 689366 |
End bp | 690608 |
Gene Length | 1243 bp |
Protein Length | 337 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182442 |
Protein GI | 219124294 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.501177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCCG AGATACGACC GATCGATCCC GAGTCGGTCC AGAGGATTGT TGCGGGACAA GCGGTCTTTG ATCTGTCCAC GGCCCTCAAG GAATTGGTGG ACAACGCTCT GGATGCCAAG AGTCGCACCA TCAACAGTAA GTGTCGGACG GTGGTTCCAC ACACGCGCAC ACGGTCAGGA GAAAGTCGAC GGTACGTTTG TCCGTCGCTC ACCGCCTTTT TGTTCGTCTT TTTTCTGTTT GTAACTTTCC GACACTGCGG CCCCTGCTCC AGTCCGTCTG TTCAATCTGG GCATTGACAT ATTGGAAGTA TCGGACGATG GCATGGGTGT TCCACTCGCC TCCCGTCCAT ACTTGGCCAC GCCGCACGCA ACGAGTAAAA TACGAGCCTT TGACGAAATC TACGCCACGG CATCGACTCT GGGCTTTCGT GGCGAAGCGC TCTTTTGCCT CGCCAACCTC AGCACCAACC TCATCGTGGC CACCCGCACC CGCGACGAAG CCACGGCGCA GAAACTCGAA TTCCGTCGGG ACGGATCGCT ACGGACAGAC ACCGTGACGA GTATACTCAA AAAGGTCGGG ACGACCGTTG CAGTCGTTGG TTTGATGGAC GCCTTGCCGG TTCGTCGACA CGATTTGATT AAAGGCATTC AGACACAACG ACGGACCATG CTACGCATGC TGGAAGGCTA CGCCATCTTC TCTCCCGGAG TCGGCTTTCG TTTGATGGAC ATGATGGATG CGGGACGGGG AGGAGAATCG CTCTTACTAG CGACCCCGCA GAATAGCTCG TCCATCGAAG AAACCGTGTC GGCGACACTG GGACCCAAAG TCTTGCCCTT TCTGTGTCCC ATTCAGGTCG ACTTGTCGAC CGTCCTAGAG GCACCGGTAC CGAACTTGTC CCCCGCAACC ACGTCATCCA ATCCTCCACT CCGTACACCG ACACTCTCTG GCAAAATGGA AGGTCTCATT TCCAAAGCCA AGACACAATC ACCACGGAAC TCTCAATACT TTGCCATCAA CGGACGCCCC GTCGAGTTGA AACAAGTGTC CAGGGTCTTG AACGAGGCTT GGAGGGCGTT GGGGTGCAAA AAACGTCCCT TTTGTGCACT GCAATTCACA CTCCCCAACA ACGAATACGA CATTAACCTG TCGCCCGACA AACGGACCGT CATGCTGACG CACGAACCAC AAATATGCGC TCTGGTTCGA GACGCCGTGG TCGAACTCTG GGCCAGTCAA ACC
|
Protein sequence | MSAEIRPIDP ESVQRIVAGQ AVFDLSTALK ELVDNALDAK SRTINIRLFN LGIDILEVSD DGMGVPLASR PYLATPHATS KIRAFDEIYA TASTLGFRGE ALFCLANLST NLIVATRTRD EATAQKLEFR RDGSLRTDTV TSILKKVGTT VAVVGLMDAL PVRRHDLIKG IQTQRRTMLR MLEGYAIFSP GVGFRLMDMM DAGRGGESLL LATPQNSSSI EETVSATLGP KVLPFLCPIQ VDLSTVLEAP TQSPRNSQYF AINGRPVELK QVSRVLNEAW RALGCKKRPF CALQFTLPNN EYDINLSPDK RTVMLTHEPQ ICALVRDAVV ELWASQT
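
The length fields in this record can be cross-checked against its coordinates and sequences. The sketch below is only an illustration: the helper names (`inclusive_length`, `residue_count`, `gc_fraction`) are hypothetical and not part of any database tooling. It assumes that Start bp and End bp are both inclusive, and that the sequences are printed in space-separated blocks as shown above.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def residue_count(grouped_sequence: str) -> int:
    """Count the letters in a sequence printed in space-separated blocks."""
    return sum(1 for ch in grouped_sequence if ch.isalpha())


def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring spaces."""
    seq = "".join(ch for ch in grouped_sequence.upper() if ch.isalpha())
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values copied from the Gene Information table above.
gene_len = inclusive_length(689366, 690608)
print(gene_len)  # 1243, matching the Gene Length field

# 337 aa correspond to 3 * 337 = 1011 coding bases, so 232 bp of the
# gene span are not accounted for by the listed protein sequence.
print(gene_len - 3 * 337)  # 232
```

The 232 bp difference would be consistent with the "Is gene spliced | Yes" field if those bases are intronic, which is an assumption here rather than something the record states. Passing the full Gene sequence and Protein sequence values to `gc_fraction` and `residue_count` would allow the GC content and Protein Length fields to be checked the same way.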