Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45050 |
Symbol | |
ID | 7200068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 58114 |
End bp | 59971 |
Gene Length | 1858 bp |
Protein Length | 516 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179130 |
Protein GI | 219116671 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATTGCGTTC TCGCGGGGCC TACACTGATG GATGGTAAAA CCAACAGACA GCCACAAGCT CGTATTGCCG CCCGCATTGG ATAATACGTG CCATGGCCTC TTGTACCGTG GACGAGTGTG CCGGGAATGA TCGAATCCTT GGCGTCGTTC GATCGCGAAC GGACGGATGT CAGAGTCGGC TTTGGGACGA TACCGAGGCG CTTGAAAAGG CTAATGCTGA AAAGATGCAA TTGCAGAATC AGCTGCGTGC GACTGATTGT GGCTCGGATA CGATGGCACT GTCAACGGCA TATCCTGACA GGAACGTACG TACCGGAGGA CGGTGTAAGA GTATTTCCGA GTCGAACCTC AATTCGGAGA AGAATGCTGA GCACTCGTCC ATCAGTGACG GTCTGGATCT CGCGGAGAGT GCCGGGAGTA TAGAGATCAG CTCGGAAGTC AGCGCCTTCG CAGGAAAGGA AAGATCGGGG CCTCCAATCG CACGGGATAT GGATCGTTCG ACAGCACCCT TGGATGTTAT CGATCCTCCA TCAGCCGTCA GAGGACCATC TTATACGGGC AATATGGGAG TTATTGCTGA AGTCGTGCCA CAGCCTCAGT GTGAAGAGCT ACGGACGCCA ATCGGAATTT GTTCCATAGA CGATTGCTTG CTCGATCCGA AACTAAACCA CTGTTTCGAT GGCCCTCCTG TCGCCACACC GTTTCTCAAG GACGTTGAAC GATTGCATCG TCTTGTGGAT CGCCACGATT GGAACAAATT GCATCGTCTC ATCATTCACC AGCCAGATTT AGCTCGTCAA GTGATGTCCA CCACTTGTCA AGGTGAACGC AACCAGTGCA CGCTTCTGCA TGCTGTTTTG TTGCGGTCGT CGCCCAAGGA TCAGGTTGCC ATCTCCATCG ATACTATCGA CGCAGTCCTA ACAGCCCACC CGTCAGCTTT ACTACTACCG GATACACGCG GACGCCTGCC TCTGCACATC GCCTTGCTAC GGAGCGCGTC ACCTTTTGTT TTGCGATACA TTCTGAAAGC TCGGATTCAA GCGATTCGAC AAGCTGACGA GGACGGTAAC CTTCCTCTCC ACTACGCTTG CTCGTACGGC ACACCAACTG TTGCGCTCGA CATTCTTCGG GAGTGGCCCA AAGCTTGTCA AATCTCGAAT CATCGCAACC GCTTACCTCT TCACGGACTC TGTAGTGTCT GGTTCGACAA GGACGAGGTA CACGGATGCA AGGACAATGT CAGTCCTTGC TGGTTGCAAG TGCTGAACAC TATGCTCGAT ATGTATCCCC AAGCAGCATG GGCCAAGGAT CGACAAGGTA GACTTCCCAT CCATGTTTTA TGTGCCACCC ATCCTCACGT TCCGTTCAAC GTGTTGCACG TCTTGATTAC ACTGCATCCC GCGTCTCTGT TGGTCGGAGA CGTAACCGGT CAGGTTCCAG CAGATCTTGT AGTCCGATCT GGTGGGCGCT GGAGTAAGAC TTGCGACAAT GACGTTGTCC TTCAGTACCT AATGGAACGC ACACAGCGGG AACGACGTCC GACTCAGTCA CCCATTACGA GATTACTGGG CATGGGCGCT CACAAAGGGA AGCGTCGCAA GAAACCGCAC GTCCTAATTG ACTTACGTGA TTGCTACGGA TAGAAACATA ACACACCACA GGGCGAAAGG ACAGGACACC GAACGAAAAT AGATACATGA TTTTCGTTTC CAGAACCAAA ATATTGTGCT GAAGCGCTAC GTATACTCAA ACACTCCTCT TGATAGAAAG GTGCACCATA TTCCGACTTT CAACATATAC GCATTGCGCT GCAAAGGAAT AATTAAAATA CGTTTAGAAA GCAAGACAGC CTTTGTCC
|
Protein sequence | MASCTVDECA GNDRILGVVR SRTDGCQSRL WDDTEALEKA NAEKMQLQNQ LRATDCGSDT MALSTAYPDR NVRTGGRCKS ISESNLNSEK NAEHSSISDG LDLAESAGSI EISSEVSAFA GKERSGPPIA RDMDRSTAPL DVIDPPSAVR GPSYTGNMGV IAEVVPQPQC EELRTPIGIC SIDDCLLDPK LNHCFDGPPV ATPFLKDVER LHRLVDRHDW NKLHRLIIHQ PDLARQVMST TCQGERNQCT LLHAVLLRSS PKDQVAISID TIDAVLTAHP SALLLPDTRG RLPLHIALLR SASPFVLRYI LKARIQAIRQ ADEDGNLPLH YACSYGTPTV ALDILREWPK ACQISNHRNR LPLHGLCSVW FDKDEVHGCK DNVSPCWLQV LNTMLDMYPQ AAWAKDRQGR LPIHVLCATH PHVPFNVLHV LITLHPASLL VGDVTGQVPA DLVVRSGGRW SKTCDNDVVL QYLMERTQRE RRPTQSPITR LLGMGAHKGK RRKKPHVLID LRDCYG
|
| |