Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31492 |
Symbol | |
ID | 7196677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 206169 |
End bp | 207809 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176545 |
Protein GI | 219109581 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTAG TGGCGCAACA GCTGGAGACG CACAAGCGAC TGCAAGCCGC GCGACAGGAA CTATGGAGAC TATATCTGCC AAACTTTCTG TGTCCCTACC GCGAGTTCGT ATTAGACCCT AGCCTTCAGT TCTGGACTAT CGTTGTTACT CTCGCTTTAC CCAAACGCTG GGATTTTTCT ATCTCTCGAA TTTCACAGAC GTTGTTCATA CGTCCAACTC TTGCTACTTT GCGATTCTTC TCCAAAGACC CTGATGGCGA AGATCTTCTG GTCACCGCTG GCAAAGAGCT GCTGCACCCA GTTGCGCCCC GTCCACTGCA TTTTCAGCCT CCCTCCGACA TACAGTCGGC TCTGTCGGGT GTTGCCGCGT TTGTCCTGAG TGTCGTCGCG GACGTGATTG GTGCATTTAT GGATCCCGTT AAAATGAAAA AATGGTTTTC CGCAATGTCG GCCTTCCGGG CGTACTTACA AGCATCCGGT GTAGGTGCAG AATTGGAGGA ATCCCTCATC AAACCTCTTT GGAGAGGGCG GCTTCTAGAC AACCTAAAGA TTCTCAATGA CTGCCAAGAA ATCTTAGACG AGGACCGTAC GAAACTTGCG AATGACCTTG ATTCAGAGGT AAGCTCGGAA GACCTCGTCG TCGGTTGCAG CATGATGCGT TTCGCCACCG CCGCGTATGG TGTTGAAATG GTTCGCTCGG CAATTGATCG CGAAGCGAAT TACGAACATG TCAACAGTGA GCGAAAGGCC ATTGCATTTC ACTGCAATAT CCCAACCGAG GACGTCAAGT ATATTTATAT CCAGCCAGGA GACGAAATGC ACACGATGCG TCATTTCATT GCGGTCGATG AAAAGACCAA ATCCGTCGTT CTCGCCATCC GGGGAACGTT ATCAATTTCC GGTGCCTTGG CGGACATGCA AGCTATGGAT TTTGATTTTT GTGGCGGCAA GGCACACATG GGTATAGCGG AACAAGCCAA TTTACTTTGG CAGAAAACAG GACAACGCCT CCGCAGGATC GCTTCCGCAT ACTCGGAAGA ATACCGAATC ATTTTTACGG GACATTCGCT TGGAGGAGGT GCCGCGTGCC TATTGCACGT GAAAGTGCAC ACAGAGAATC TGTTGCCGAC GAGACAGGTC TACTGCTACG GCTTTGCACC CCCACCAACA TATTGCAAGG GTAGCACTCC TTCGCCAGGT CTGGAAATGG CCGTCAAGAA CTGTGTATGC TTTGTGCACG ATAACGACTG TGTTCCACTT CTGAGTGTGG CATCCATCCG TCGTCTGGCT TGCCTTATGG ATGCGGTTGA CAATTGCACG GAAAATCTCT GGTTCACGAC ACGTTTCCGA ATCTTTTGGG AGTTTGTCAA GGTCCCTGGC GATATCGTCA AAACTGTCTG CAGCGTTAAG CATGACTCGA AGGCAGTCGT TGGTGAGTCA GCCATGGTCA TCCCAGCCCG TTGCATTGTT TGGATGAAGA AGACCTTAAG TGGACGTTTT GAGGCCCTTG CGTGTAGTTC AAAAGCCATG GCCTCCATGA ATATCTTTGT CTGCCAAGAT ATGATTGCTG ATCACATGCC AGAGCAATAC GAGGATGCCC TAGACAGTCT TGTAGCTAGA AGGTTTCAAG AGCAACTGTA G
|
Protein sequence | MSLVAQQLET HKRLQAARQE LWRLYLPNFL CPYREFVLDP SLQFWTIVVT LALPKRWDFS ISRISQTLFI RPTLATLRFF SKDPDGEDLL VTAGKELLHP VAPRPLHFQP PSDIQSALSG VAAFVLSVVA DVIGAFMDPV KMKKWFSAMS AFRAYLQASG VGAELEESLI KPLWRGRLLD NLKILNDCQE ILDEDRTKLA NDLDSEVSSE DLVVGCSMMR FATAAYGVEM VRSAIDREAN YEHVNSERKA IAFHCNIPTE DVKYIYIQPG DEMHTMRHFI AVDEKTKSVV LAIRGTLSIS GALADMQAMD FDFCGGKAHM GIAEQANLLW QKTGQRLRRI ASAYSEEYRI IFTGHSLGGG AACLLHVKVH TENLLPTRQV YCYGFAPPPT YCKGSTPSPG LEMAVKNCVC FVHDNDCVPL LSVASIRRLA CLMDAVDNCT ENLWFTTRFR IFWEFVKVPG DIVKTVCSVK HDSKAVVGES AMVIPARCIV WMKKTLSGRF EALACSSKAM ASMNIFVCQD MIADHMPEQY EDALDSLVAR RFQEQL
|
| |