Gene Information |
Locus tag | PHATRDRAFT_14068 |
Symbol | |
ID | 7202433 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 486300 |
End bp | 487388 |
Gene Length | 1089 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181568 |
Protein GI | 219122472 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.763403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCGG TACCTTGTCG TGGAAAACCT CGTTTCTCTA CTGTGAGTAT CGCGATCCCT GGATCGGTTG TTTCCAACTG TCAAACTCGA GAACTACGTA CACAAATGGT TGGCCAGCTA GCGCGAGCGG CAACTATATA TCACGTGGAT GAAATTATTG TTTTTGACGA CAAGCTTGCT AAAGAAATGA AGCCAGATCG GGGTTACTAC CAACGAAGCA ACCATCATGG TGGGTCAGAC GCTTCGCCTC GGTCTGGACA GAAAGACAAT CAAGTCGACA AGGATGAATC ATCTCAGTCA AACCGTGACG AACCCAGTCG CACGCCATCA AGCCGGTCAG ATCCTCACGA ATTTATGGCA CGCGTTTTTC AGTACTGTGA ATGTCCCCAA TACTTGCGCC GGGATTTCTT TCCCATGCAT GGTGATTTGC AGTATGCTGG ACTCTTGGCA CCAATGGATG CGCCCCATCA TGTGCGAGTA AATGACCGGG CGCGATTTCG GGAAGGCATT GTCTTGGAAA AAACCTCATC TACAAACGGG AACTCTCTGG TGCACTGCGG TATTCGAGGT CGGCCCGTCG AGATTGATGT TAAGCTGACG CCTGGCATTC GGTGTACTGT GCAGCTAGAT CCCAAGGCAT CGTACGAGAC TGGGGGCAAG CCCAACAGTA TCATCCGCGG CGGCAAGGTA GTGTCGCCAT CGGCACCGCG GAAATTTGAT GGCACCTACT GGGGTTACAC AACTCGACTG GCCTCGTCCA TTAAGGCTGT TTTTGACGAA TGTCCTTTTG GAGTATATGA TTTAAAGGTT GGGACAAGTG AACGGGGGAG TACTTCGTTG GACGACGGAA AGTTTCGATT GCCTTCCTAC CAGCACGCAC TAATTGTCTT TGGCGGAGTG GCTGGTATTG AAGAATGCGT CGACGCGGAT GAGAGTCTCT CGCTGCCGGG GTCGCAAAGC CGAAAGCTGT TTGATTTGTG GGTGAATATT TGCCCTTTTC AAGGATCACG AACCATTCGT ACCGAAGAAG CAGTGTTGAT AGCATTGGCC AAACTCAGTC CCTTGCTGAG CACAGCGGTC AGCCCACTC
Protein sequence | MDAVPCRGKP RFSTVSIAIP GSVVSNCQTR ELRTQMVGQL ARAATIYHVD EIIVFDDKLA KEMKPDRGYY QRSNHHGGRS DPHEFMARVF QYCECPQYLR RDFFPMHGDL QYAGLLAPMD APHHVRVNDR ARFREGIVLE KTSSTNGNSL VHCGIRGRPV EIDVKLTPGI RCTVQLDPKA SYETGGKPNS IIRGGKVVSP SAPRKFDGTY WGYTTRLASS IKAVFDECPF GVYDLKVGTS ERGSTSLDDG KFRLPSYQHA LIVFGGVAGI EECVDADESL SLPGSQSRKL FDLWVNICPF QGSRTIRTEE AVLIALAKLS PLLSTAVSPL
| |
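The derived fields in this record (Gene Length, GC content) can be rechecked from the coordinates and the sequence. The following is a minimal sketch, not the method the database itself uses. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with 487388 - 486300 + 1 = 1089, and that the spaces in the sequence fields are display grouping only. The function names gene_length and gc_fraction are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace is removed first, because the record displays the
    sequence in space-separated blocks of ten characters.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(486300, 487388))

# A short illustrative sequence, not taken from the record.
print(gc_fraction("ATGC GGCC"))
```

To check the record itself, pass the full Gene sequence value to gc_fraction and compare the rounded percentage with the GC content field. Note that 330 aa corresponds to 990 coding bases, fewer than the 1089 bp gene length, which fits the record's statement that the gene is spliced.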