Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41265 |
Symbol | |
ID | 7198989 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 340299 |
End bp | 341317 |
Gene Length | 1019 bp |
Protein Length | 304 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185260 |
Protein GI | 219130202 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCTCG ATTGTAGTAC GATTCAATTA ATTTTCGTCG ACGTGACCAA TGAATACGAT CTCGATGAGG GTGATTGTCA GACTAAGAAC CTGGACGCAT TATTAAATAG GTTGAAACTC GCGTGCGTCT TTCAAAATGC GTGCGCATGT CTGGACGCAG ATGTGGATGC GACGAACAAG GACGTATACA ACGTTGTTCG AAAGGAAATT CTCCGGTTCG TCAAACCACG AGACCGCTAT CTTGCCCTGG GGTCATTGCT ACTGAAAAGC CAAGTATTTC ATGATACATA TGCACATCTT GATAGATCCT TAATAGTCAA TGTTCCCCGG ACCGAGCACA AAAAGCCTTA CATCCCGGTT GCCACGAGGC TAGGCCATAT TGAGGAATGT GACGTATACC CTTTCAGCAT CTCCCATCAG TTTCCTTTCG TTGCGTCAGC TCGCCTCCTT ACCGAGACCA GTGACTCAAC AAAAAGAGAA CATCCAATCC AGGTGGGTGT AGATATTGTC ATATACGAAA ACCATAATCC ACAGCTGTAT GACTCCGTTT TAGAATTTGT CGATGCTTTT CGATCAAGCT TTTCCGATGC AGAATGGGGC GATTTGCAGT TTTGTCGAGA AAATGAGCAC AACCTATTAA GAGAGCTATA TCTTCGCTGG GCGACAAAAG AAGCATACAC AAAAGCACTT GGTGTTGGAC TGGGGTTTAA TTTTGCAAGT TTCGACATTC GTTTGGGCCC GCTGCCGTAC GGAAGTCTCT GGAATACAAT TGTCGAGGCA CAGAACGAAA CAATTCGATT TGAAGGTTGC GTTTTTACAT TTGAAAAGAT CCGGCCATCG ATGGAGACCT GGCTGTTTTC TTTTCATCCT CTTTCCCAGT CACATGGTAG CTACGAAACG CAAGGCTGCG GATGTGTCGC GGTAGGGCCT CTTCGTAAGT CAGATTCTAT TAACATTGAA AGTGACTGGA CTACGCTGCC TCAACTAATT CAACAGCATA TGCCATGCTC TCTGAATGA
|
Protein sequence | MDLDCSTIQL IFVDVTNEYD LDEGDCQTKN LDALLNRLKL ACVFQNACAC LDADVDATNK DVYNVVRKEI LRFVKPRDRY LALGSLLLKS QVFHDTYAHL DRSLIVNVPR TEHKKPYIPV ATRLGHIEEC DVYPFSISHQ FPFVASARLL TETSDSTKRE HPIQLYDSVL EFVDAFRSSF SDAEWGDLQF CRENEHNLLR ELYLRWATKE AYTKALGVGL GFNFASFDIR LGPLPYGSLW NTIVEAQNET IRFEGCVFTF EKIRPSMETW LFSFHPLSQS HGSYETQGCG CVAVGPLPYA MLSE
|
| |