Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37164 |
Symbol | |
ID | 7202133 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 303064 |
End bp | 304104 |
Gene Length | 1041 bp |
Protein Length | 305 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181348 |
Protein GI | 219122010 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCG TCAACGGAGA CTCATCCTTC CCTACGGAAA TGTCGGAAGA CGAACGCTAT CTTTTCGATT TGAATGGATA TGTAATTGTT CGCGGTGTTC TCACGCCTGA ACAGGTGGAA GAAGCCAACG CTATGATTGA CAAGCACGAA AGCGAGATGA TCCAGCGGTC CGATGCTGCC TTGCGCAATG CTGTGAAAGG GACAAAGCTT TACGGGTCCG GTCCAGGCCG TAAGGATTTG GGTCAAGTTT TGGAATGGGG AGCCGACTCC AAGCTCTTCA AATCGATTCT CGCTCATCCT CGGCTCGTGC CACTGTACGT TACAGTAAAA TCGACGTGGG CAGATAGCAA TCTCTGTAAT GTACCACTTT TGCCCGTTTC TCCTATTTCC TCACTCGCAA ACATGTTCTT ATTCCAGATT CCATGGTATT ATCGGGAAAG GGTATCGAAT GGACCATTTG CCTTTCGTAA TTGCTCAAGA CAAAGGCGCA GAGGGCTTCC AGCTTCACGG TGGAACGATC GATTGTACGT CGGGCGAATA CAACCCACAT TTGGCCTATA CGTACCACCA CGGAATGATT CGGAGTTCCC TTCTCGGATG CAACGTCATG CTGACGGATC ACAATCCCGG TGACGGCGGT TTCTGCATTG TTCCCGGTAG TCACAAATCC AATTTTAAAA TGCCCAAGGG AATGGTGGAC GGCGAAAAAT ACGACGAGTT CATCCGACAA CCGGCAACCA AAGCCGGTGA TGTTGTACTC TTTTCCGAAG GTACCATACA CGGAGCTATG GCTTGGACGC CAGAAGAGCG ACAGCGCAGG GTGTGCTTGT ACCGATTTTC GCCCGCTACG AACGTCTACG GTCGCTCCTA CTTTGGGCAC GAAGGGGGCG GTTGGCCCGA TGCGATATAT GATGACTTGA CCGAAGCGCA GCAAGCCGTG CTGGAACCGC CGTATGCTAA CCGTCTGGAC CGTCCTAATA TAAGAGATGA CGGTACCGTC GAAATTACGA CTCGTAGCAC GCGAAAGAAA CAGCACGATA G
|
Protein sequence | MSAVNGDSSF PTEMSEDERY LFDLNGYVIV RGVLTPEQVE EANAMIDKHE SEMIQRSDAA LRNAVKGTKL YGSGPGRKDL GQVLEWGADS KLFKSILAHP RLVPLFHGII GKGYRMDHLP FVIAQDKGAE GFQLHGGTID CTSGEYNPHL AYTYHHGMIR SSLLGCNVML TDHNPGDGGF CIVPGSHKSN FKMPKGMVDG EKYDEFIRQP ATKAGDVVLF SEGTIHGAMA WTPEERQRRV CLYRFSPATN VYGRSYFGHE GGGWPDAIYD DLTEAQQAVL EPPYANRLDR PNIRDDGTHA KETAR
|
| |