Gene Information |
Locus tag | PHATRDRAFT_21316 |
Symbol | |
ID | 7202128 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 271842 |
End bp | 274425 |
Gene Length | 2584 bp |
Protein Length | 727 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181340 |
Protein GI | 219121994 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage Information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGATCGTTG TCTTCCTTTG TTCTACTTTT TGTGACAATC GCATTGAGGT GGATCGAAAG CACACGTGTC CATTGGTTTG ACCAAGCAGA TCATATCCTG TGCAAAACTG GAGCGGACAC GCGAACGAAA AGGGCCATGA AAGAGATAGA AGCCTGGAAG AGTGCTCTTG GTTCATTGGC TTTCCTGTTA TCTGCGGGCA CGTCAACGAT ATTATCGTGC GAGGCTTTTT CCTTATATCC ACAAGCGCGA TCGCGACCTT TTTCCAATCC GCATTATGCG ACCGTAGAAG CCGATGCCGT CAATGGCGCT TCCTCCGTCG CGGCACAATC CTACGGGGCC GGTCAAATCA CGGTTCTGAA GGGTCTTGAT CCGGTCCGCA AGCGTCCAGG AATGTATATT GGATCCACAG GCCCAGACGG ATTGCACCAC TTGGTCTGGG AAGTCGTGGA TAATTGCGTC GACGAAGCCT TGGCGGGACA CGCCACGTTT GTGACGACGA CGATCCACGC CGACGGGTCC TTGACCGTCA CCGACGATGG ACGTGGCATC CCGACCGATC TGCATTCCGA AACCGGCAAG TCGGCTCTGG AAACGGTGCT GACGGAACTG CATGCCGGCG GAAAATTCGA TAATCAAGGT TCCAACAGTG GATACAAAGT GTCGGGAGGA TTGCACGGCG TCGGTATTTC GGTGGTGAAC GCTTTGAGTG AGTTCGTCCA CGTCAAGGTT GATCGCACAC CAGAGTTGTA CCAGATGCGC TTTGAACGGG GGGTACCAAC GGGGCCACTA CAGGTGAGCA AGGGAACGTC AAACTTTGTG GACAAAGATA TCGATCAAGA ATTGGAGCTC CTCAAGGCGA AATCGGATCA ACAGGATGAT GATAACACTG CCATAACCAT TCACCAGCAG AACCTAGACA AGCTCAAAAC TTTGTTAAGC AAACGTAAAT CTGGAACATC TGTTACTTTC CTTCCCGATC TTAAGGTGTT CAAGGGAGAT AACGGTAAAC CGGATATCAC ATTCGATTCC TCTCGACTCA AAGGACGCAT GGACGAGATT GCCTATTTGA ATGCCGGTCT AGTTTTGACG CTTAAGGATG AACGAAAGTC TCCAGGCCGT GGCCGTCTGC AAGTGTTTTA TCACGCCGGT GGCCTTGCCG AGTATGCAGA ATTATTGTGT CGGACCAAGA CTCCACTCTT TCAGGGAACA ACCTCCAGCA GAAAGAAAAA GCCCGCGAGA AAACAGAAAG ATTCGGCGTC CGACGATGAT GGTATTGCAA TGGATCCTGT GGCGGGTTTG CTTACACCTG ACGGTGCGAC AATACTGTGC ACTGGTACGA GCACGTCGGA CGAAGAAACC CCTCCAGTTT CCGTTTCGGT AGCTTTGCGT TGGTCATCGG ATATGTATGC GGAATCGATT CTCTCGTTTT GCAACAATAT TCGTACTCGG GACGGGGGGT CGCATGTGGA AGGCCTCAAA GCTTGCTTGA CTCGAACAGT CAATCAAGCA GCCAAACGTT CCAACAAAGC CAAAGAAGGG GCTGCCAATC TACCCGGCGA GTTCATTCGT GAGGGATTGA CAGCCATTGT ATCAGTCTCG GTTTCTGAAC CTGAATTTGA GGGTCAGACC AAGGGACGCC TCGGAAATCC GGAAGTACGG CCGGCTGTGG ATTCGTTGCT GAGTGCAGAA CTGACAAAGC TTTTTGACTT CCGACCTGAA ATTCTGGACG CCATTTACAA CAAAGCGAGT TCCGCACAGG CAGCCGCGGC AGCCGCCAAG GCCGCTCGCG ATATGGTCCG 
CCGTAAGACG CTGCTGACGT CTACGATTCT ACCCGGAAAG TTGGCGGATT GTGCGTCCCG GGATCCCGAA GAATCAGAGA TTTTCATTGT TGAGGGTGAC TCGGCTGCAG GAAGCGCCAA ACAAGGCCGA GATCGACGAA CGCAGGCTAT TTTGCCTTTA CGAGGTAAGA TTCTGAATAT TGAACGAGCA GCCACAGAAC GTATTTACCA AAACACAGAG CTGCAGGGAT TGATTTCGGC CCTCGGATTG GGAGTCAAGG GATCTGAGTT TGATCCTAAG TCTCTCCGAT ATGGTCGTAT TGTTATTATG ACTGATGCCG ACGTGGACGG CGCTCATATT CGTGTCCTGC TATTGACGTT CTTCTACCGC TACCAACGGG AGCTCGTGGA GAACGGCCAT GTTTACATAG CACAGCCTCC TTTGTACAAA GTGAGTGTGG GAAGTGGCAG GTCAAGAAAA GAAGGGTACG CATTCAACGA TACGGAAAGA AACACAGTAA TGATGCAAGT TCTCGGTGTT GATGACCCGA AGCAAGCCGA GGAGGCGCTT GCGGCCGGGA AAGTTTCTTT GCAGCGCTTC AAAGGTCTAG GAGAGATGAT GCCGGAGCAA TTATGGTCCA CGACAATGGA TCCAGAGCGG AGAACGATGC TTCAAGTTAC TGTCAATGAT GCATCGATGG CCGACCAGAC ACTGAGCATT CTTATGGGAG ATACTGTAGC CCCTCGGAAG GAATTTATCA GTACCCAAGC CGAGACGCTT AGAGTAGACG ATCTTGATTT GTAG
Protein sequence | MKEIEAWKSA LGSLAFLLSA GTSTILSCEA FSLYPQARSR PFSNPHYATV EADAVNGASS VAAQSYGAGQ ITVLKGLDPV RKRPGMYIGS TGPDGLHHLV WEVVDNCVDE ALAGHATFVT TTIHADGSLT VTDDGRGIPT DLHSETGKSA LETVLTELHA GGKFDNQGSN SGYKVSGGLH GVGISVVNAL SEFVHVKVDR TPELYQMRFE RGVPTGPLQN LDKLKTLLSK RKSGTSVTFL PDLKVFKGDN GKPDITFDSS RLKGRMDEIA YLNAGLVLTL KDERKSPGRG RLQVFYHAGG LAEYAELLCR TKTPLFQGTT SSRKKKPARK QKDSASDDDA LRWSSDMYAE SILSFCNNIR TRDGGSHVEG LKACLTRTVN QAAKRSNKAK EGAANLPGEF IREGLTAIVS VSVSEPEFEG QTKGRLGNPE VRPAVDSLLS AELTKLFDFR PEILDAIYNK ASSAQAAAAA AKAARDMVRR KTLLTSTILP GKLADCASRD PEESEIFIVE GDSAAGSAKQ GRDRRTQAIL PLRGKILNIE RAATERIYQN TELQGLISAL GLGVKGSEFD PKSLRYGRIV IMTDADVDGA HIRVLLLTFF YRYQRELVEN GHVYIAQPPL YKVSVGSGRS RKEGYAFNDT ERNTVMMQAL AAGKVSLQRF KGLGEMMPEQ LWSTTMDPER RTMLQVTVND ASMADQTLSI LMGDTVAPRK EFISTQAETL RVDDLDL