Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48625 |
Symbol | |
ID | 7194885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 421182 |
End bp | 423293 |
Gene Length | 2112 bp |
Protein Length | 666 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183092 |
Protein GI | 219125658 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTACCA TTGCGCCCTT GCGGTCTCCC TTGATTCGCG CCGGTTCCTC CACAACGTCC GCCTTTCAGC GGGTACAGAA CGGCATCGGC CATTTAGTCG CCACTGCGGC ACCCGACACC AGCAAGAGCA GTACTGGCGG TACGGAATAC AGCACCTACC TGACTCGCGT CAGTCACCAA GCACCCGCAC GTTTGATTCC ACTGCAATCC AGTGCCGTCC AAGCGGCCGG CGCCACGATT GCCTACCTCG GTAACTACGG TGGCGGCATC CTCCCCGGCG ATACGCTCCA CTACCGAGCG GATGTTCTGT CGCACGCTAC TCTCGCCCTG CTGACGCAAG GATCCAATCG TGTCTACCCG CAATCCAGTC TCCCACAAAC CGTTGATGGT ATTGCAACAT CACCAGGACC GCAAACGACC AAACCCAGTC GCAACGTCCT GGATCTGCGT GTCGACGAAC ACGCGTTGTG CGTCGTGACC CCGGATCCAG TGACACCCTT TCGGGGCAGT GTTTTGGAAC AGATACAGAC AGCCGTCATC GATCCTAATG CGTCCTTGGC CTTGGTGGAT TGGATCTCCT CCGGTCGTTA CGTCCGCGGA GAACGATGGG AACAAACCTC GCTGCTAAAT CAGACGGAAA TTTACTTCAA CGACCACGAC TCGTCTCATC CCGTCCTCGT TGATCGTGTC ATCCTGGACG GCAACACCGG TATGGACTTT CGGAACGCGT CTTTGCAATT CAACGCCTAC GCCTCCCTGG TCCTGTACGG ATCCCGGGTC GAAGCCGTCG TCGAGCGCTG CCACGACATG GCTGCCAGCT TGGCGCGTCC CTACACTCGG ATTCGGGAGC GTACCCCCAA CACACGGACA CAAACTCGCA ATAGCGATCA TGACAATCTT CCGTTTGATG GGAGCTGTTT GGCGGGACGC GTCCTCTGCA GCGTCACCAC GGTACCTACA CGCCACGCTG ATGTGTACGT CGCGCGACTG GCAGCCTGCT CCAACGAAGA CTTGTACCGC GTTTTCCACC ATTTATTGCA ACCCACCGCG GAGGACTTGG GCTACCCCAT TTACCGTGAT CGCATTCGGG CCGTCACGTC ATCGCCCGTG CGGAACATTT CCGCAGTTCC GGAATCGACG TCACGATTGG AGGAGAATGC CGCGTCTGTG GCGATCGTTC CCACCGCTGA CGATCACCTT GTACCGCCTC CATCGAGTGA CGCCACTTAT TGGAATGCCT TTGTATTGGC TGACTCGGCC TTGCCGACGG GAAGCTTTGC CCACTCGGCC GGCTTGGAAG CCGCTGCGCA ACTGGGTCTC GTAGCGGCGG GCAACAACGA CAATTCAATG GTTTCCGATC TCGTCCGCGC CACGGCCACA TCCACTATGC AAACCGTCAC GCCCGCACTT CTCGCTAGCC ACAAGGAAGC TTCTTCTAGC GCGTTCGGTG CCGACGGAGA GTTGGATGCA GAATTTATCA CACGTTGGAA CCACTTAGAT CGAAATCTCC ATTCCTTGCT GGTCGGCAAC GGACCGGCTT GTCGAGCCTC CCTAGACCAA GGTCGCAACT TGTTGCGCGT GGTGCAGGTT TGGTTCCAGC AAGGTACCAA AACACACACA AAAGTGACTA CCAATCAAGT ACTGTCGATC TTGCAAGCTC GCGTTCGAGA CGACCCCAGC GCCGGTCATT TACCCACTAT TTTTGGTGCC GTTGCGGCCT TGTTGGACTT GACCGCCCCG CAGGCGTGTC AGCTTCTAGC GTACTGTGTC GCTCGGGATG TGGTGTCGGC CGCCGTGCGC CTCAATCTGG TGGGTCCCAT GGCTAGTGTT GGGATTTTGT CCGATGCGCA GCAGGCCGGT CAGCGAGGAA TTGCACTGGG ACGCACTTCA TACGAAAACG GACTCTCTGG AGGAAAAGGC TCCGGCGCCT CCTGTGCCCC CGTATTGGAC GTCCTGCAAC CGTGCCACGA TTTGCTGTCA ACTCGCCTGT TTCGGACGTG ACGATTGAGA AGAGAAAACA TGCCGAATGT GAATTCTAGA AATTATGCAA TGTCTGATAA GAGGCCGTAC TTTAAAACAA ACTTCCTTAT GACTAAAAAA ACTCGACGAA CG
|
Protein sequence | MGTIAPLRSP LIRAGSSTTS AFQRVQNGIG HLVATAAPDT SKSSTGGTEY STYLTRVSHQ APARLIPLQS SAVQAAGATI AYLGNYGGGI LPGDTLHYRA DVLSHATLAL LTQGSNRVYP QSSLPQTVDG IATSPGPQTT KPSRNVLDLR VDEHALCVVT PDPVTPFRGS VLEQIQTAVI DPNASLALVD WISSGRYVRG ERWEQTSLLN QTEIYFNDHD SSHPVLVDRV ILDGNTGMDF RNASLQFNAY ASLVLYGSRV EAVVERCHDM AASLARPYTR IRERTPNTRT QTRNSDHDNL PFDGSCLAGR VLCSVTTVPT RHADVYVARL AACSNEDLYR VFHHLLQPTA EDLGYPIYRD RIRAVTSSPV RNISAVPEST SRLEENAASV AIVPTADDHL VPPPSSDATY WNAFVLADSA LPTGSFAHSA GLEAAAQLGL VAAGNNDNSM VSDLVRATAT STMQTVTPAL LASHKEASSS AFGADGELDA EFITRWNHLD RNLHSLLVGN GPACRASLDQ GRNLLRVVQV WFQQGTKTHT KVTTNQVLSI LQARVRDDPS AGHLPTIFGA VAALLDLTAP QACQLLAYCV ARDVVSAAVR LNLVGPMASV GILSDAQQAG QRGIALGRTS YENGLSGGKG SGASCAPVLD VLQPCHDLLS TRLFRT
|
| |