Gene Information |
Locus tag | PHATRDRAFT_47839 |
Symbol | |
ID | 7202995 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 201612 |
End bp | 205660 |
Gene Length | 4049 bp |
Protein Length | 922 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182176 |
Protein GI | 219123738 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.759183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCCATCGG GATGGTTCCA AACTGACAGT GATCGAGATT CGCGAAACAG TAAAGACAAA CGGCACACAT TTGCTCTTTC GTCGCGGTTT TGACCGATTC ACGACATGAA GGTCGTGACC GCCTCCTTCC TTCTGTTCAC CTTTGCGGCG TTCTTGGTCG GCACGCACGG AAAATATGAT AACGAGAGTC ACAAACTCAT TTTGATCCGA GGACGCCGCG GCTTTCGTCA CCGAACCGGC TCCTGGCTCG CCGCCAGCCC CGCAAATCAA AGAAGGTGCG GTGCCAAAAA TCTGTACACC CGAAGACCAG TACACCCGAA GACCAGTACA CCGTTCGTGT CCGGTGGAAT GCCCGAACCA TAGTGGCTGT CCCGTTGACT CACCAACGTC GGCCCCAACA CCTCCTCCCA TTGAGACTGC GACAAACTCT CCAGAAGGAT CTCCAACCGC GGCTCTACAA GAAACTGACT ACGGTCCGTA TCCGGCCGGT CCTTACATTT CGGTACAACA AAACTCGGAG CGCATCACTT TGCCTGAACC CGCTACAGTA TTGCTTTCAA TCGCCTCAAA TACAGCTTGC CCTCATACAG GTGCTGACAT TGTCTACTGG GACGATGCCA GTACATGGGG CGCTTCCGGA ATACCCGACA CGGCGAACCA AGACGTTGCA GTACCCAGCG GAAGTCGTGT GGTGATTCGT TCAACAATTC CTGTGGTACT TGGTGTAGTT ACAGTACCTG CTGGCAGCAA TCTCATTATT GGTTCCGATG TGAATGGTAT TGACATACAC GTCGCTGGCA TGGAGGTTGC GGGACGTCTC CTTGTCGGGT CCGAGACCTG CCGCCTTGCC AACCCCGTTA CCATCACTCT TCATGGTAGC CGACCGCGGG ACGCCGTCAC CAACGTACCA TCGGATTCCT ACAAGGGTAT CCACGTTACA GGTGTGCTAA GCCTGCACGG GAAGCGCTAC TTCCATACTT GGTCTCGATT GGCCAAGACT GCGGAAGCAG GATTGTCTGT ATTGATGCTA CAGAATCCCG TCAACTGGGA AGCTGGTCAA GAAGTTGTTA TTGTGACGTC CGCCATCAAG GATTCCATCG AATATCACCA GAATGAAGTC CGCACGGTGC GAGCCGTGCA CACCAGCCCG CCCAGTGGTG TGGGAGCAAT TGTGTATTTG ACCGAGCCCG TGGACTACAG CCACATTGCG AACAGCAACT ACCAGGTGGA AGTCGGTTTG TTGACTCGCA CAGTCAAAGT CCAAGGCTCC GAATCTGATT CCGAGCCGAC GGATCCCAAT CCCCTTTCTT GCACGTAAGT ATGGGTAGTA TGCAAACATG ATCTGGTGGC ACCATAAAAG TATCAGTGCT GCTGTGCATA AGTCTTGATT GGTCAAGTGG GATCTTACTC TCGCTATGGC TGTGCGCTTC CTTCTGTTCT TTTATGTGAC GACAAATTCA GAACCCCACT CGACAATTGG TGGTGGATCC AGTCATTTAC TGGGCAGCCC TGTGAGAACA AGGAGTTGAC GGGCTTTGGC GGCCACGTCA TAGTTCGCGG AGGTAGACGA GGACGTCGAA GGCGTGGAGC TCTATTGCAT GGGACAGACC AACTTACTGG GCCGCTACCC TATCCACTTT CATATGCTAG GAGACTTCCC AGACTGTTAC GTCAAGGACT CGTCAATTCA TCGGTCCTAC TACCGTTGCG TCTCTCTTCA TGGCACGCAC TATACAACCA CAACGGAGAA TGTTGCCTAC GATGTTTGCG GATACTGCTA CTATTTAGAA GACGGCGTTG 
AGCAGTTCAA CACACTGTCC TACAACCTGG CCGCGCATAT TCATAGCATT GGCCCGGTAC CATGGGGTGG TGGCCAAACG ACCGACATCT TCCAGCAGAG CACCACACTA ACACTTCCGG CGGACGTGAC GGCATCTGGA TTTTATATTA CCAACATTCA CAATCATATC ATTGGTAACG CCGCATCTGG GGGCTGGGCC GGCTTTGCAT TCCCGAATCT CGCCGAGGCT ATTGGCGCTC ACCAAGGAAA CGAAGCCTTT CGGCCTGCTA CGGTGACAGG CTTGACTCTA GACGGCAACA CAGTGCATTC TACTGGCTGG TGGTGGAACC ATGCTGGCGC TTTCTACTTT GGTGGCTCTC TTTATTACAA CGGTGATAAG TTGGAGTACA ATCCTGGTCG AAGTTTCTCT TTTGAGCGTG ACGATTGTCA TACGTGCAAG GTGAACAACT GCGCGCCACC GTATAACGAT TGTGTATACG GATGTCCTCA AGACAAAAAG GACTGGCTTT GAATCACAAA TAGTAAGGCT TTCTTGACCG CCAGAGTCGG ACTGGTATGT AATGATATGC TGTACACAAG TCTCCACAAT ACTTCCAGAC CGGTCTGTCT CACGAAATAT GGTTACTGTG TTGCTCTGTT CTTTGTCCAA CATTACAATC CTAGAACTTG TGGTCCGGAC AAATGGAAGT GATCGGTTTT GAGTCTCATG ATAACAGTCT AGCCATAGAG GCGCTCTCTA GCGGCTTGTG GATCAACCAT TTGCTGGCTG TGTGCCGCAC AGGCAAATCA CTTGGACTAC CGGAAGGTGC CACAAACAAC CGACCCCTCG AAGGGAGCGG CTTCTTTTGG TACGATACGG GCCAGGAGCA CATCATCACA CAGTCTACTT TTCGCAATTG TGGCTTTCGC TCGGACAATT ACAATCAGTA CAACTCTAGC GCCACTCGAG GATGCGACGA CAGCGACATG TCTAAAGCAT GCTACAGCGA GTCGTCGATG TTTGGCTTTC TGACCCATTC GGACGAGTTC ACGCCAAAAA TTATGCAGGG GACTCGCGAT ATTACATTCG ATAACTGTGG CCGTCGCTTC AAGTTTACAA TAAACAAGCT GGAGACCGTA TCTGGACAAG GTCAGAACTG GTTGGATATG GATGGGAGCG TTTCGGGCCT GAATGACCCA ACTATCATTG CATCCGGTCT TGAGCTAGCG AAGGACTGGT GGGGCGTTGA CAATCAAGGT CTGTATTGGA TAATGTATAA TTTTTATGCA CTCCTTTGTT TGCGAATTGC ATTCATTCTC ATGTGCGCAC TGTATCTTGC TTTACCTACA GTTGTATACG AGCCACAGGC ACCTCTCAGG TTCATAAAGA AGAAGAACGG CCCAATATGA TCCATGGGTC ACGTGCAGAT GAGCTGGGAT GAGAGTCTAC ACAACCAAGT CGGCAGCACA TACTGCCGCA ATGATGGATC CGGTCTCAAT TGTAGCCCTG TTGGTTACAT GCGGCATCTC GGTCATAAGT TTAGTCCAAC CCTGAACGCT GCGGTGAACA ATGGTCTCCC GGTTACAGCC AACCCTGATG TGGTTAGTAT GATTGGTGGC TTCGGCTGGC TTCTGACCTT GAATCGTGGC GCGCCTCGAC AGCTTGTCTT GTGGGACTTG GAAGTTGATC CTGGAAGTGT GCTTCTACTG AGTATCCCGT ATCCAGCCGG CACAACATTC AACATTTGAG CTAGTGCGCC AAGTTGGTGC GGAGATTCCG ATGGTTTTGT ATGCAACACT GACTTTGTTG CTGTGAGCTC AGTCCAGGCA 
GTACGCAACA GTGCTGGTAA CATTTACCAC GTTGGCACAA ACGGTGTGCT GACGCTGCAT ATTGTTCAGT TTTCCGGGCA GTTTACTGGC AACCCAAACT GGATCCTTCC CAATTACAAC ACTGGGATAA AAAGGCAATT TCCAAGTTTG AGCGAGACGT GGTAGTGCTT CCAGTACAGG AGTGGGCAAA TACACTAACA AGCTCGGCCA ATTGTGGTGG GTCGGGTGTA TACTGCAGTG GATCGGTTGC GGCGTATGAC CCTGATGTGT GCAATTCAGG TTTTGTGCAA GTCGCATACG ATACATGTTG CCAGTGCTCA AACCTGAGTT GATGCATGTT TGCCAACGGC AGTCGCAACT TTTAAAAGTT TTTTAGCTGT CTGTTGCACG TGATTCTGAA TCATCCTATG TTAAACAGTT AGGCAGCGAA ACACATTTT
|
Protein sequence | MKVVTASFLL FTFAAFLVGT HGKYDNESHK LILIRGRRGF RHRTGSWLAA SPANQRRCGA KNLYTRRPVH PKTSTPGCPV DSPTSAPTPP PIETATNSPE GSPTAALQET DYGPYPAGPY ISVQQNSERI TLPEPATVLL SIASNTACPH TGADIVYWDD ASTWGASGIP DTANQDVAVP SGSRVVIRST IPVVLGVVTV PAGSNLIIGS DVNGIDIHVA GMEVAGRLLV GSETCRLANP VTITLHGSRP RDAVTNVPSD SYKGIHVTGV LSLHGKRYFH TWSRLAKTAE AGLSVLMLQN PVNWEAGQEV VIVTSAIKDS IEYHQNEVRT VRAVHTSPPS GVGAIVYLTE PVDYSHIANS NYQVEVGLLT RTVKVQGSES DSEPTDPNPL SCTHLLGSPF AEVDEDVEGV ELYCMGQTNL LGRYPIHFHM LGDFPDCYVK DSSIHRSYYR CVSLHGTHYT TTTENVAYDV CGYCYYLEDG VEQFNTLSYN LAAHIHSIGP VPWGGGQTTD IFQQSTTLTL PADVTASGFY ITNIHNHIIG NAASGGWAGF AFPNLAEAIG AHQGNEAFRP ATVTGLTLDG NTVHSTGWWW NHAGAFYFGG SLYYNGDKLE YNPGRSFSFE RDDCHTCKNL WSGQMEVIGF ESHDNSLAIE ALSSGLWINH LLAVCRTGKS LGLPEGATNN RPLEGSGFFW YDTGQEHIIT QSTFRNCGFR SDNYNQYNSS ATRGCDDSDM SKACYSESSM FGFLTHSDEF TPKIMQGTRD ITFDNCGRRF KFTINKLETV SGQGQNWLDM DGSVSGLNDP TIIASGLELA KDWWGVDNQV VYEPQAPLST YCRNDGSGLN CSPVGYMRHL GHKFSPTLNA AVNNGLPVTA NPDVVSMIGG FGWLLTLNRG APRQLVLWDL EVDPGSVLLL SIPYPAGTTF NI
|