Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38085 |
Symbol | |
ID | 7203036 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 28878 |
End bp | 31246 |
Gene Length | 2369 bp |
Protein Length | 619 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182146 |
Protein GI | 219123676 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGGT GCCATTCCTC CATGGTTAAC GCCTTTCGTA GCGTTTCTCG CTCACTGTCA ATTCCAAGCA GACTCGCCGT CTTGAAGGAT CCCCAAGATC TCGTCACCAA AGAAGCCAAC GTGAAGCCCG CCGGGACTCG CACCAAACCA ACCGTTGATC CCTTCAATCC CAATTTCGAA TCGATCGCTT CGGTTCCCTA CAACACAGCG TTTCCGTCGT CCACGAAGGA GTATAAGACC GTTGTCCACG AAGCAACCGG CCACCGGCTT CACGTGCCCT TCCGTCGCGT CCATTTGGAA GATCCCGACC AACTCTACCT GGATTTGTAC GATACCTCGG GGCCGCAGGG CGTTGATCCC AAGAAGGGGT TGGCCAAGTT GCGCCAGGAA TGGACGGACG AGCGTGAGGG TAAGTACGAA CGTTACACTC AGATGCATTT CGCCAAACAG GGAATCGTTA CCGAAGAAAT GCTCTATTGC GCTACGCGAG AAAACATGGA ACCTGAGTTT GTCCGTTCCG AGGTGGCCCG AGGACGCGCC ATTATTTGCT CCAACAAAAA GCATCCCGAA CTCGAGCCGC AAATCATTGG ACGCATGTTT AAAGTCAAAA TCAATTCGAA TATTGGAAAC AGCGAACTGG GAAGCAATGT AAGTGTACCT GTTCTGCCAC CAACTATCCG CAACTGTCGT AGAGAGCAGT ATTTAGTGGA ATCCACATAA ATGCTCACCA TTTCTTGTTA CCACTTACGC TTTCAGATTG AAGACGAGGT GGAAAAGTTG CAATGGAGCA TGCTGTGGGG TGCCGACACA CTTATGGATT TAAGCACAGG CAAACATATT CACCAAACCC GCGAATGGAT CATTCGCAAT TCGCCCATCC CTGTTGGTAC CGTGCCCATT TATCAGGCTC TCGAAAAAGT GGACGGGATT GCCGAAGACT TGACGTGGGA ATGTTTCAAG GAGACGCTTC TCGAACAAGC CGAGCAGGGT GTGGACTACT TCACCATTCA CGCGGGCGTC TTGCTTCGAT ACGTGGTACG TCATATCGCG GGTTGGCTTG CACCGCAATG TACTACGCCT TGAAGTAGAG ATTTTTTGTA AACTCACTTT TCAATTTGAT TAGCCAATGA CGGTCAAGCG CATGACAGGT ATTGTCAGTC GCGGCGGATC AATCCATGCC AAATGGAATA TTTTTCATCA CAAGGAAAAT TTTGCCTACG AACATTGGGA CGACATTTTG GAAATTTGCG CCAAGTATGA TATCGCGCTG AGTATTGGTG ATGGACTACG TCCCGGATCC ATCTACGATG CCAACGACGA AGCGCAGTTT GCGGAACTCT TTACTCAGGG AGAACTAACT AAGCGCGCTT GGGAAAAGGA CGTACAAGTT ATGAATGAAG GTCCTGGACA CGTTCCTCTG CACAAGGTAA GCTATGATCA GGCAAGGTAG ATTTCGCCTC GACCGTACGT CTTGCTACTA ATCCTTCGAC TTGGTTGCGT ACCCAGATAC CCGAAAACAT GCGCAAACAG CTAGAGTGGT GCAACGAAGC ACCTTTCTAT ACACTAGGCC CTCTCACTAC AGATATCGCA CCCGCCTACG ATCACATTAC TTCCGCCATT GGTGCCGCAA CGATTGCGTC TCTCGGAACT GCCATGCTCT GTTATGTTAC GCCAAAGGAG CATCTTGGTC TTCCCAACCG CGACGATGTC AAGGCTGGTA TCATTGCCTA CAAGATTGCA GCGTACGTAT CTGGTGGTTT TTGGTTGTAT GTCCGCCAAG TTTACAGTGT TGATTGATCC CCCACGGCAC AGACTACACA GCAAGGTACA ATAGGAGTAT TGCATTGACA GTCAATACAC TCACGTTAAA CACTTTGTCG TTTCTGTTTA CAGTCACGCC GCCGATCTTG CCAAGGGATA TCCTGGGGCC CAGGATCGCG ACAATGCTCT CTCGAAGGCC CGTTTCTCTT TCCGTTGGAA TGATCAGTTC AACATTAGCC TCGATCCTGT CACTGCCAGA GAATTCCACG ACGAAACCCT TGACAGCGAT GCCGCAAAGA GTTCCCATTT TTGCAGGTAA GCGCAAAGGT CACGACTTTT GTGGATGCGA AACGTCCTGG TCCTCTCTCA CTTGTCATTT TTATCAACCT GTAAACAGCA TGTGCGGGCC CAAGTTCTGC TCCATGAAGA TTACGGAAGA TGTCCGTGCG TACGCGGCCG AGAATGGCTA CGGAGTGGAA GAGACGGCGG CCAAGGGAAT GGAGACAATG AGCGAGCTTT ACAAGGAACT GGGCAACAAG CTCTACGTGG AAGATGACGA GAAAACGTAC GAGAACACCT TCAATCCTTT GAAAGATCTA GCGTCCTAG
|
Protein sequence | MAGCHSSMVN AFRSVSRSLS IPSRLAVLKD PQDLVTKEAN VKPAGTRTKP TVDPFNPNFE SIASVPYNTA FPSSTKEYKT VVHEATGHRL HVPFRRVHLE DPDQLYLDLY DTSGPQGVDP KKGLAKLRQE WTDEREGKYE RYTQMHFAKQ GIVTEEMLYC ATRENMEPEF VRSEVARGRA IICSNKKHPE LEPQIIGRMF KVKINSNIGN SELGSNIEDE VEKLQWSMLW GADTLMDLST GKHIHQTREW IIRNSPIPVG TVPIYQALEK VDGIAEDLTW ECFKETLLEQ AEQGVDYFTI HAGVLLRYVP MTVKRMTGIV SRGGSIHAKW NIFHHKENFA YEHWDDILEI CAKYDIALSI GDGLRPGSIY DANDEAQFAE LFTQGELTKR AWEKDVQVMN EGPGHVPLHK IPENMRKQLE WCNEAPFYTL GPLTTDIAPA YDHITSAIGA ATIASLGTAM LCYVTPKEHL GLPNRDDVKA GIIAYKIAAH AADLAKGYPG AQDRDNALSK ARFSFRWNDQ FNISLDPVTA REFHDETLDS DAAKSSHFCS MCGPKFCSMK ITEDVRAYAA ENGYGVEETA AKGMETMSEL YKELGNKLYV EDDEKTYENT FNPLKDLAS
|
| |