Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48493 |
Symbol | |
ID | 7203716 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 675187 |
End bp | 677184 |
Gene Length | 1998 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183008 |
Protein GI | 219125478 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000192267 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCGCTATTC CGCAAACAAC GAACAAAGCG GATGTACATT TTTCGTGAAG CATTGAAAAG CCATGAACCT TATTTGCTGG ATCACCACGG TAAATGCACT GTTTCTGCTA CCGGTGTTGG CCACTTTGGG CCCACACCGC AAGCTCAACC GTGCGCTGGC AGGTCGTTCG ATTCCGGGAC AATACATAAT AGAATTGAAT CCAAGCATCT CGGATTCTAG GGGTTTTGCA ACCCACGTCC TGAAACGCGC GTTCCGAGAC AGTATCATAG AGACCTACTA CTATGCCATG AAAGGATTTG CTGTCAAGGA CCTCCCAAAC ACATTGTTGA ACTTCATGCT CAATTTGGAT GACGTGCTCC TGGTATCAGA GGATGCCGTT GTCGAGGTCG ATGCTGTACA GATTGATCCG ACGTGGGGGC TTGACATAGT TGACGGAACA ATCGACAGAA GATATAATTA CACGTATACG GGACTTGGTG TCGAAGTGTA CATTCTTGAT ACGGGAATTC AAGCAAATCA TTCCGAACTC GAAGGGCGTG TCGAGAGTTG CGTGTCCTTC ACTCCTGAAG GTACGTTTAT GGTTTTTTTG TGTACAACGA GGCCAACATT GTATTATGGA AATTGCTCTA CGGCCTACTT CTAGTTGCAA TCACATTGTG CAAACTCAAA ATCTGGAGGT ATGATATAGT ACTCACTGAC CGGTATTTGT CTCTTCTATA TGCAGAGTGC GGATCCGATC TCAACGGGCA TGGTACCCAT GTGGCCGGAA CTGTCGGTTC GAAAACATAC GGTGTAGCAA AGACGGTGTC CCTCCACGAT GTGAAGGTTC TCAACGCAAA AGGTAGTGGA TCATACAGTG CAGTAATTGC TGGCGTTGAC TATGTAACCC AGATCAAGAT GTCAGATCCC AGCCGAAAAA TTGTGATCAA TATGAGCCTT GGGGGAGGCA TCAGCACTTC GTTCAACAAC GCGATAACTT CCGCAGCAGA TTCAGGCGTG GTAGTTGTGG TTGCTGCCGG AAACAGCGAC GATGATGCCT GCAACTACTC TCCTGCATCT GCGTCGGGAG TATTAGCAGT TGGTTCGATT GATAGCGATA AGCGCCGTTC GAGCTGGTCC AACTGGGGTA GTTGTGTTGA CATCTTTGCA CCAGGATCCG GGATCCTGTC GCTTTCTCAA AGCAACGGCA CAACTACGAA GTCAGGTACA TCTATGGCTG CTCCACACGT TGCAGGTGTT GCCGCGCTCT ACTTACAAGC GGGAAGAAGC ACCGATTCTA TCGCATCGGA TGCGTTGGAG AATGGAATAA GCGATGTAAA GGAATCTTCC AACCGACTTG TGCGCACTTC GGAATTGCCA CCGGTAGCCC CGTCTCTACT GGCGCCCACT CGCTCGCCAA CCCGCAGACC TACTCCTAAT CCAACTGGCG CTCCCGTTAC ACCGCAGACT ACGAAGCAAC CTACACGCAC ACCCTTTGCC ACTCGTGCTC CAACCAAGAA CCCTACCCTT GCTCCGGCCA AGGCGCCCAC TCGTGCTCCC ACCAAGAAAC CCACCCTTGC TCCGACCAAG GAACCTACTC GTTCTCCTAC CAAGGCACCC ACTCGTTCTC CTACCAAGGC ACCCACTCGT TCTCCTACAA AGGCACCCAC TCGTTCTCCC TCCAACGCAC CAACACCTTC TCCGGTATTG CCACAGTGCC AATCTAATGG CCAGGTGTGC ACCGCTTTTA GGCGATGCTG TAGCGGATTC AGATGTCTTC GTTCGTGGTC ACCACAGCGT GGCCGGCACC GAGCGTGTCG TCCTCGCTGG TGATGATGGG CTTCCAAGAC AATCGCTCAC TCCATTGTGT GGCCGGCTTG ATACTCAAAT CGGTCTTGGT CGCCGTGAGT AAACTGTTAA GCGATCAACG TTGCAAGCAG TTGTTTTGAC ATAGACGGTT TACTAGGAGC TTTTTCGGAA TGTGGTTTGC ACATATTGTA GTCAGCGAAA AATTTATC
|
Protein sequence | MNLICWITTV NALFLLPVLA TLGPHRKLNR ALAGRSIPGQ YIIELNPSIS DSRGFATHVL KRAFRDSIIE TYYYAMKGFA VKDLPNTLLN FMLNLDDVLL VSEDAVVEVD AVQIDPTWGL DIVDGTIDRR YNYTYTGLGV EVYILDTGIQ ANHSELEGRV ESCVSFTPEE CGSDLNGHGT HVAGTVGSKT YGVAKTVSLH DVKVLNAKGS GSYSAVIAGV DYVTQIKMSD PSRKIVINMS LGGGISTSFN NAITSAADSG VVVVVAAGNS DDDACNYSPA SASGVLAVGS IDSDKRRSSW SNWGSCVDIF APGSGILSLS QSNGTTTKSG TSMAAPHVAG VAALYLQAGR STDSIASDAL ENGISDVKES SNRLVRTSEL PPVAPSLLAP TRSPTRRPTP NPTGAPVTPQ TTKQPTRTPF ATRAPTKNPT LAPAKAPTRA PTKKPTLAPT KEPTRSPTKA PTRSPTKAPT RSPTKAPTRS PSNAPTPSPV LPQCAPLLGD AVADSDVFVR GHHSVAGTER VVLAGDDGLP RQSLTPLCGR LDTQIGLGRR E
|
| |