Gene Information |
Locus tag | PHATRDRAFT_48077 |
Symbol | |
ID | 7203453 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 123092 |
End bp | 124832 |
Gene length | 1741 bp |
Protein length | 444 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182620 |
Protein GI | 219124668 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00714324 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCAGTCCG TGGTGTCGAG CGTCACGGAA ACGACCGATC GGAATGCGCC GTCGGACCGG GAACATTCCG TGCCGGGGCG TCTGCGCCCT CCTCGTCTCG GGCTGGTGGT GGTGGCGGAA CGACAGAGTG TGGCGTTCCG ACACACCGTC CGCTACGGAT ACGCTGATTC CATCCCGTCC CATGGCCTGG CACGGTCCTT CCTCGCGGTC CCGACAACAA AGCCAACGAT CCACGTCCGA CTCGCTCGCG GCCTGTCTTT TGATCAAAGA CGACAACGAC ATTCTCGACG AATGGATTGC CTACCATTAT CACGTTTGGA ATCTCCGTCA TTTGCTCGTC GCCGTCGACC CCAGCAGTCG TACTTCGCCC CGCGCGACTC TCACGCGTTG GACGGAAGTG ACCGATCTGG ATGTGCGAGT CTGGGACGAC GTCGATTATC TGCCGGAATC CTTCCGGACG CTCGGTTACC ACATTCCACC CCGCTACGTC TCGGGCGACG CCCAACGCTC CCAGTGGCAC GTGGGTCACG AATCGGCCGC GCAAGTCGTC GCCGATCGCA CCCGTATCAA CAATCACCGC TACCGTCAAG TCACACTCCT CACACACTGT TACCGCCACT GGCGGGAGCG GAACGCAACC TGGGTACTCC ACGTCGATAC CGATGAATAT TTGACCCTCA ATCCAATCCG GCGTCGGCCA CACACGACGG CAGGGTCGCC CGTTGTACCG GTCAAGCTCG TCCAGCCCGG CTGTTTGTTC CGGTTCTGGA ACGCACTGCT CCGGGATAAC GTCCAAGCCC GCGCCGCCAA CGCTCACCAT TCTTGCGTGT CCATGCCGCG ACTCTTGTTT GGTGCGATGG AAACCAATCG TTCGGTCGTC ACGACGACAA CCACTGCTGG CACAGCCGCC GCGAATCCCC AACGCTTCGA AACCCTCCGC TGGCGCTACC ACGCTGCCTA CAACGATACC GTCTACAACG CCCAACCCAA AACACTAGTG GACGTCTCCC GGGTACCCGC CCCCGACGAA CTCTGGCAAC AGGCATTTTC CATACATCGA CCATCCAAAG TATTGTGTCG ACGCATCGAG CAACTCAATC TGCGCACACC GAACCGTTAC CCCCTCGCCG TCTTTCACTA CCTCGGAACC TGGGAACGCT ACGTTGCGCG TAACGATACG CGCCGCTCCC GACGTGTCTA CGACGCCAAG GCCGCCGCCG GTGCCGGTGG ACGACCCGAA GATTCCATGG AGGACTGGTT CGATGGCTGG GTACACGACG TCGGGATCGA GACGGCGCGC TATTTGTTGC GCGACTATTT CCGCTACCCC GGAAACTCTG TCACAAATGC GTCACGGTAC TACGACACGT TGGCGTAGCG TTGTTCCTAA CCGTGATAGG CACGGAACGC AAGAATTGGT TCCAAGCTAG TCAAGAACAA TTGTGCCTCT CGACAAACGA CGTAAGGAGA TACAGTCAGA CTGCTTGGCG CCTTTACATC CACGCAGCCA ATCCTACTGG ACGACGAGGA AAGTCCAAGA CCTTTCCATG GACCACGGTA ACTATCTCAT TGCTCGACGG GGATATTTTA CAAACACGAT AACATTTTGG ATCAGTTAAG CTTGAGTAAG CTTGCTTCCT CGGAAACACA CACGACAACT ACTACGCCAA TCAGTAGACT AAGTGAGGGG AGGGGATACG CTGGTTTCCT TCGACCTCGA TAGATTTGTG GACTCTCGTC G
|
Protein sequence | MRRRTGNIPC RGVCALLVSG WWWWRNDRVW RSDTPSATDT LIPSRPMAWH GPSSRSRQQS QRSTSDSLAA CLLIKDDNDI LDEWIAYHYH VWNLRHLLVA VDPSSRTSPR ATLTRWTEVT DLDVRVWDDV DYLPESFRTL GYHIPPRYVS GDAQRSQWHV GHESAAQVVA DRTRINNHRY RQVTLLTHCY RHWRERNATW VLHVDTDEYL TLNPIRRRPH TTAGSPVVPV KLVQPGCLFR FWNALLRDNV QARAANAHHS CVSMPRLLFG AMETNRSVVT TTTTAGTAAA NPQRFETLRW RYHAAYNDTV YNAQPKTLVD VSRVPAPDEL WQQAFSIHRP SKVLCRRIEQ LNLRTPNRYP LAVFHYLGTW ERYVARNDTR RSRRVYDAKA AAGAGGRPED SMEDWFDGWV HDVGIETARY LLRDYFRYPG NSVTNASRYY DTLA
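The length and composition fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method. It assumes that the start and end coordinates are 1-based and inclusive (an assumption that happens to reproduce the listed gene length of 1741 bp), and that GC content is the fraction of G and C bases over the whole gene sequence. The function names `gene_length`, `gc_content`, and `cds_length_nt` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


def cds_length_nt(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein, counting one stop codon."""
    return 3 * (protein_length_aa + 1)


# Values taken from the record above.
print(gene_length(123092, 124832))  # 1741
print(cds_length_nt(444))           # 1335
```

To check the GC content field, pass the full gene sequence string from the record to `gc_content`; the spaces between the 10-base groups are ignored. An unspliced 444 aa protein needs 1335 nt including the stop codon, which is shorter than the 1741 bp gene.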