Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49563 |
Symbol | |
ID | 7198229 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 52044 |
End bp | 55519 |
Gene Length | 3476 bp |
Protein Length | 973 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184293 |
Protein GI | 219128173 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCTG CTCGCTCTCG AAATACCCGG GATGGCACCC GAAAGCAAGG TACGGCACGA AGAAGTGGCG GTACTACTAG TAGCAATGGC AATCAAGGCT TCGACGAGTT TGGATTTGGT CAACCTGCCT TTCCGGATTC TGCGTTTGAC AACCACGGCT TTGAGATGCC GCAAACTCGA ATTCAGCCAA CGAAGATTCG TTCCCGCCGT CGAGCATCTT TAGCTGCGGC GCCGAACATT GACGTTGTGT CGGAAAACCC GTCAATAGGT TTTACCAATC AATTTCAATC TTCACAAGAC GAACAGGTGT CTCGTGGTGG AGCCCGCTTG GCAAAGGCGG GACGCTCATC GCGCTCAATG GACGGCATCG AATTCCCAAC TGCACGCAAG GACGTTTCTA GTCAAAATCG TCCTCGTCGT TCGGGTCGCC GGGCTTCAAT GGCTACTTCT TCCAACCACA GCCTTTCCGC TTCCAATCAC ACCAACCCGG AACTCGGTTA CGGAGACGCC ATTCCGTCTG TTGCTGCTAA CCATAGAAAA GGAGACTCTA ATAGCGGAAT TTTGGACTTC GGCTTTGGTG GTGGTAAGAA TGCCGGTACA GCCAATGCCG ACTACGGCTA CGGTGACACA ATGTCGTCGG GTTTTGGTAA TTTCGAGTCT ATGCCATCCG CGCCTTCCAC CACACCCGAA TCTGAACGTC CGCGTCGCAG CGGACGACGC TCCAGTATCA GTGGAGGTCT TGAAAGTCTA CGGTCTGACT TGCGCGGAGG CGACCTGAGT GGTGCTCCGT CTAGTCGGGT GTTGGGTGGA AATTCTCGCG CCCAGAACAT TGTGTTGCCC ATGGCCGGGC CGGAAAAAGT GGCCGGTGGC AATGTTCGTC GTGGACGTCG CGGATCCTTA CTGGGTAGTG TTGGTAATGC AGTCGGAGCT ACCATGGGGG GATTCACTGG TGGAAATAAG GACAAGGAAA AACTCGACGA CGATACCACC AAAAAGTCTA AGTCTTTTCT AAAGGATCGC AAGGCTGAAG GTCGGCGAGG CACGACACGT CAACCATCGG CCGATGGCAA TATAATCTCT TCCTATACCG GCGATCGCGA CCGACGCCGC AAGCCGGCAG CGTCGTCCAA GACCCTGGGC AAAGAGAGCA ACGTGTCGTA CTCGGATCGT ATTTTAGCAC AGCGGTAAGA GGCAACATAA AAACACAGCA ATTCAATAAT TTGGCGGTGT ACAGACTACC AATCTAAATG TTTAAAGCCT AGCGGTATAG TCCGCTCGGC CAAGATATAG AAGGGCAGGG AATTGCAGCA AAGTAAAGGC ATATTAGATT CAGTGTACGT GATGTACCGG GACCAGTGTG AGAGAATAAA GGTCCGAATG TGACTCGCCC GAGATCGCGG AATCGCAGAA AAACCCGAGA CACTGTCAAT CCGTTTCTTC GCGAAATCCT GGCCGCTTTC GCGCATTTAC ATTACATAGT TCGCACCATG TGGGGGAGAC GAGGACTTTG TCGTATTTCT CTTCTAACGT TGCTATTATT ACTATTCGTT TCTAACAGTG ACTGTAGTTG TGGAGGTGAA GGGAGCAGCA GTAGCGGTAG CGGTAGCAGT GACGGTAGTA GTGCTTGTTG TCGCAGCAAA ATTTGGCCGG CGGCTCGATG TGAAACCTAC CGAACACTCG AAATCGATGC TTCCTCCTCT ATGACATTGC GACGGCACGG CTTGCGAGGA CTGCATATCT CCCCTACTCA AAGCGTAGGG GACGAAAGTA GCTTCGATGT GGACTGCCAT GGATATTGTC AAGACGTTCA ATCGATACTG GACGCCGCGT ACGTTCGCTT TCTCAAGGCG CTCCGGCGAA GCGTGTCTTC CACGCCTTTG GCGCATCACG ACCGTCGCGA AAATGAAAAG GTCGCTCAAC ATGACAACGT ACAGGCTCTG TTGGGCATTC ACATTTCCAT TACTACGAAT GAGTCTGCAC TCGTACACGA CGCGGACGAA CGATACCAAC TGGACGTCCC AGGGCCTACC GTCACTGAAA ACGACGACGA CGACGATGGC AGCTACATTC ATCTCACTGC ACCCACCGTC TACGGCATTC TGCACGCCTA CCAAAGCTTA CTGCAGCTGG TGACGTTTGT TGGTAGGGAC TCTCAAACAG GCGCTTTCGT ATTCGCCATG CCGGACACAA CCCTCATTCG AATCCGTGAT GGACCCGTGT ATCCCTACCG GGGACTCATG ATCGACACGG CCCGACATTT TTTGCCACTA CCGCTTATCT TGCAAAACTT GGACGCCATG GAGGCCAGTA AACTGAACGT CTTGCACTGG CACGTGACTG ATTCGCAGTC GTGGCCCTAC GTCAGTACTG CTTTTCCGGA GCTTAGTGCT CGGGGAGCCT TTGGTCCTGA AGAAACCTAC ACGGCTACAG ATATTGCCCT CGTCGTGCGG GAAGCCGCCG CACGGGGTAT TCGGGTGATT CCTGAATTCG ATTTGCCTGG ACACTCGTAA GCGATTGGAC GCTCACATCC GGAATGGTTA ACACCCTGTG GGTCCAAGCC ACGGCCGCAA GAACCTTTGG ATGCGACCAA TCCGGCCGTC TACGAATTCG TACACCGCCT CTACGACGAA TTGGCAATAC TCTTTGCGCA CGAATCCTTT TTACACGTCG GAGGAGACGA AGTCAATTTA GATTGTTACC ACAATAGCAC GACGGTCCAA AGATGGATGC GAAAACACAA TATGACACAG GAACTTGAGG TTCTGAGCTA TTTTGAGCGT GATTTGCTTT CGTACGTCAC CGCTGTATTA AATCGTCGTC CCATTGTGTG GCAGGAACTC TTCGATTCGG GATTGGGTCT TCCCAATCAG ACAATTGTCG ATGTCTGGAA ATCGTGGGAA CCTTCGTCGC GATACAACGC CACTTTGCGG GGCCACGAAG TTATTTTGTC CTCGTGCTGG TATCTCGATC ATTTGAACGA AGATTGGCAA AGCTTCTACG CCTGTGATCC ACGGGAGTTC AACGGTACGA AAGAACAGAA GAACTTGATT CTGGGCGGTC ACGCTTCCAT GTGGGGGGAA CGGGTGGATG CGACCAACTT TCTATCTCGT GTTTGGCCCC GTGCCAGTGC TACGGCCGAA AAGCTGTGGA CAGGCAACTT AACAGCTGCG GCGGATTCGG CGGCTTCTCG ATTGGCCGCC TTTCGCTGTC ATTTGGTCCG CAGAGGAATT CCGGCCAGTC CGGTCGGTCC GGGAGCAAGT TGCGGCAGAC AACCAAATGG TTTTCCGGCT GTGATCGATA GCTTTCATGA CGAGGAGTTG CAGGAAGGAA AGGTTACTTG AGCAGAGCTT TCTCTGGTTT TACTGGCCAC CTAGCAATGC GCAGACACTA GATATCGTGC CGATTGAATT TCACGTCGCC GAGTCGCTTC TTCCCTGGCG TGCTACTGCA ACCAACCATA AATGAGTTCA CTTCAT
|
Protein sequence | MESARSRNTR DGTRKQGTAR RSGGTTSSNG NQGFDEFGFG QPAFPDSAFD NHGFEMPQTR IQPTKIRSRR RASLAAAPNI DVVSENPSIG FTNQFQSSQD EQVSRGGARL AKAGRSSRSM DGIEFPTARK DVSSQNRPRR SGRRASMATS SNHSLSASNH TNPELGYGDA IPSVAANHRK GDSNSGILDF GFGGGKNAGT ANADYGYGDT MSSGFGNFES MPSAPSTTPE SERPRRSGRR SSISGGLESL RSDLRGGDLS GAPSSRVLGG NSRAQNIVLP MAGPEKVAGG NVRRGRRGSL LGSVGNAVGA TMGGFTGGNK DKEKLDDDTT KKSKSFLKDR KAEGRRGTTR QPSADGNIIS SYTGDRDRRR KPAASSKTLG KESNVSYSDR ILAQRDCSCG GEGSSSSGSG SSDGSSACCR SKIWPAARCE TYRTLEIDAS SSMTLRRHGL RGLHISPTQS VGDESSFDVD CHGYCQDVQS ILDAAYVRFL KALRRSVSST PLAHHDRREN EKVAQHDNVQ ALLGIHISIT TNESALVHDA DERYQLDVPG PTVTENDDDD DGSYIHLTAP TVYGILHAYQ SLLQLVTFVG RDSQTGAFVF AMPDTTLIRI RDGPVYPYRG LMIDTARHFL PLPLILQNLD AMEASKLNVL HWHVTDSQSW PYVSTAFPEL SARGAFGPEE TYTATDIALV VREAAARAIG RSHPEWLTPC GSKPRPQEPL DATNPAVYEF VHRLYDELAI LFAHESFLHV GGDEVNLDCY HNSTTVQRWM RKHNMTQELE VLSYFERDLL SYVTAVLNRR PIVWQELFDS GLGLPNQTIV DVWKSWEPSS RYNATLRGHE VILSSCWYLD HLNEDWQSFY ACDPREFNGT KEQKNLILGG HASMWGERVD ATNFLSRVWP RASATAEKLW TGNLTAAADS AASRLAAFRC HLVRRGIPAS PVGPGASCGR QPNGFPAVID SFHDEELQEG KVT
|
| |