Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42888 |
Symbol | |
ID | 7196466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1422491 |
End bp | 1424331 |
Gene Length | 1841 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176786 |
Protein GI | 219110068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0590475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCTTGTACT CTGTGGGGTG CGGCGCTATA ATGTCCATGG CAGTGACGGC AGAGCCCATG GAAAGTCGGT CCGAGTCTTC GGCGGATCAT GCTGCAGGCG AGGGTTTCTT GGGTGTGTTT GATCGCGAAG AACCAGACCC TGCGGTGGAG ACACTTGCGA ATCCGGATGA CGATACAATT TCGGAAGAAC TCGCAGGCTC CTTGGTTCGC GTAGGGACAA CGGCGTCGGC TCGATTTAAC ATATTATCCA CCATGGTGGG TGGAGGGTCA TTGTCCCTCC CTATGGCGTT TCAAAAGGCT GGGAATGGAC TCCTCGGTCC CGTTATCCTG ATCGTGGTAG CTACCGTGAC AGAGTTCTGT TTCCGCATCT TGGTAGATTC AGCACGACGG CTAAGTCCCG TTTCTGCAAG CTCCGTAACT CCCGGTAAAG ACTCGTTCGA GTTGATTGCC AGTGCGGCGT TTGGTCGGCG GGCATATGTC GGATCCATGA TACTGGTAAC CTTTATGTGT TTTTTTGGTA CCATTGGATA CGCAGTCCTA CTGCGAGACA TGTTAGAACC TGTCACGTTT ATGATCTTTC CATCCCATGC GTCATTTTCT AATACCACCC GGGTATACGA ATCCCAGTGG GAGAGTGTCG GGCGTGGGAG CCAGGTCGGT GGTAGTGACG GGCCGTCTTG GCGGAACAAT GCGACCATGT TGATTGTTGT TTTGCTGGTA ACCCCGCTGT GTACGTTGCG GACACTGACT GCCCTAAAGC GGTTTGGTGC CGCATCCATG GTCAGCGTTT TGATCTTGGG ACTCTGCGTG GTGTATCGAT CGATTGAATG CAATCTCGGA TACGTGGACG GCAACCACGA TTACAAATTT TGGCATTCTT TTCAACTGTG GCCTGATTCC TGGAAAAATG TGTTGGACGC CTTTCCACTC TTTGTCTCGT GTTTTGTATG TCACTACAAT ATTCTTACCG TACACAACGA GTTACGTGTT CCGAGCCACC AACGAGTTTC GTGGTGGTTG CGGTCCACCA CTTGGATGGC CGCAGCGTTT TATCTCCTCA TCGGTCTTGC TGGATCAGCC TACGCACACT GCACAATCGA CGGTAAAATC CACGGGAACG TCCTTTTAGA CTTTCCCAAG GACGATCCAC TCTTGTTGGT GGGACGCATG TGCTTGGCCT TGACAATAAC CTTGGCCTTT CCAATGTTGA CCATTCCAGC CCGGGATATT GTGATTCGAT CATTGCCTTC GCTGCTCAAA CATGATCAAC AATCGAATGG TGCAGACAAT GGTGAATCAA ACTTAGTGGA ACAGTCGTTA CGACAGTCAC TCCTCGAAAA CGTTCATTCG GACGACGAAG CGGTTGGCTT AGTACCGCAT TCGTCGCTGT CCTCGGAACA ACCGTCCGGC AAAGGAGCAT CTTTCTGGCT ACGGCTAGTC GTTGCTATGG CTTTGTTCTG GACCGCGGCC GGAGTCGCAA GTTGTGTCAG TAGCATCGAT ATTGTGTGGA ATTTACTGGG CAGTAGTCTT TCCATGCTTT TGTCTTATAT TATCCCCTGT TCATCCTACC TCACGATTAT TCACACCGAG GAGAATGGAG GGACCAGTGA GCGTCCCAGT CGGTTTGTCC TGGCGACAGC ATGGGTGCTG TTGTTGGTGG CCTCCCCGCT AATGATCTTG TCGACCGCCA ACGCGGTCTA CAGTACTTTC TTTTCGAACG TATAGGGGCT GCAACTGCGG CATAGACAGC GATGATGATG ATATTCAAAC TTCGCGTTGA GGAACGTAGA GACGAAAAAA ATGTGACATA AGGGCATGCG TATAGTTTCA CAACAAATTA G
|
Protein sequence | MSMAVTAEPM ESRSESSADH AAGEGFLGVF DREEPDPAVE TLANPDDDTI SEELAGSLVR VGTTASARFN ILSTMVGGGS LSLPMAFQKA GNGLLGPVIL IVVATVTEFC FRILVDSARR LSPVSASSVT PGKDSFELIA SAAFGRRAYV GSMILVTFMC FFGTIGYAVL LRDMLEPVTF MIFPSHASFS NTTRVYESQW ESVGRGSQVG GSDGPSWRNN ATMLIVVLLV TPLCTLRTLT ALKRFGAASM VSVLILGLCV VYRSIECNLG YVDGNHDYKF WHSFQLWPDS WKNVLDAFPL FVSCFVCHYN ILTVHNELRV PSHQRVSWWL RSTTWMAAAF YLLIGLAGSA YAHCTIDGKI HGNVLLDFPK DDPLLLVGRM CLALTITLAF PMLTIPARDI VIRSLPSLLK HDQQSNGADN GESNLVEQSL RQSLLENVHS DDEAVGLVPH SSLSSEQPSG KGASFWLRLV VAMALFWTAA GVASCVSSID IVWNLLGSSL SMLLSYIIPC SSYLTIIHTE ENGGTSERPS RFVLATAWVL LLVASPLMIL STANAVYSTF FSNV
|
| |