Gene Information |
Locus tag | PHATRDRAFT_49618 |
Symbol | |
ID | 7198260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 216490 |
End bp | 219596 |
Gene Length | 3107 bp |
Protein Length | 958 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184422 |
Protein GI | 219128442 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTATCGTCC TAGAGTTACT GATTCAATCA CAATCGTACG ATCGTACGCA TACACAAAGC TCACCATGAG AGTGACAGCT GCAGTAGCCG GGTCCTTGTG TTTGCTGGGG ACGTCGGTGG AATCCTTTGT CTTCCCGGCA CGCAAGAGTC CAAGGTTGTC CTTCCCTGCG GCAAATCCAG TCGCCTCCTC CTTTCGCAAT GCGGAAGCTA TAGTCAGTTT GCACGAGAAA AAGAACGACA TAGATATCGA CGATGTTACG AAAGAAGCGG AAGAAGCTTT GGCGGCGGCT GAAGCAGCGT TAGGAGGTGG ACTTGGAAAC AACAAGCAGA CGGTAAATCC ATCCCCGAGC ACACTGTCCG AAAGCACGCC ATTGCCATCT CCAAATAACA AAACGACGCC TTCATCTTCG CCTTCAAAAA CGCCTCCAGC ATCTTTAAAA AAAACTCTGC CGCAGCCTTC TTCCAAAATA GCGCCTCCGC CCTCGCCTTC TCCGCAAACA AAGCCCCCTC CCACTCCCCC TAGTCCTCGA GAGTTGGCTG AACAAGAGGC TCGTCGTAAA AAAGCCCAAT TCTACCGCAA AGACGCTATT GCTGCGGCTG TTGGTGCGGG ATTGCTAGGT GCAGCAGGAG GAGGCCTACT CTGGATGGAG TTTCCAGAAT TCGGTATCGC CTTACGAGAC GCTGTCAGCG CCGACATTCC CATGTACGTG CCTCCTCTTA TCGGCGCTGC CGTTTTGGGT GGAAGTGCAT TTAACAGTGC TTCGCAAGAT AATGCTGTGG GCACCCTTAC CCGTGGAGTA TTTGGCAGTA CCACCAAAGC GATCGGCGGA GGGATTGTGG GAGCAGTTAC TGGTGTCACC GGCTCTGTGA TCTCAGGCAT TGTAGCCATT CCTAAAAGAG TGGTTAACGC GGCCGCTCAG AAAGTTCGGG AGACTACAGA CGGAATCAAG GCCATTCCGT CAAGGGTTCA GGATGCTGCG GCCCGCAAGG TCCAGAGAAC AGCCGAGGAT ATTCGAGCCA TTCCCACCAA AGTTTCTTCG GCGGCAAGCA GAGCCGCAGA AGAAGCCGCT CGGGAGATCA AGGCAGCCCC CGGTCGGGTC GCAAAATCAG CGGAAGAAGC ACTGGAAAGA TCAGTGGAAG AGACCAAGAA GAATATTAAC AGAATTGGAG AAGATATAAA AGCCTCTCCC TCAAAGGCTG TGGATGCCTT CGAAGCAAAG GTGACGGAAG TCTTCGAAGC AAAGCCCAAA GAACCGCAGG CACCTTTGCG TCCTCCTCCA CCTCTAGTCA GGAACGATGC CAAGTCTTCG TTCCCAAACC TCCAGATTGA CCTGCCCAAG CTTGAGGTGC CGAAGATTCG CGTACCCAAT CTCGATGCGC TCAAGATTGA CACGCCGAGT ATTGAGATGC CTAAGACTCC AGTGCCAAAG AAAGAGCGGG ATGATGGCTT TGTTTTTGGT GAGGTTGGAC TCAAGAACTT TATTAACGCA CCGGTATCAA AGGATTCCGC CCCGTCCAAA CCAGCTACGG TTCCCGAGAA GTCGCAGCAA CAGCGGGCAG CTGCACAAAA GCGCGACCGC GCTCAAGCGA AAGTAGCATC CGAAAAGCAA CGTCGAGAAC AAGCTAAAGT GGTGGCGGCA GAAAAGCAGC GCCAGGAGCA AGCCAAAGCG GCATCGGCCG AAAAACAGCG TCAGGAACAA GCCAAAGCGG CATCTGTTGA AAAACAGCGA CAGGAACAAG CCACTCGACG TGCTAAACAA GAGAAGGCTC GTCAAGAGCG GCAAAAGCGT CTTGAAGATA 
TAGAAGCAAC TAAAAAGGCC AAGTTGGACC AGCGAATGCG CGAAACAGCA AAGCAGACGC AGATTGCCGA ACAGAAGCGT TTAGAGGCGG AACGCCGAAA GAACGCATCA GGTTCGAAGC CACGTCCTAG CTTTCAAATT CCCCGTCCCA GTTTCCGTAT TTCTCCCGAC TCTGACGCTA GCAAACCTCG TCCCTCATTT TCGCTAGGTG GAATGCCCAA GCAGCAGAAG AGTCCCTCAG AAAGCAAACC CCGTCCGTCC TTTCAATTGA ACCTCGGAGG GGGTAGCAGC AAAGCTGGAG AGTCTGGTGT CAAACCTCGC CCTTCCTTCC AATTGAACCT TGGAGGCGGC AGCAGCAAAG TCGAAAATTC CGATAGCAAA CCTCGTCCCT CTTTTCCACT AAATCTCGGA GGTCGCGGAA GTGATGCAAA TGTTGTCAGT AAGCCTCGTC CTAGCTTTTC CCTCGGTGGA GGCGGCGGGA CCCAATCAGG AAAGAAAGCT CCCCGCGGTG TCCCCACTAT TGTGCGCTGG AAACAACGCC GAGATGGCGG TATAACCGGC TTCATCTACG GTTCTCCAAA TTTCGACGAT GGAGATCGGG TGGAGACTAC AGCAATCGCA ACAGGAGATG TTGCTAATGG TGGCGTCGTT AAGACGGGAA GTGGTTCTAG ATACTTCTTA AGTGAGACCC CTCCAATGGG AGGGAAAGCG AAAGGCGGCA ACGATGCCGG CGCTTTGAAA TCCCTGCTCA GTGCTATTCC AGGAGCAACC ATCAATCTCT CCCGAAGCAG GAAGACTCCA GCGGAAGTAA AGGCTGAGGA GACTTTGAAG AAGGCGGAAG CGGCACGACC GAGAACATTT TCACTATTCG GTTTGGGTGG AGACGGGGCA GACTCCAGAC CACCTTCCCA GAAGGAAAAG GGCTCTGGCG GAGGCAAAAA GCCCGCAGTG ACGGCACCCC GCGGAGTTCC TACTTTGAAT AGATGGAAGA AGAATAGAGA TGGATCCGTC ACTGGTTTTA TTACGGGATC ACCCAACTTT TCTGAAAATG AGAAAGTTAC AACATCCCCG ATTACGCAGG GCACCGTGAA GTCCACTGAA ACTGTGAAGA CTGGAAGCGG CTCTCGATAT TTTCTTGCAT GAATGAGTTG AAAATCTCTG AGTTCATCAA AACAACCTGT ACTATGTGTT GGAGTCCTTT GATAGCATTG ACGATAAATT AATAGGATGA AGCAAGGGCA AATCCATTGA ATGCATAGCT TCTACAAACG CATCCAAGCT TCAAAGCATG CATTTTAACT AAGAAAT
|
Protein sequence | MRVTAAVAGS LCLLGTSVES FVFPARKSPR LSFPAANPVA SSFRNAEAIV SLHEKKNDID IDDVTKEAEE ALAAAEAALG GGLGNNKQTV NPSPSTLSES TPLPSPNNKT TPSSSPSKTP PASLKKTLPQ PSSKIAPPPS PSPQTKPPPT PPSPRELAEQ EARRKKAQFY RKDAIAAAVG AGLLGAAGGG LLWMEFPEFG IALRDAVSAD IPMYVPPLIG AAVLGGSAFN SASQDNAVGT LTRGVFGSTT KAIGGGIVGA VTGVTGSVIS GIVAIPKRVV NAAAQKVRET TDGIKAIPSR VQDAAARKVQ RTAEDIRAIP TKVSSAASRA AEEAAREIKA APGRVAKSAE EALERSVEET KKNINRIGED IKASPSKAVD AFEAKVTEVF EAKPKEPQAP LRPPPPLVRN DAKSSFPNLQ IDLPKLEVPK IRVPNLDALK IDTPSIEMPK TPVPKKERDD GFVFGEVGLK NFINAPVSKD SAPSKPATVP EKSQQQRAAA QKRDRAQAKV ASEKQRREQA KVVAAEKQRQ EQAKAASAEK QRQEQAKAAS VEKQRQEQAT RRAKQEKARQ ERQKRLEDIE ATKKAKLDQR MRETAKQTQI AEQKRLEAER RKNASGSKPR PSFQIPRPSF RISPDSDASK PRPSFSLGGM PKQQKSPSES KPRPSFQLNL GGGSSKAGES GVKPRPSFQL NLGGGSSKVE NSDSKPRPSF PLNLGGRGSD ANVVSKPRPS FSLGGGGGTQ SGKKAPRGVP TIVRWKQRRD GGITGFIYGS PNFDDGDRVE TTAIATGDVA NGGVVKTGSG SRYFLSETPP MGGKAKGGND AGALKSLLSA IPGATINLSR SRKTPAEVKA EETLKKAEAA RPRTFSLFGL GGDGADSRPP SQKEKGSGGG KKPAVTAPRG VPTLNRWKKN RDGSVTGFIT GSPNFSENEK VTTSPITQGT VKSTETVKTG SGSRYFLA
|