Gene Information |
Locus tag | PHATRDRAFT_18202 |
Symbol | |
ID | 7197221 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 780896 |
End bp | 783930 |
Gene Length | 3035 bp |
Protein Length | 882 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177685 |
Protein GI | 219111867 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.71122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCTGTTG TTATTCATTG GCCACTTGTT ACACTCTTCT TTAAAGTTCC AAGGTCATAG CTTCATCGTA GGCAATCTGG TTCAAAATGA TTCACGTCTT GAAAAGCCGT TCAACGCTTC TCACGGCATC ATCGATTGTG CGAACCTCGG TTGGTTCCTC GTCCCGGTAC TCTCAGACCC GCACTTACTC TCGCACTCAT TCCTGGTCCA AAGACGCTTC CGGAGGCGCT ATATCGGTGC TGCCTACCAA GCTTCCCTTC GGTGAACAGG CGCCTCGATT TCCACACACT CTCGGCCTCC CTCTGGTATC GAGACCTTTG TTTCCAGGAC TCGTCACCTC GGTGACGCTT ACGGACGAAG CCACCATTGA CGCCATGGAA GCCTTGACCA AAAACCAAGA TCAAGCCTAC GTGAGTTGTT TTTTGCGCAA AAAGAACCCC ACAGGTGTAT CGGAAGGTGG CGTCATCTTG GCCACTCCCG AAGTTATTAC CGATCCTTCC GACATATACC ACGTGGGAAC CTTTGCCCAA ATCCAACGAT TGACCAGGGG CGTCGGGTCG CCCAAGCCCT CCCATCCTAC CGATTCTCAC GATCAGTCAC ATGAGGATGA AACCGCTGCT ACTCTCATTC TGCTGGCGCA TCGGCGCCTA GACCTCGAAT ATGTGGACAA AATTGGACCA CCGATTGATG TCACGGTGAA ACATTGGAAT CGATCCGATT ACACGGGTGC CGACGACACG ATCCGTGCAC TATCTAACGA AATTATCAGT ACTATTCGAG AAGTTGCGCA GGTGAATATG TTGTTTCGGG AAAATTTGCA ATACTTTCCT ATGCGCGTGG ACGCCAATGA TCCCTTTCGA TTGGCGGATT TTGCTGCAAG CATCAGTGCG TCGGGGACAC CGGAAGATCT GCAAGCCGTG CTGGAAGAAA AAGATGCCGA AATGCGTCTC CACAAAGCGT TGGTTTTGCT AAATAGAGAG AGGGAAGTTA GCAAGCTTCA ACAAGAAATT TCGCAAAAGG TTGAGGAGAG AATGACTGAA GCACAACGAA AGTACTTTTT GACGGAACAA CTTAAATCGA TCAAGAAGGA GCTTGGTATG GAGCGGGATG ATAAGGACAC ACTGATTGAA AAGTATCGCA AAACCCTTTC GGAATATCCG CACGTCCCTG AAGAGGCTAT GGAGACAATT GACGCTGAAT TGGAAAAGTT TTCGACTTTA GAAAAAAACT CCCCCGAGTA CAATGTAACT AGGAGCTATT TGGATTGGCT CACGAGCGTG CCGTGGGGAG TCGAGACGGA AGAAAACTTC GATATTCAGA AAGCACGAAA GACACTAGAT CGCGACCATT ATGGGTTGGA CGACGTCAAA GACACCATTT TGGAGTTTAT CGCGATTGGT AAGCTACGTG GATCTGTCCA GGGGAAAATA TTGTGTTTGT CTGGACCGCC AGGAACTGGA AAAACTTCCA TTGCCAAATC GGTCGCTGAT GCGCTTGGTC GTCAGTTCTT TCGATTTTCG GTGGGAGGGC TTTCGGATGT TAGTGAAATC AAAGGCCACC GTCGGTAAGT CGATGACTTT GCGGCTGGTT GTTGGTAGTC CAACCCTGAT GCGCGCTTAC TCTTATTCTT TTATCGCAGA ACATACATTG GAGCTATGCC AGGGAAACTG ATTCAATGCC TGAAGGCGAC TGGAACTACA AACCCTGTTG TATTGATAGA TGAAATCGAC AAGCTCGGTA CAGGTTTCCG AGGAGATCCC GCTAGTGCTC TACTCGAAGT CCTCGATCCA GGCCAAAATT 
CAACGTTTCG TGACTATTTT TTGGATGTTC CAGTGGACAT AAGTAAAGTT CTATTCATTT GCACTGCCAA CGAGCTGGAG CGCATTCCTG GGCCACTACT GGACCGTATG GAAGTCATTC GGCTGTCGGG CTATGATCTC CCAGAGAAGG TCGCCATCGC CGAGCAATAT CTGGTACCGA AATCAATGCG TGACAGTGGG CTATTGGTCG ATAAAGCGGA ACACAAGGGT GATGAAAAGG AAGCCGGAGA AGGCGCGAAA GAGACTCAAC AAGAGGCTGG AGAGACGACT AGAGAGGCGG AGGAGGTCGG AGATACTCCA CTGGCTAACT TCGTTCATGC CAAGGGCGTG CCTGAAACCC TAAAGTTAAC AATCGACGCA GTTCGAAGCT TGGCCCGGTG GTACGCCCGA GAAGCTGGAG TGCGAAACCT TGCAAAGTAT ATCGATCGTA TTACCCGAAA GCTCGCACTG CAAGTCGTGG CGGAAAGTGA GGGTGCCACA TTGACCGATA AGAGTTCACG AAAGTCAAAC ACTTGGGAGA TCACAGAAGA CAATTTACAC GAGTACGTGG GTAAACCTGT CTTTACGAGT GACCGGCTAT ATGAGGACGG GCCTCTTCCC CACGGTATTG TCATGGGACT CGCTTACACT TCCATGGGTG GATCTGCCCT CTATATCGAG ACTCAAAGCA TCAGGCGCGG GTTGGATTCG GAAGGGAAAA CCCGAGGAGG CGGTACTTTG AAGGTCACAG GGCAACTCGG AGATGTCATG AAAGAAAGTA CGCAAATCGC AAGTACAGTC GCGCGTGCCC GCCTTTCTGA TATCAAACCG GAAAGCAACT TTTTCGACAT AAACGACATC CACATGCATG TCCCTGAGGG AGCAACTCCC AAAGACGGGC CGTCGGCGGG TGTCACTATG GTAACTTCTA TGCTTTCCTT GGCTTTGGAT CGACCAATTC GAAACGACCT GGCCATGACA GGTGAAGTGA GCCTCACGGG CAAAGTGCTG GCAGTCGGTG GCATCAAGGA GAAAATCATG GGAGCCCGAA GGGCCGGTAT CAAGTGTGTC ATTCTACCGG CCGCGAACAA ACGCGACTAC GATGAGATTC CTGACTATTT AAAGGAAGAT TTGGAAGTCC ATTACGCTGA CACTTTCGAC AAAGTGTACG AAGTGGCCTT TTCGTCCGTG GATTCAACGT AGAGACTAAC AATATAACGA AGAAGGACAG GATGC
|
Protein sequence | MIHVLKSRST LLTASSIVRT SVGSSSRYSQ TRTYSRTHSW SKDASGGAIS VLPTKLPFGE QAPRFPHTLG LPLVSRPLFP GLVTSVTLTD EATIDAMEAL TKNQDQAYVS CFLRKKNPTG VSEGGVILAT PEVITDPSDI YHVGTFAQIQ RLTRGDETAA TLILLAHRRL DLEYVDKIGP PIDVTVKHWN RSDYTGADDT IRALSNEIIS TIREVAQVNM LFRENLQYFP MRVDANDPFR LADFAASISA SGTPEDLQAV LEEKDAEMRL HKALVLLNRE REVSKLQQEI SQKVEERMTE AQRKYFLTEQ LKSIKKELGM ERDDKDTLIE KYRKTLSEYP HVPEEAMETI DAELEKFSTL EKNSPEYNVT RSYLDWLTSV PWGVETEENF DIQKARKTLD RDHYGLDDVK DTILEFIAIG KLRGSVQGKI LCLSGPPGTG KTSIAKSVAD ALGRQFFRFS VGGLSDVSEI KGHRRTYIGA MPGKLIQCLK ATGTTNPVVL IDEIDKLGTG FRGDPASALL EVLDPGQNST FRDYFLDVPV DISKVLFICT ANELERIPGP LLDRMEVIRL SGYDLPEKVA IAEQYLVPKS MRDSGLLGVP ETLKLTIDAV RSLARWYARE AGVRNLAKYI DRITRKLALQ VVAESEGATL TDKSSRKSNT WEITEDNLHE YVGKPVFTSD RLYEDGPLPH GIVMGLAYTS MGGSALYIET QSIRRGLDSE GKTRGGGTLK VTGQLGDVMK ESTQIASTVA RARLSDIKPE SNFFDINDIH MHVPEGATPK DGPSAGVTMV TSMLSLALDR PIRNDLAMTG EVSLTGKVLA VGGIKEKIMG ARRAGIKCVI LPAANKRDYD EIPDYLKEDL EVHYADTFDK VYEVAFSSVD ST