Gene PHATRDRAFT_18202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_18202 
Symbol 
ID7197221 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp780896 
End bp783930 
Gene Length3035 bp 
Protein Length882 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177685 
Protein GI219111867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.71122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCTGTTG TTATTCATTG GCCACTTGTT ACACTCTTCT TTAAAGTTCC AAGGTCATAG 
CTTCATCGTA GGCAATCTGG TTCAAAATGA TTCACGTCTT GAAAAGCCGT TCAACGCTTC
TCACGGCATC ATCGATTGTG CGAACCTCGG TTGGTTCCTC GTCCCGGTAC TCTCAGACCC
GCACTTACTC TCGCACTCAT TCCTGGTCCA AAGACGCTTC CGGAGGCGCT ATATCGGTGC
TGCCTACCAA GCTTCCCTTC GGTGAACAGG CGCCTCGATT TCCACACACT CTCGGCCTCC
CTCTGGTATC GAGACCTTTG TTTCCAGGAC TCGTCACCTC GGTGACGCTT ACGGACGAAG
CCACCATTGA CGCCATGGAA GCCTTGACCA AAAACCAAGA TCAAGCCTAC GTGAGTTGTT
TTTTGCGCAA AAAGAACCCC ACAGGTGTAT CGGAAGGTGG CGTCATCTTG GCCACTCCCG
AAGTTATTAC CGATCCTTCC GACATATACC ACGTGGGAAC CTTTGCCCAA ATCCAACGAT
TGACCAGGGG CGTCGGGTCG CCCAAGCCCT CCCATCCTAC CGATTCTCAC GATCAGTCAC
ATGAGGATGA AACCGCTGCT ACTCTCATTC TGCTGGCGCA TCGGCGCCTA GACCTCGAAT
ATGTGGACAA AATTGGACCA CCGATTGATG TCACGGTGAA ACATTGGAAT CGATCCGATT
ACACGGGTGC CGACGACACG ATCCGTGCAC TATCTAACGA AATTATCAGT ACTATTCGAG
AAGTTGCGCA GGTGAATATG TTGTTTCGGG AAAATTTGCA ATACTTTCCT ATGCGCGTGG
ACGCCAATGA TCCCTTTCGA TTGGCGGATT TTGCTGCAAG CATCAGTGCG TCGGGGACAC
CGGAAGATCT GCAAGCCGTG CTGGAAGAAA AAGATGCCGA AATGCGTCTC CACAAAGCGT
TGGTTTTGCT AAATAGAGAG AGGGAAGTTA GCAAGCTTCA ACAAGAAATT TCGCAAAAGG
TTGAGGAGAG AATGACTGAA GCACAACGAA AGTACTTTTT GACGGAACAA CTTAAATCGA
TCAAGAAGGA GCTTGGTATG GAGCGGGATG ATAAGGACAC ACTGATTGAA AAGTATCGCA
AAACCCTTTC GGAATATCCG CACGTCCCTG AAGAGGCTAT GGAGACAATT GACGCTGAAT
TGGAAAAGTT TTCGACTTTA GAAAAAAACT CCCCCGAGTA CAATGTAACT AGGAGCTATT
TGGATTGGCT CACGAGCGTG CCGTGGGGAG TCGAGACGGA AGAAAACTTC GATATTCAGA
AAGCACGAAA GACACTAGAT CGCGACCATT ATGGGTTGGA CGACGTCAAA GACACCATTT
TGGAGTTTAT CGCGATTGGT AAGCTACGTG GATCTGTCCA GGGGAAAATA TTGTGTTTGT
CTGGACCGCC AGGAACTGGA AAAACTTCCA TTGCCAAATC GGTCGCTGAT GCGCTTGGTC
GTCAGTTCTT TCGATTTTCG GTGGGAGGGC TTTCGGATGT TAGTGAAATC AAAGGCCACC
GTCGGTAAGT CGATGACTTT GCGGCTGGTT GTTGGTAGTC CAACCCTGAT GCGCGCTTAC
TCTTATTCTT TTATCGCAGA ACATACATTG GAGCTATGCC AGGGAAACTG ATTCAATGCC
TGAAGGCGAC TGGAACTACA AACCCTGTTG TATTGATAGA TGAAATCGAC AAGCTCGGTA
CAGGTTTCCG AGGAGATCCC GCTAGTGCTC TACTCGAAGT CCTCGATCCA GGCCAAAATT
CAACGTTTCG TGACTATTTT TTGGATGTTC CAGTGGACAT AAGTAAAGTT CTATTCATTT
GCACTGCCAA CGAGCTGGAG CGCATTCCTG GGCCACTACT GGACCGTATG GAAGTCATTC
GGCTGTCGGG CTATGATCTC CCAGAGAAGG TCGCCATCGC CGAGCAATAT CTGGTACCGA
AATCAATGCG TGACAGTGGG CTATTGGTCG ATAAAGCGGA ACACAAGGGT GATGAAAAGG
AAGCCGGAGA AGGCGCGAAA GAGACTCAAC AAGAGGCTGG AGAGACGACT AGAGAGGCGG
AGGAGGTCGG AGATACTCCA CTGGCTAACT TCGTTCATGC CAAGGGCGTG CCTGAAACCC
TAAAGTTAAC AATCGACGCA GTTCGAAGCT TGGCCCGGTG GTACGCCCGA GAAGCTGGAG
TGCGAAACCT TGCAAAGTAT ATCGATCGTA TTACCCGAAA GCTCGCACTG CAAGTCGTGG
CGGAAAGTGA GGGTGCCACA TTGACCGATA AGAGTTCACG AAAGTCAAAC ACTTGGGAGA
TCACAGAAGA CAATTTACAC GAGTACGTGG GTAAACCTGT CTTTACGAGT GACCGGCTAT
ATGAGGACGG GCCTCTTCCC CACGGTATTG TCATGGGACT CGCTTACACT TCCATGGGTG
GATCTGCCCT CTATATCGAG ACTCAAAGCA TCAGGCGCGG GTTGGATTCG GAAGGGAAAA
CCCGAGGAGG CGGTACTTTG AAGGTCACAG GGCAACTCGG AGATGTCATG AAAGAAAGTA
CGCAAATCGC AAGTACAGTC GCGCGTGCCC GCCTTTCTGA TATCAAACCG GAAAGCAACT
TTTTCGACAT AAACGACATC CACATGCATG TCCCTGAGGG AGCAACTCCC AAAGACGGGC
CGTCGGCGGG TGTCACTATG GTAACTTCTA TGCTTTCCTT GGCTTTGGAT CGACCAATTC
GAAACGACCT GGCCATGACA GGTGAAGTGA GCCTCACGGG CAAAGTGCTG GCAGTCGGTG
GCATCAAGGA GAAAATCATG GGAGCCCGAA GGGCCGGTAT CAAGTGTGTC ATTCTACCGG
CCGCGAACAA ACGCGACTAC GATGAGATTC CTGACTATTT AAAGGAAGAT TTGGAAGTCC
ATTACGCTGA CACTTTCGAC AAAGTGTACG AAGTGGCCTT TTCGTCCGTG GATTCAACGT
AGAGACTAAC AATATAACGA AGAAGGACAG GATGC
 
Protein sequence
MIHVLKSRST LLTASSIVRT SVGSSSRYSQ TRTYSRTHSW SKDASGGAIS VLPTKLPFGE 
QAPRFPHTLG LPLVSRPLFP GLVTSVTLTD EATIDAMEAL TKNQDQAYVS CFLRKKNPTG
VSEGGVILAT PEVITDPSDI YHVGTFAQIQ RLTRGDETAA TLILLAHRRL DLEYVDKIGP
PIDVTVKHWN RSDYTGADDT IRALSNEIIS TIREVAQVNM LFRENLQYFP MRVDANDPFR
LADFAASISA SGTPEDLQAV LEEKDAEMRL HKALVLLNRE REVSKLQQEI SQKVEERMTE
AQRKYFLTEQ LKSIKKELGM ERDDKDTLIE KYRKTLSEYP HVPEEAMETI DAELEKFSTL
EKNSPEYNVT RSYLDWLTSV PWGVETEENF DIQKARKTLD RDHYGLDDVK DTILEFIAIG
KLRGSVQGKI LCLSGPPGTG KTSIAKSVAD ALGRQFFRFS VGGLSDVSEI KGHRRTYIGA
MPGKLIQCLK ATGTTNPVVL IDEIDKLGTG FRGDPASALL EVLDPGQNST FRDYFLDVPV
DISKVLFICT ANELERIPGP LLDRMEVIRL SGYDLPEKVA IAEQYLVPKS MRDSGLLGVP
ETLKLTIDAV RSLARWYARE AGVRNLAKYI DRITRKLALQ VVAESEGATL TDKSSRKSNT
WEITEDNLHE YVGKPVFTSD RLYEDGPLPH GIVMGLAYTS MGGSALYIET QSIRRGLDSE
GKTRGGGTLK VTGQLGDVMK ESTQIASTVA RARLSDIKPE SNFFDINDIH MHVPEGATPK
DGPSAGVTMV TSMLSLALDR PIRNDLAMTG EVSLTGKVLA VGGIKEKIMG ARRAGIKCVI
LPAANKRDYD EIPDYLKEDL EVHYADTFDK VYEVAFSSVD ST