Gene PHATRDRAFT_21929 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21929 
Symbol 
ID7203051 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp111359 
End bp115412 
Gene Length4054 bp 
Protein Length1270 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182326 
Protein GI219124051 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACCA AGTTTGAGTC CAAATCGGCG CGGGTGAAAG GTTTAGCCTT TCATCCGGTG 
CGTCCCTGGG TATGTGCATC TTTGCACAAT GGTGTAATTC AACTGTAAGT TGTTTAACTG
TCGTTGTGCC ACGGAGCAGC GGACAAAAAT GACCGTTCGT CTGCGGTGAT TTGTGGGGGC
GTTGAGAAGG ATCGTGCCCG AATGTTTGTC TAACCTTTTC TGACTCGTGT TTGTTTCTGT
ATCTCTTTGT GTAGTTGGGA TTATCGAGTA GGCACCGTCA TTGATCGATT TGAAGAGCAC
GAAGGTCCGG TCCGTGGTGT CGATTTCCAC GTCTCGGAGC CGTTGCTCGT ATCCGGCGGT
GACGACTACA AAATCAAGGT CTGGGATTAC AAGCTCCGTC GTTGCTTGTT TACCCTACTG
GGACATTTAG ACTACATTCG GACGGTTCAA TTTCACAGTA CGTTTCCCTG GATATTGAGC
GCGTCCGACG ATCAAACGCT GCGTTTGTGG GATGTTGATC GTCGCACCTG TTTGAGTGTC
TTGACCGGAC ACAATCACTA CGTTATGTGC GCGAGCTTCC ATCCCACCGA AGATCTCATC
GTGTCAGCGT CGTTGGATCA GACGGTCCGG GTGTGGGATA CCACCGGTCT CCGTAAGAAA
CAAACGGGTG AGGCCAGTGG TGGGGGACAC ATGGATGGAT CCATGCGTCC GCCGTCCACC
GGACTTAACG TGCAAGCCGA GCTGTTCGGA ACCAACGATG TTGTCGTCAA GTATGTACTC
GAAGGTCATG ACCGAGGCGT TAACTGGGCG TCCTTCCACC CTACTTTGCC ACTCCTCGCC
TCGGCAGCGG ATGATCGACA GGTCAAGTTG TGGCGTATGA GCGAAACCAA GGCCTGGGAA
GTCGATACCC TCCGGGGACA CGCCAACAAT GTGTCCTGCT GCTTGTTTCA TCCCAAACAC
GATCTTGTCG TCTCCAATTC GGAAGATCGC TCGATCCGTG TTTGGGATGT CAGCAAACGT
GTCGGCGTCC AAACCTTTCG CCGTGAAGGC GATCGCTTTT GGATTCTCGC CGCGCACCCG
ACGCAGAATT TGCTCGCGGC CGGACACGAT TCCGGTATGA TCGTCTTCAA ATTGGAACGG
GAACGTCCCG CTTCGTGCTA CGGACCGACT TCCCAGCTCT ACTACGTACG CGGTCGGGAA
CTTCTCCTGC ACGACTACGG ACGCGGTAGC ACCGGAGTCG ACGTTCCCAT TACCAGTCTC
CGCCGTATGG GAACGCAGGC TCAAACGGAC GGTATAGGCT CAGCCCCGCG ATACCTTACC
TACAATCATC ACAACCCGTC CGAAGGGAAT ATTTTGGTCA CTTCGGATGT CGACGGGGGT
TCCTACGAAC TCGTTACCTT CAGTTTGAGC AATGCGAGTG GTTCCGTCAC GGACGGAAAA
CGTGGTTCCT GCCTTGGGCC AGGTGTTTTT TTGGGACGCA ATCGCTTCGC CATTCTTGAT
CGCCAGCGAC AAATTGTAAT CAAGAATCTA CAAAATGAGA CGACCAAACG TGTTCAACCA
CCTGTTCCCA ACGTGGACGG ACTGTTGGAC GGTGGCGCTT CGGGTCGCGT ATTGTTGCGC
GCCGAAGACC GTGCCATTCT GTTTGAAGTC CAATCCCGAC GCGTGCTGGG AGAAATCACG
GCTCCCAAGA TCAAATCGGT TGTCTGGAGC CCGGATGGAA GCAAAGTGGC CATTGTCTGC
AAGTACGGTG TCGTTATGGC GGATCGCAGC CTCGAGCAGT TGTGCTCAAT TTCGGACAAC
GTCCGCATCA AGTCGGGTGC GTGGGACGTC AGTCCCACGG GCGGCACGGC CTCGGAACTA
TTTGTCTACA CCACCCTCCA TCACGTCAAG TACTGCTTGC CGTCGGGAGA CACTGGTACG
ATCCGTACGC TAGATCAACC GCTCTACGCG CAGCGTATCG TCAAGGACCA GCTTTTCTGT
CTCGATCGCG AAGCCCGACC CCGCATTCTG AGTCTCGACA CGACCGAAGC CCTCTTCAAA
CTGGCGCTGT CGCAGCAAAA GTACGGCAAA GTTATGCACA TGGTCCGTCA CTCTCGCTTG
TGCGGGCGCG CCATTGTTGC TTATTTGCAA AACAAGGGTT TCCCGGAAGT CGCATTGCAC
TTTGTGCGGG AACCCCGGAC CCGTTTTCGC CTCGCCTTGG CGTGCGGAAA TATTGAAGCC
GCCATGGAAT CAGCCTTTAC ACTGGAGCAA AAAGCTCAAG CGGAAGGCAA GGATACCGGA
CGAGACGTTT GGGGCGAATT AGGCAGTGAA GCGTTGCGTC AAGGCAATCA CCAAGTTGTC
GAAATGAGTT ATCAACGCAC GAAAGACTTT GACCGCTTGT CGTTTTTGTA CTTGATTACC
GGTGATACAG ACAAGTTACG CAAAATGCTC AAAATTTCGA ACATGCGTCA AGACATTATG
GGTCGCTACC ACAATGCCTT GTTGTTGGGC GATGCAGCCG AGCGTGTGCA CGTCTTGGAG
GAGTCGGGAA ATTTGCCGCT CGCCTACATC AGTGCAACCT TGCATGGTTT GATGGAAGAC
GCGGACCGGA TCAAGATTAC TATCGAAACA AATGGTGGCA GTGTGGATGG CCTTATGGAC
AAGGTTTCCG CCGAGGCTGG GGATAGAAAG ACACACTGTT TGCTGCAACC CCCCACTCCT
ATCCTCCGCG CCAACAACTG GCCAACACTC GAGGTACAAA AGACGACTCT TGAAGACCTA
TCGGCAGCCG ACGGTGAAGC TCACGAAGAG GACGGTGGTG AATATCACGA TGCAGCAGCT
GCGGCAGCGA CTGAGTTGGG TACGGAAGAT TGGCAAGACG ACGACGAGGA TATGGGTATG
GGTACCGGCG CCGCGGCAGC GGCTGCTAAT GACTTGGACT TTGGTGCCGA CGACGATCTC
GGCGACTGGG GCGACGATCT GGATGAACTC GGCGACCTGG GTGAACCGTC ACATCGTGAG
GCTGACGAAA TGATAGACGT TTCGGAAGTC GGAGAAGTTG GTGACTTTGT TATGCCTACT
TCTGGACGCC CTCCTGCTGG TTGTTGGGTA GGCAATAGTT CACACGCGGC CGATCATCTG
GCAGCCGGAG CTGCGTCTTC AGCGTTACAA TTATTGAATC GTCAAATTGC GGCGAGCGAA
TTCGCTCTAC TCAAGTCAAA TATGATCGCT TGCTATTTGG GTTCCATGAC GAGCGCTCCT
GGTGTTTCGG GCAGTCCGAG CATGTCCATT CCGTTGCTAC GAAATGATGT TAACGGACAT
CCGGGTGCGG AAAGTCTGCC TCGTACACCC TTGACTTTGA AGCAAACGGT AGCCGGGATT
CGCAATGGAT ATCGCTTCTT CCAGGGTGGA AAGTTCAACG AGGCCAAGGC AGCTTTTGTA
TCAGTGTTGG CCGAAATTCC GCTTGTAGTT ACCGGCAACC GGGCAGAAGG CAACGAAATT
AAGGAAATGC TCAGTATTTG CCGCGAATAC ATTACAGCAA TTCGGATCAA AGCGGAAATG
GCAGCAGCTG CGACTGACCC GGTCCGCTCC ACTGAGCTGT CAGCCTACTT TACTCACTGC
AACCTGCAAC CGGTCCACTT GCTGCTTGCT CTTCGTGCTG CCATGGGAAC GGCCTTCAAG
AACAAAAACT TTATCGTGGC TGCCAGCTTT GCGCGTCGTT TGTTGGAGCT TCCAGACATG
AGTAACGAAC GCAATGCAGA ATTGCGAGTC AAGGCAACTA AGGTGTTGCA GAAGAGCGAG
CAAATGGCCC GAAATGAGCA TCAGCTGAAC TATGACGAAA CGAAGACATT TGCGATTGAC
TGCAAAGACT TTGTCCCTAT TTATTCGGGC GACAGCTCGA CGCAGTGTTC ATACTGCGGA
TCTTCCTACG CGGACGAATC TATGTCGCAC AGTCTGTGCT TAACATGTGG ATTTTGTGCT
GTCGGGATCC AAACCATCGG GCTCGTCACT GGATAAATTT CTGCCGGATG TTCTACTCTA
TGTTGGCATA TCTACTAATG CAAGTTTTAA ATTC
 
Protein sequence
MLTKFESKSA RVKGLAFHPV RPWVCASLHN GVIQLWDYRV GTVIDRFEEH EGPVRGVDFH 
VSEPLLVSGG DDYKIKVWDY KLRRCLFTLL GHLDYIRTVQ FHSTFPWILS ASDDQTLRLW
DVDRRTCLSV LTGHNHYVMC ASFHPTEDLI VSASLDQTVR VWDTTGLRKK QTGEASGGGH
MDGSMRPPST GLNVQAELFG TNDVVVKYVL EGHDRGVNWA SFHPTLPLLA SAADDRQVKL
WRMSETKAWE VDTLRGHANN VSCCLFHPKH DLVVSNSEDR SIRVWDVSKR VGVQTFRREG
DRFWILAAHP TQNLLAAGHD SGMIVFKLER ERPASCYGPT SQLYYVRGRE LLLHDYGRGS
TGVDVPITSL RRMGTQAQTD GIGSAPRYLT YNHHNPSEGN ILVTSDVDGG SYELVTFSLS
NASGSVTDGK RGSCLGPGVF LGRNRFAILD RQRQIVIKNL QNETTKRVQP PVPNVDGLLD
GGASGRVLLR AEDRAILFEV QSRRVLGEIT APKIKSVVWS PDGSKVAIVC KYGVVMADRS
LEQLCSISDN VRIKSGAWDV SPTGGTASEL FVYTTLHHVK YCLPSGDTGT IRTLDQPLYA
QRIVKDQLFC LDREARPRIL SLDTTEALFK LALSQQKYGK VMHMVRHSRL CGRAIVAYLQ
NKGFPEVALH FVREPRTRFR LALACGNIEA AMESAFTLEQ KAQAEGKDTG RDVWGELGSE
ALRQGNHQVV EMSYQRTKDF DRLSFLYLIT GDTDKLRKML KISNMRQDIM GRYHNALLLG
DAAERVHVLE ESGNLPLAYI SATLHGLMED ADRIKITIET NGGSAGDRKT HCLLQPPTPI
LRANNWPTLE VQKTTLEDLS AADGEAHEED GGEYHDAAAA AATELGTEDW QDDDEDMGMG
TGAAAAAAND LDFGADDDLG DWGDDLDELG DLGEPSHREA DEMIDVSEVG EVGDFVMPTS
GRPPAGCWVG NSSHAADHLA AGAASSALQL LNRQIAASEF ALLKSNMIAC YLGSMTSAPG
VSGSPSMSIP LLRNDVNGHP GAESLPRTPL TLKQTVAGIR NGYRFFQGGK FNEAKAAFVS
VLAEIPLVVT GNRAEGNEIK EMLSICREYI TAIRIKAEMA AAATDPVRST ELSAYFTHCN
LQPVHLLLAL RAAMGTAFKN KNFIVAASFA RRLLELPDMS NERNAELRVK ATKVLQKSEQ
MARNEHQLNY DETKTFAIDC KDFVPIYSGD SSTQCSYCGS SYADESMSHS LCLTCGFCAV
GIQTIGLVTG