Gene PHATRDRAFT_21316 details


Gene Information

Locus tag: PHATRDRAFT_21316
Symbol:
ID: 7202128
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 271842
End bp: 274425
Gene Length: 2584 bp
Protein Length: 727 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181340
Protein GI: 219121994
COG category:
COG ID:
TIGRFAM ID:
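
The Gene Length value is consistent with the Start bp and End bp coordinates when both endpoints are counted. A minimal Python sketch of that arithmetic follows; the function name `inclusive_length` is invented for illustration and is not part of any database API, and 1-based inclusive coordinates are an assumption.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Gene Information fields above.
print(inclusive_length(271842, 274425))  # 2584
```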


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATCGTTG TCTTCCTTTG TTCTACTTTT TGTGACAATC GCATTGAGGT GGATCGAAAG 
CACACGTGTC CATTGGTTTG ACCAAGCAGA TCATATCCTG TGCAAAACTG GAGCGGACAC
GCGAACGAAA AGGGCCATGA AAGAGATAGA AGCCTGGAAG AGTGCTCTTG GTTCATTGGC
TTTCCTGTTA TCTGCGGGCA CGTCAACGAT ATTATCGTGC GAGGCTTTTT CCTTATATCC
ACAAGCGCGA TCGCGACCTT TTTCCAATCC GCATTATGCG ACCGTAGAAG CCGATGCCGT
CAATGGCGCT TCCTCCGTCG CGGCACAATC CTACGGGGCC GGTCAAATCA CGGTTCTGAA
GGGTCTTGAT CCGGTCCGCA AGCGTCCAGG AATGTATATT GGATCCACAG GCCCAGACGG
ATTGCACCAC TTGGTCTGGG AAGTCGTGGA TAATTGCGTC GACGAAGCCT TGGCGGGACA
CGCCACGTTT GTGACGACGA CGATCCACGC CGACGGGTCC TTGACCGTCA CCGACGATGG
ACGTGGCATC CCGACCGATC TGCATTCCGA AACCGGCAAG TCGGCTCTGG AAACGGTGCT
GACGGAACTG CATGCCGGCG GAAAATTCGA TAATCAAGGT TCCAACAGTG GATACAAAGT
GTCGGGAGGA TTGCACGGCG TCGGTATTTC GGTGGTGAAC GCTTTGAGTG AGTTCGTCCA
CGTCAAGGTT GATCGCACAC CAGAGTTGTA CCAGATGCGC TTTGAACGGG GGGTACCAAC
GGGGCCACTA CAGGTGAGCA AGGGAACGTC AAACTTTGTG GACAAAGATA TCGATCAAGA
ATTGGAGCTC CTCAAGGCGA AATCGGATCA ACAGGATGAT GATAACACTG CCATAACCAT
TCACCAGCAG AACCTAGACA AGCTCAAAAC TTTGTTAAGC AAACGTAAAT CTGGAACATC
TGTTACTTTC CTTCCCGATC TTAAGGTGTT CAAGGGAGAT AACGGTAAAC CGGATATCAC
ATTCGATTCC TCTCGACTCA AAGGACGCAT GGACGAGATT GCCTATTTGA ATGCCGGTCT
AGTTTTGACG CTTAAGGATG AACGAAAGTC TCCAGGCCGT GGCCGTCTGC AAGTGTTTTA
TCACGCCGGT GGCCTTGCCG AGTATGCAGA ATTATTGTGT CGGACCAAGA CTCCACTCTT
TCAGGGAACA ACCTCCAGCA GAAAGAAAAA GCCCGCGAGA AAACAGAAAG ATTCGGCGTC
CGACGATGAT GGTATTGCAA TGGATCCTGT GGCGGGTTTG CTTACACCTG ACGGTGCGAC
AATACTGTGC ACTGGTACGA GCACGTCGGA CGAAGAAACC CCTCCAGTTT CCGTTTCGGT
AGCTTTGCGT TGGTCATCGG ATATGTATGC GGAATCGATT CTCTCGTTTT GCAACAATAT
TCGTACTCGG GACGGGGGGT CGCATGTGGA AGGCCTCAAA GCTTGCTTGA CTCGAACAGT
CAATCAAGCA GCCAAACGTT CCAACAAAGC CAAAGAAGGG GCTGCCAATC TACCCGGCGA
GTTCATTCGT GAGGGATTGA CAGCCATTGT ATCAGTCTCG GTTTCTGAAC CTGAATTTGA
GGGTCAGACC AAGGGACGCC TCGGAAATCC GGAAGTACGG CCGGCTGTGG ATTCGTTGCT
GAGTGCAGAA CTGACAAAGC TTTTTGACTT CCGACCTGAA ATTCTGGACG CCATTTACAA
CAAAGCGAGT TCCGCACAGG CAGCCGCGGC AGCCGCCAAG GCCGCTCGCG ATATGGTCCG
CCGTAAGACG CTGCTGACGT CTACGATTCT ACCCGGAAAG TTGGCGGATT GTGCGTCCCG
GGATCCCGAA GAATCAGAGA TTTTCATTGT TGAGGGTGAC TCGGCTGCAG GAAGCGCCAA
ACAAGGCCGA GATCGACGAA CGCAGGCTAT TTTGCCTTTA CGAGGTAAGA TTCTGAATAT
TGAACGAGCA GCCACAGAAC GTATTTACCA AAACACAGAG CTGCAGGGAT TGATTTCGGC
CCTCGGATTG GGAGTCAAGG GATCTGAGTT TGATCCTAAG TCTCTCCGAT ATGGTCGTAT
TGTTATTATG ACTGATGCCG ACGTGGACGG CGCTCATATT CGTGTCCTGC TATTGACGTT
CTTCTACCGC TACCAACGGG AGCTCGTGGA GAACGGCCAT GTTTACATAG CACAGCCTCC
TTTGTACAAA GTGAGTGTGG GAAGTGGCAG GTCAAGAAAA GAAGGGTACG CATTCAACGA
TACGGAAAGA AACACAGTAA TGATGCAAGT TCTCGGTGTT GATGACCCGA AGCAAGCCGA
GGAGGCGCTT GCGGCCGGGA AAGTTTCTTT GCAGCGCTTC AAAGGTCTAG GAGAGATGAT
GCCGGAGCAA TTATGGTCCA CGACAATGGA TCCAGAGCGG AGAACGATGC TTCAAGTTAC
TGTCAATGAT GCATCGATGG CCGACCAGAC ACTGAGCATT CTTATGGGAG ATACTGTAGC
CCCTCGGAAG GAATTTATCA GTACCCAAGC CGAGACGCTT AGAGTAGACG ATCTTGATTT
GTAG
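
The gene sequence above is printed in blocks of ten bases, so it has to be joined into one string before its length or GC content can be compared with the listed values. The following is a minimal Python sketch of that step; the helper names `join_blocks` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a demonstration.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "TTGATCGTTG TCTTCCTTTG TTCTACTTTT TGTGACAATC GCATTGAGGT GGATCGAAAG"
seq = join_blocks(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to `join_blocks` and then to `gc_percent` gives a figure that can be checked against the GC content listed in the record.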
 
Protein sequence
MKEIEAWKSA LGSLAFLLSA GTSTILSCEA FSLYPQARSR PFSNPHYATV EADAVNGASS 
VAAQSYGAGQ ITVLKGLDPV RKRPGMYIGS TGPDGLHHLV WEVVDNCVDE ALAGHATFVT
TTIHADGSLT VTDDGRGIPT DLHSETGKSA LETVLTELHA GGKFDNQGSN SGYKVSGGLH
GVGISVVNAL SEFVHVKVDR TPELYQMRFE RGVPTGPLQN LDKLKTLLSK RKSGTSVTFL
PDLKVFKGDN GKPDITFDSS RLKGRMDEIA YLNAGLVLTL KDERKSPGRG RLQVFYHAGG
LAEYAELLCR TKTPLFQGTT SSRKKKPARK QKDSASDDDA LRWSSDMYAE SILSFCNNIR
TRDGGSHVEG LKACLTRTVN QAAKRSNKAK EGAANLPGEF IREGLTAIVS VSVSEPEFEG
QTKGRLGNPE VRPAVDSLLS AELTKLFDFR PEILDAIYNK ASSAQAAAAA AKAARDMVRR
KTLLTSTILP GKLADCASRD PEESEIFIVE GDSAAGSAKQ GRDRRTQAIL PLRGKILNIE
RAATERIYQN TELQGLISAL GLGVKGSEFD PKSLRYGRIV IMTDADVDGA HIRVLLLTFF
YRYQRELVEN GHVYIAQPPL YKVSVGSGRS RKEGYAFNDT ERNTVMMQAL AAGKVSLQRF
KGLGEMMPEQ LWSTTMDPER RTMLQVTVND ASMADQTLSI LMGDTVAPRK EFISTQAETL
RVDDLDL
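
The listed protein length can be related to the gene length with simple arithmetic: each residue corresponds to one codon of three nucleotides, plus one stop codon. The Python sketch below is a consistency check only, not a reconstruction of the gene model; the helper name `coding_nt` is hypothetical.

```python
CODON_SIZE = 3  # nucleotides per codon


def coding_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Number of coding nucleotides implied by a protein length."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return codons * CODON_SIZE


# Values copied from the Gene Information fields of this record.
gene_length_bp = 2584
protein_length_aa = 727

coding = coding_nt(protein_length_aa)
remainder = gene_length_bp - coding
print(coding, remainder)  # 2184 400
```

A positive remainder is what one would expect for a gene marked as spliced, since introns and any untranslated flanking sequence add to the genomic span.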