Gene PHATR_10628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_10628 
Symbol 
ID7204194 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp605563 
End bp607759 
Gene Length2197 bp 
Protein Length575 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186091 
Protein GI219113015 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GATGGCCACA CCATTCACCA GTATTTGCGC GGCCGCTTTC TCGGAAAAGG CGGGTTCGCC 
AAAGTATACT TGTGCACCGC TCTGGACACA TCCAAACAGT ACGCGGTCAA GATCGTACCA
AAGGCGAATC TAGTCAAAGC GCGAGCGCGA CACAAGGTTT GTTACTGATT CTCAGTGCAC
ATTATCTACT TTTCCCTAGT TTCGACGCTC ACTTACTTTC TATCCCTCGC AGTTGCAAAC
CGAAATAAAA ATTCACCGCA CACTCAAGCA TCCCAACATT TGCGAGTACA AACATTTCTT
TGAGGACCGC AACAACTGCT ACATCCTTTT GGAGCTCTGT CACAATCAGA CTCTAAATGA
AATGATCAAA CGTCGGAAAA GATTGACCGA ACCAGAAGCG GCTCTGTTTA TGAATCATCT
TCTCGATGCA GTCAAGTACA TGCACCTGAA GAATGTAATT CACCGAGACT TAAAACTCGG
AAACTTGTTT TTGGACCGAC ATCTGAACGT CAAGGTTGGA GATTTGGGCT TAGCAACGAT
TTTAGAACAT CCCGAAGAAA AGCGCAAGAC TATCTGCGGA ACCCCGAACT ACATTGCTCC
CGAGATCATT CAGGGAGACA AGGCCACCAG GGGGTATTCG TTCGAAGTTG ACGTATGGTC
CATGGGGGTT ATCCTGTTTA CAATTCTTGT TGGAAAGCCG CCGTATGAAG CGAAGGACGT
CAAAGCCACT TACCAACGCA TTTTGGCCAA CGAATATTCG TTCCCCAACA ATGTAGAACT
CTCGTTGGAT GCAAAAGACT TAATTCGGAG CATGCTACGC TCCACACCAT GCGAACGGTA
AGGTCGTCAT TCGTTAGTTC TTGACTGATG CATTGTAAAT TTCAATACGT AGAGGCTAAT
CTTTTCTTTC TACAGTCTGT CCCTTAAGGA GATTGGAAGT CACCGGTTCT TGTCCATTAG
GAACACACCA CTAAACATCC CTTCAAACGC CACTCACTCT ACACCCAAAT GGTACTTGAA
CGAGTATGGT AGATTCGTCT CCGACGGAGA CGCTGCGGCC ATTCACTGTC AAAAACCACG
AAAATCAGTA CTCCCCCGAT TGAGTACTCG GCAGCCGTTC GGACTTCGTG ACCAGAACCA
TGGAACGGCC CGTAAGACGA AAAATGAAAA ATCCGAGGGA GAACACATTG ATATACAACG
CCTCGTCAAG AGCACTATAT CTCTACCGGC GTCGAAGCCC ATTAAAGGCG GAGGCATGTC
TCCTACTTTC AGAATTTTTG ACGACTCCAA AAAAGCTACG CCATGTGAGT CCTTGGAAAA
ACCTACACCA AAAACGAACG CAGAGGAAGA ACTTATTTCT CGAACTCGCG CCCTGTCAAT
TCAAACTTCC TCTCGCCTTC AAGATTCGGG CCGATGCAGT CCTGCAAGAT CTCTGGCATC
CTCCACCTAC ACCGCAATTA TCGATTCCGG TACAGAGATT CTGCAGAAAC TCGTTGTCCA
TCTAGAAGCC GTTCTAGAGT TAACTGCCTC ACGTCGTGAT GCGTTTCGAC CTACATCCCC
TCAATCCGTA GTCGTGTATG CAGGACCTAC CAGATGGGTG AGCCGCTATG TTGATTATAC
AAGCAAGTAT GGTCTGGGCT TTCTTTTGAA CGATGGTAGC TCCGGAGTTT ACTTTAATGA
CTCAACCAAG ACTGCTCTGG AGGCACAGGG GGAGACATTC TACTATATTG AACGTAGAAA
GGTTGAAGAC GCTGCTTCTC GAAAAGTAGA AATTGCTGTT GAAACCCACA CGTTGAGTTC
GTACCCAGAG CACTTGAAGA AAAAAGTCAC TCTTCTGAAG CATTTCCGCA ATTATCTCTT
AGACCAGCAG AATAAGGATG AAGAAACAGA GCCCACCCGG CCTCTTTCAT GCCTGCCAGT
TTCTGACACA GTACACGTCA AGAAATGGAT TCGCACGAAA CACGCGATAC TGTTCCGTTT
GAGTGACCAG ACCATCCAGG TAGTCTTCTA TGATCAAACA GAAGTACTCT TGACACCTGA
TGTACGATAT ATTACCTATG TGGATAAAAA TCATGTCCGT CGAACATACG ACTTCACAGA
CGAACTAGTT GGGTCCCTTG TGGAATTGGA GAAACGTCTG AAGTACACTA AAGAAGTTTT
GTTGCAGCTC ATTGGTTCAC ACTCTGGACG TCGCTAA
 
Protein sequence
DGHTIHQYLR GRFLGKGGFA KVYLCTALDT SKQYAVKIVP KANLVKARAR HKLQTEIKIH 
RTLKHPNICE YKHFFEDRNN CYILLELCHN QTLNEMIKRR KRLTEPEAAL FMNHLLDAVK
YMHLKNVIHR DLKLGNLFLD RHLNVKVGDL GLATILEHPE EKRKTICGTP NYIAPEIIQG
DKATRGYSFE VDVWSMGVIL FTILVGKPPY EAKDVKATYQ RILANEYSFP NNVELSLDAK
DLIRSMLRST PCERLSLKEI GSHRFLSIRN TPLNIPSNAT HSTPKWYLNE YEEELISRTR
ALSIQTSSRL QDSGRCSPAR SLASSTYTAI IDSGTEILQK LVVHLEAVLE LTASRRDAFR
PTSPQSVVVY AGPTRWVSRY VDYTSKYGLG FLLNDGSSGV YFNDSTKTAL EAQGETFYYI
ERRKVEDAAS RKVEIAVETH TLSSYPEHLK KKVTLLKHFR NYLLDQQNKD EETEPTRPLS
CLPVSDTVHV KKWIRTKHAI LFRLSDQTIQ VVFYDQTEVL LTPDVRYITY VDKNHVRRTY
DFTDELVGSL VELEKRLKYT KEVLLQLIGS HSGRR