Gene PHATRDRAFT_38668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38668 
Symbol 
ID7203360 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp653307 
End bp655304 
Gene Length1998 bp 
Protein Length626 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182730 
Protein GI219124897 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0793283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCATC AGCAACTACA GGGAGAAACT CCGTTTTCAT TTCCCGCCTT GGATGGGGAA 
ACAAGAATTG ATCTCGGCGC GTACCAAAGA CTAATCGTGA CGAATTTGGA CGTCAATGAG
GCAGGCCGGG CTTATTTTAA CTTAGGTCTA CGGCTCATGC TTTCATACCA GCACGAAATG
GCCTCCAAGT GTTTCCTGGC ATCACTAGAA AACAGCCCAG ACTGTGCTTT GGCCCACGGT
CTTTTGGCAC TATGTCATTC GCCGAACTAC AACTTTAAGG GTGAAGCCTA CTACGAGTCA
GCCTGTCACT ATGAAGACAC AGACAAGCCT GATCTGCTCT GCGTCTTTCC TTCTCAGCAA
GTCGCCGATC GACACAGTCG AATGGCTGTG GAGAAAATTG AGGAGTTGCG CAAGGCACAC
CGTAAACGCA AGGGGAAAAA GAAACAGAGG ACGGTTCCTT CCAATAACGG CGAAAAGCTA
CCTTCTGTAA TATCGGATGT AGAATGTCAG TGGCTTGCGG CGATTCGTGT ATTGACGAGT
TCTCCGGGTG TCGACCCAGA CTTGAGCCAC GATATTGTCG GTCGACCCTA CTCCGACGCC
ATGCGAAAAG TATACGAAAA GTTCGACAAC GATCCAGAAA TCGCCTACGG TTTCGCGGAG
TCATTGATGG TTTTGAATGC CTGGCAGCTA TACGAGTATC CATGTGAGTC ATATGCGCAG
TCTAAGCGAT TGTGATAACT ACGCGTCGGT TCTTCGCCGT TACATACATT TGGAAATAAT
GTCACTCACA ATGCTACTGG TTTCTTGGTT GGTTTCTCAG CCGGCAAGCC GCTCAGCCCG
GATGTAGTGG AAACCCGAGC TGTGCTGGAG CGTTCGCTAA AAATTCATCC GCATCACGCC
GGTCTGTGCC ACATGTACGT GCACCTTTCC GAAATGTCAG CGCATCCCGA AAAGGCCTTG
GCTGCCTGTC AGCCGCTCCG CGGAGAATTC CCCCATGCTG GACATCTGGT GCACATGGCA
ACGCACATCG ACGTCTTGCT GGGTGACTAC GAGTCCTGTG TGCACTTCAA CTGTCAAGCC
ATCCGGGCCG ATCGACATGT CATGGCGAGT AGTCCGGCAA CGGCTGGTAA GGAAAGTTTT
TACTTTGGAT ACATTGTACA CAATTATCAC ATGGCCGTAT ATGGGGCCAT TCTCGGAGGG
ATGCAAGGGA AAGCTATGGA ATTGGCGGAC GAGTTGAACG AACTTATCAA CGAAGATATG
TTCCGAGAGT TTCCCGATTT GACGTCATAT TTGGAAAGCT ATGCAGCTCT GGAAGTGCAC
ATTATGGTTC GTTTTGGGCG CTGGAAGGAG ATCTTGGAGT TAGAATTGCC GAAGGATCAG
CGCCTGATGT TGTTTCGGGC CTGTACTCTG CGGTACGCCC GAGGCTTGGC GCTAGCTGCT
CTAGGCCGCG TCGAGGAAGC CAACAAGGAG ATGATGACGT TGGATGCGTT GCGGGTTGAT
CCCGAAGCGA CGATGCGAAT TTTGCACAAC AATACCATTT TTGATTTGCT CGCGGTAGAT
TCTGTAATGC TGCACGGGGA AATTGCCTAT CGAGAAGGAC AATACGAAAA GGCGTTTGCA
CTGTTGCGGC AGTCCGTACA AATGCAGGAT GACTTGGTGT TTGACGAACC GTGGGGTAAG
ATGCAACCAA TTCGCCATGC CTTGGGTGGA TTATTATTGG AACAGGGACT CTTGGAAGAG
GCTATAGCGG TGTTTCGAAA AGATTTACAT TTTCATCCCA AGAATCCTTG GGCCTTGGTT
GGTTTGATTG AATGCTTGAA ATGTCAACAG CCATGTTGCT GCGAAGCGAC CGATCGAAAT
GCCGAGATTG CTATGCTGCA ATCACAGCTT GCAATATGTC GCAGTGGTGA GCTGGCTGAT
TTTGATATAG AAGTACCGTG CGAGTGCTGT CAACGTTCAC CGGGGCAAAA TACAAACGAA
ACGCAAATCT TGGAATAG
 
Protein sequence
MRHQQLQGET PFSFPALDGE TRIDLGAYQR LIVTNLDVNE AGRAYFNLGL RLMLSYQHEM 
ASKCFLASLE NSPDCALAHG LLALCHSPNY NFKGEAYYES ACHYEDTDKP DLLCVFPSQQ
VADRHSRMAV EKIEELRKAH RKRKGKKKQR TVPSNNGEKL PSVISDVECQ WLAAIRVLTS
SPGVDPDLSH DIVGRPYSDA MRKVYEKFDN DPEIAYGFAE SLMVLNAWQL YEYPSGKPLS
PDVVETRAVL ERSLKIHPHH AGLCHMYVHL SEMSAHPEKA LAACQPLRGE FPHAGHLVHM
ATHIDVLLGD YESCVHFNCQ AIRADRHVMA SSPATAGKES FYFGYIVHNY HMAVYGAILG
GMQGKAMELA DELNELINED MFREFPDLTS YLESYAALEV HIMVRFGRWK EILELELPKD
QRLMLFRACT LRYARGLALA ALGRVEEANK EMMTLDALRV DPEATMRILH NNTIFDLLAV
DSVMLHGEIA YREGQYEKAF ALLRQSVQMQ DDLVFDEPWG KMQPIRHALG GLLLEQGLLE
EAIAVFRKDL HFHPKNPWAL VGLIECLKCQ QPCCCEATDR NAEIAMLQSQ LAICRSGELA
DFDIEVPCEC CQRSPGQNTN ETQILE