Gene PHATRDRAFT_23959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_23959 
Symbol 
ID7198982 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp300874 
End bp303536 
Gene Length2663 bp 
Protein Length605 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185253 
Protein GI219130188 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACAGCGCG TATCCGTTGT TGCTCGAATA AAAAAGCGCC TAACACACCC GTGTCAGGAC 
TCGTAAGAAG AGTTGCTCTC CCGTAGTAGC AGCCGCAAAA AAACTAAAAA AAGTGAGTCG
TCGAGCGGTG TCATGGTCAA GGACAACCGT CGATTATCGC GATCCGCAGA CGTGGGAATT
CGTGTTTCCA TCATTCATCT CCGGGTGTTC TTTTACCTTG GTACCTCATT ACTAGTTTGG
CGTGTAGAGT CGTGTACCGG AAGCCAGTGT TTGTCTAGGA ATACGGAATC TACATTATTT
CCAGAAAACT CCTTGTTTGG GCAGTCTGTT CTGAATTGGA AGAGGGACCA AGAAATTTGG
ACGGTCGTGG TGTTTCGTGG TGCGCTTGGG AGTTGCCGAT GTGTTGCTTT TTTACGTGGT
GATAACGATG TGTCCAGTAC AGTGCACCAT TACTTGACTG TGTTCCTCAC TAGTTTTAAC
TTGCTTGTTT CGCAACCTGC AGAAGAATCT TGTAGAACCT CTTCGTGTTC TATACTACTT
ATCAGAATCC GTTTCCGTTT GATTCATACC ACGCCTTTCC CCAGTTATCT CGTCTGAAAC
AACCAAGAAC CATGGCTACC ACTTCAACTC CCGTTCCTGC TGACATTGGC AACGAAAATC
CTACCACCGA AGCCCCTGTC GCCGTCCCCT CCGCCTCTAC TGCTAGCGAC AATCCTTTTC
AAACGCCCTC TCTTTACGTT GGTGATCTCG CGCCCGATGT CAACGAGAGT CTCTTGTTCG
AGATTTTCAG CGCGGTCGGT CCCGTCGCTT CCATTCGTGT CTGCCGCGAC GCCGTAACCC
GCCGCTCCCT TGGCTACTCG TACGTTAACT TTCATCAGAT GGCAGATGCT GAGCGCGCGA
TGGATACAAT GAATTTCTCG ATGATCAAGG GCAAGCCGTG CCGTATTATG TGGAGTCAGC
GTGACCCCTC CCTCCGTCGT TCGGGAGTTG GTAACATTTT TGTCAAGAAC CTGAACGAAG
CGATTGATAA CAAGCAGCTC TATGATACCT TCTCCCTCTT TGGAAACATC TTATCCTGCA
AAGTTGTCAC AGATCGTGAG GGTGGTGTTT CTATGGGTTA CGGCTACGTT CACTACGAAA
CGGCTGAAGC TGCTAACGCC GCCATTGAAA AACTCGACGG CATGTTGATT GACGGCCAAG
AAGTTCAGGT CGGTCACTTT ATGCGTCGTA ACGATCGTCC CGATATTGAT TCATGGACGA
ACTGTTACAT CAAAAATGTT CCCTACGAAT GGGATGATGC TCGTTTGAAC CAGGAGTTTG
CCCAGTTTGG TGAAGTTCTG AGTGCTACCG TGTCTCGCGG TACACGCAAA CACAAACCCA
AGGTCCAGAA GAAACCCGAA AGCGAAGAAA CGGAAGAAGA AAAGAATGAA GATGAGAAGG
AATCCAAGGA ACTTGTTGAA ACCAAAGAAG AGGAAGGTGA AAAGAAGGAA GAAGATACCA
ACCAGACCCT TGGTTTTGGT TTTATTAACT TTGCGGAACA CGAATCTGCC GTTGCTGCAG
TGGAAGCGCT CAACGGAAAG GAGTACACCA CCACTTTAGA TGGTGAGGAA ATTACCCAAC
AGATCTACGT TGGGCGTGCC CAGAAAAAGT CGGAGCGCGA GCGTGAACTC CGTGCCAAAT
TCGAAGCCGA AAAAATGGAT CGTATTTCCA AGTTTCAGGG TGTCAACCTT TACGTTAAGA
ATCTTGACGA TTCTGTTACG GACGACATGC TCCGTGATGA ATTCGCGGTG ATGGGTACGA
TTACGTCGGC TCGTGTCATG AAGGACGCGA AAGACGGACG TTCCCGAGGC TTCGGCTTTG
TGTGCTATTC TACTCCCGAA GAGTCCACTC GCGCCGTCAA TGAGATGAAC GGCAAACTTA
TTGCGAACAA GCCTATTTTC GTCGCTTTGG CTCAGCGACG AGAAGTTCGT CGTGCTCAGC
TGGAGGCCCA GCATGCCAAT CGTGCGGGTG GCCCTGGGCA ACCAGGCATG ATGCGTGCTC
CGATGGGAGC CCCAATGGGA TACCCTGGTA TGCCCATGTA CATGCAGCGT CCTGGTCCCG
GAGGAGGCAT GCAACCGGCC TACCCCATGA TGCCACAAAT GATGGGCCCT GGAGGTCGTG
GTGGTCCTCA ACAACAGCGT GGACCGTACC CCATGATGGG GCAACAGGGA CGCGGGGGTT
ACCCGATGGG ATACGGCGTC ATGCCGCAAG GCCGTGGAGG CCGTGTCCCC ATGGCTGGAC
GTGGTGGACG CGGGCGTGGC CCCATGCCCG GACCTGGTCA ACAACCGATC AAGTTCAATC
AGCAGGTTCG TAACGCTGGG CCTCCTATGC AGAATCAACC TCCCCACCAG CAAGGTGCTC
CTCAAGCTAT TCCCCACGAG GGTGCTCTAA CGGCTTCGGC TCTGGCCTCC GCCTCCCCGG
AAGTCCAAAA GAACATGATT GGCGAGCGTC TTTATCCTCT GATTCACCAA TCGCAGCCTG
AACTTGCGGG CAAGATTACC GGAATGTTGC TAGAGATGGA CAATTCCGAA TTGTTGCATC
TTTTGGAAAG TCCGGACGCT TTGAACTCCA AGATTTCCGA AGCACTACAA GTTCTGGAAG
CACACCAAGC TGGCCAGGAG TAA
 
Protein sequence
MATTSTPVPA DIGNENPTTE APVAVPSAST ASDNPFQTPS LYVGDLAPDV NESLLFEIFS 
AVGPVASIRV CRDAVTRRSL GYSYVNFHQM ADAERAMDTM NFSMIKGKPC RIMWSQRDPS
LRRSGVGNIF VKNLNEAIDN KQLYDTFSLF GNILSCKVVT DREGGVSMGY GYVHYETAEA
ANAAIEKLDG MLIDGQEVQV GHFMRRNDRP DIDSWTNCYI KNVPYEWDDA RLNQEFAQFG
EVLSATVSRE DTNQTLGFGF INFAEHESAV AAVEALNGKE YTTTLDGEEI TQQIYVGRAQ
KKSERERELR AKFEAEKMDR ISKFQGVNLY VKNLDDSVTD DMLRDEFAVM GTITSARVMK
DAKDGRSRGF GFVCYSTPEE STRAVNEMNG KLIANKPIFV ALAQRREVRR AQLEAQHANR
AGGPGQPGMM RAPMGAPMGY PGMPMYMQRP GPGGGMQPAY PMMPQMMGPG GRRVPMAGRG
GRGRGPMPGP GQQPIKFNQQ VRNAGPPMQN QPPHQQGAPQ AIPHEGALTA SALASASPEV
QKNMIGERLY PLIHQSQPEL AGKITGMLLE MDNSELLHLL ESPDALNSKI SEALQVLEAH
QAGQE