Gene PHATRDRAFT_37228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37228 
Symbol 
ID7202181 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp456098 
End bp458240 
Gene Length2143 bp 
Protein Length567 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181380 
Protein GI219122077 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0446526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAGA CGGTGCGTGT GTCATCCGTC GCTACTTTCC TTTGTCTGTG CAACTATGGA 
AGCCATTTGT TGGTTCTAGT AACATGTAAA GGATAACTAC TCTAAGAGTT TCTCCGTGCA
GTCGTGTGAC GATACATATC AGTGAGAATT CCGCAGGCAA TACTGCGGGA GCTCTTTTCC
CACTTCTGGT TGGTAATCAC CGATTGGAAA AGAGCATCTC GTAGGGCACT AACAGGCCAA
CAATTGTTCT AGGCATAGCA TACTTTTATT AGCTTATGAG ACAAACTTTG TTCATTCTTT
ATGAGTAACA GATCCCACAG AAACATGGAA CCTTTGATCA TTATCTCGTC GGTTTGTCGG
ATGCCAAAGC AGCTTTGTTT GCTGAAGCCG GTATCACCGA TTGGACTACC ATTGCGTCGT
TGGACCTTCC TGCATTCACA CAAGTGTTGG GAGACCAGGT TGGGACACTG CCAAAGCGAT
ACAAACTGAA GCGTGTGGCT GAGTACCTCA GCAGCGGTAA GGAAATCGGT CCCGATACGA
CGATGGGAGA TATCAACCTT GTGTTGTTGT CTAACCATCA GAATGTTGGA GCAAATGCTC
CGAACCCGGC TGCTGGACTC GACCCTCGCC ACGGAACAAT GAAAGCGAGT ATTAACACAA
TCTCAAAGTT TTCAGGTGAT CTGGAAGATT TTGAAGACTG GTCAACAAAA CCGCAACGGC
TCTCGGAATC ACCGTTTATC GTAAACTCCT CAATGGTCCT TGAGGAGTCG GAAATGATTT
TGATGCCGCC AGAAACAATG AGCTCTATCA CATGTTTGTG ATTGCTCTTG TGGATGGTGC
CACAATGCAT ATCATGGAAG ACGTGAAGGA CCAGAACGGA TACGCCGCGT GGATGGCTAT
CAAGGAATGG TATGGTATAT TGGATACCGG TCGGACAATC ATCAACAAGT ATCATAGCAA
GCTGGATGAG CTACTGCTTG ATGATGCGAC CCCAGCCGGG ACCTTTGTCA ACCATTTCAA
GAGATTCAGC CAGAAGCTCA AAGAGAATGG TGAGGGTTAT ACGGCGGATA CAAAACGACA
TCAGTTTCTC GACAAGATCA TTGACAAAGA CTACAATGTG GTCAAACAGC AGTTGGAAGG
AGATTCAACT GCGGACTTCA ATTAATGTGT TGTGCCCATT TGTATGCGCA AGCAAGTCCT
CATGAAGGAC TCCACATTGT CGGCCAAGAA AGCTAGACGG TTCAAGTCGA GCGAGGGCGG
CAAGTCGAAT GGTGGAGGTC CGTCAAGTGG GAAAATTCCT TCGATTCCGA ACTCTATACT
TAACCTGGTC AAGCCAGCAA GTGCTCGCAA GAATCTAATC AAGTGGAAAG GAGTTTGGAA
CTCCGAAGGG CGTATTCTTC GATTGGACAA ACTGGCAAGC ACTGAGTATG ACAGGAAAGG
TAAAACCCCG TTGAAAAGAG AGCACAATAA TGACAGTTCC GACAAATCCG TCAAGACCCG
CAACACCCAA GGCAAGGGGG GCTCCTTGTC CAAGAAGGGC AAGAGAGGCA AGGGGAGGCG
AATCACCGGT GTTGTCCGTC GAACGGAAAC GAAGACATCC GGCACGCCTG ACTCCTCCAT
CCGGATCAGT ATGAAGGATC CGGACGACCA CGTCGAAGGT GATGAGTATA ACGATGTCGA
AATTGAGTCC GGTCAAGAGG AAGATTCTAC TGCACCTGTG AAGAAAGAAC GGGCGAAAAA
GCAGAAGAAT CCGTCGAAGC GGAAGCACAA GCGTGCCAAG TCTCGTCAAA GCCCTATCTC
CCGCCGCGGA CGCGTAGGCA ACAAGAAACC AAGAGCTATC CTAGACCCAG GGACTGAGTG
CAACATTGTT GGCGGGGACG TATGTGTTAA TAATTCCATT GGGCCTACTA AATCTTTCCC
ATTCCGCTGT GACAAGACTG ATATTCTCCC AATCCGTTGT GACAAGATTG ATAGTACCAA
AGAACCTATC ACGGTTACAT ACAGAGCTTT AACGTCCTAC GAACGTTCTA GTCTTCCCGT
TTACATTTCA AACAAGTCAT CGCCATATGA AACAGTCATT GGCGATTCAG CTGCATCATA
CAAACTCAAT CACGATTCTC AAACTAGTCA TAAATCTTAT TAG
 
Protein sequence
MTQTKHGTFD HYLVGLSDAK AALFAEAGIT DWTTIASLDL PAFTQVLGDQ VGTLPKRYKL 
KRVAEYLSSG KEIGPDTTMG DINLVLLSNH QNVGANAPNP AAGLDPRHGT MKASINTISK
FSGDLEDFED WSTKPQRLSE SPNNELYHMF VIALVDGATM HIMEDVKDQN GYAAWMAIKE
WYGILDTGRT IINKYHSKLD ELLLDDATPA GTFVNHFKRF SQKLKENGEG YTADTKRHQF
LDKIIDKDYN VQVLMKDSTL SAKKARRFKS SEGGKSNGGG PSSGKIPSIP NSILNLVKPA
SARKNLIKWK GVWNSEGRIL RLDKLASTEY DRKGKTPLKR EHNNDSSDKS VKTRNTQGKG
GSLSKKGKRG KGRRITGVVR RTETKTSGTP DSSIRISMKD PDDHVEGDEY NDVEIESGQE
EDSTAPVKKE RAKKQKNPSK RKHKRAKSRQ SPISRRGRVG NKKPRAILDP GTECNIVGGD
VCVNNSIGPT KSFPFRCDKT DILPIRCDKI DSTKEPITVT YRALTSYERS SLPVYISNKS
SPYETVIGDS AASYKLNHDS QTSHKSY