Gene PHATRDRAFT_40031 details

Gene Information

Locus tag: PHATRDRAFT_40031
Symbol:
ID: 7195503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 641059
End bp: 642501
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002183908
Protein GI: 219127367
COG category:
COG ID:
TIGRFAM ID:
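
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 641059
END_BP = 642501


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1443
print(protein_length(length_bp))  # 480
```

Both results agree with the Gene Length and Protein Length fields listed in the record.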


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.686468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACATTG CCGTTCGTCG ACGGCGTAGA CCGCAAGAAA TGGTCTGCTG TTGCCCACAC 
AGTTGTCTCA CTCTGACGAG TCGAACAAAA TCAAGGCGCG GGGTATTCTG GCTGCACTTT
TGGACGGCCG TGGCTATTGT ACAAACCGGG ACAAGCCGAC CCGATCCTTC CTCTCCCACC
GTGGGCAATC ATCAGAAAAA TCCGTTACAA ACTTGGATAG ATGCTCGTTG CGGACCCAAC
GGACACGCCG TTTGGGCGTA CTCTGGCAGT CTCTTCGATC CGCTGAACGG AAAAAAGATT
GCGAACGTGG AAGGTTTGGA ACTAGTTCGC AGACTTGCCG AGACGGACGA CGACGTCGAG
CACAAGCGAA GGTACCAACG ACGCTGTGGG GACTTGAAAA TGGCCCAGGC CATCTTGCAG
GAGAGCTGCT CTTTGGATTA CGCGGGAACA ATCCTTTCTC GCAAAATATT CTGCTACAAA
CCTGTCGACG ACCCGAAATC GCTCTTATCG TCCGTGCGAT TACGACCTCA GGGTCCAGAA
AAAGCAATTC CAACGGATCA AGCAGCAACG GTCTTCGACA CGGCTATTAC GGTCATTCAG
AAAGGCCCAT CCTGGTTTGT TCATGGTGAA CTCCCCAACG GAAACGTTGT ATGGAATCAG
GCCGAAGTTA AGCTAGCAAG CGGAGAGTCA TCTCGCACAC ATTTTGATTA TACTATTTAC
AGTCGACCGC GGCATTCAAA ACGCCAACAA ACTCCAGACT TGACAAGCCA AGCTGTACAA
GAGACCGTAT CGTCGGAGTC GAACAGCATT TCTCCCGCGC GATCGTCTCT CATATCGTTC
GGACCGAGCA AGGCGGAAAG TCTGGGCAAA TTTGGCGCGC GGGAAACCTA TCAATACACG
ACCGAGGAGA CGGGTGCCGG CGGACACTTG CTCGACTCTT TATGGGGACG AATGCAATTT
GCTGTCTCGT CTGTTTTCCA AAGAGACAAA AAGGCTAACG CATTGATCGT GCCAACTCGC
TGCAGCGTTC GATACACACG ATACGGCGAA GGTCCCGTTT GGTACGGGCC GAACCGATTG
TGTACCTTGG AACTCCAAGG GCATCGACTA GAAAATTTAT CCCAGGCACC GCAGCTAGCC
GCAACCATTG CCGCCACGTG TGTCCCGGGT TTCTTATCGA CTCATTCCGC CGTTGCCCAA
GACGACACCG GGGCTCGAAG AGCCGTGGCG TGGTTCCGTG GAGAAAACTC GGTCCAACTA
CAGATCACGC ACGACTACAA CAATGCCGAC AGTATAGAAA GGTCACATTC AAGTGGTGTT
CGGGGTTTAT ACGCGAAGGG AGCGGCTGTG ATTGAGCGTC TACATGCAGC CACGACAACC
AGCACTGGTG GTTCACTGTC AATCTATGAA GAAGATTCAA GCTATTTTAA AACGGAAAAG
TAA
 
Protein sequence
MNIAVRRRRR PQEMVCCCPH SCLTLTSRTK SRRGVFWLHF WTAVAIVQTG TSRPDPSSPT 
VGNHQKNPLQ TWIDARCGPN GHAVWAYSGS LFDPLNGKKI ANVEGLELVR RLAETDDDVE
HKRRYQRRCG DLKMAQAILQ ESCSLDYAGT ILSRKIFCYK PVDDPKSLLS SVRLRPQGPE
KAIPTDQAAT VFDTAITVIQ KGPSWFVHGE LPNGNVVWNQ AEVKLASGES SRTHFDYTIY
SRPRHSKRQQ TPDLTSQAVQ ETVSSESNSI SPARSSLISF GPSKAESLGK FGARETYQYT
TEETGAGGHL LDSLWGRMQF AVSSVFQRDK KANALIVPTR CSVRYTRYGE GPVWYGPNRL
CTLELQGHRL ENLSQAPQLA ATIAATCVPG FLSTHSAVAQ DDTGARRAVA WFRGENSVQL
QITHDYNNAD SIERSHSSGV RGLYAKGAAV IERLHAATTT STGGSLSIYE EDSSYFKTEK
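
The sequences are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the first line of the gene sequence could be cleaned, translated, and checked for GC fraction. The Translation table field is blank in the record, so the standard genetic code (NCBI table 1) is assumed here; the function names are illustrative.

```python
# Standard genetic code (NCBI translation table 1), assumed because the
# record leaves the "Translation table" field empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as printed in the record.
first_line = clean(
    "ATGAACATTG CCGTTCGTCG ACGGCGTAGA CCGCAAGAAA TGGTCTGCTG TTGCCCACAC"
)
print(len(first_line))                    # 60
print(translate(first_line))              # MNIAVRRRRRPQEMVCCCPH
print(round(gc_fraction(first_line), 2))  # 0.55
```

The translated fragment matches the first 20 residues of the protein sequence. Applying the same functions to the full gene sequence would give the figure to compare against the GC content field.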