Gene PHATRDRAFT_40023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40023 
Symbol 
ID7195497 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp620905 
End bp622493 
Gene Length1589 bp 
Protein Length522 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184033 
Protein GI219127626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAGA TGAAAGCTGG GGCAATTGCA AGTCGCAGAC GGCGCCATCG AGCGCCATCA 
TTCCTTGTCC CCCTTCTGGG TGGGTTGGTC ACGTATAGCT TGCACATCAA TTCTTCATTT
TCTTTCACAT GGCAGCATAA CGCTGATTCA CAGGAATTCT CCATGCAGCT TGCCACAGTC
AATTCTACCG CAGCGAAACA ATTACAATCG GCTCAGATAC ATAACATCAC ACGCAGTGGA
CTTCAGTATG GCACAGCGAC CGGACAAAAT CATTCGAGAC ACAGCAAAAC AGCAGAATCG
CCTTTTCAGT CTTTGCAAAC GTCACTAGAA TCTCAGCGAC TTACACCTAT CGCTACCAGC
TCAACTGTTA GGGGCAATCG CTCCGTCACT GATCTTCCAT ACGCAGATGC TCGCGACGAA
GATGGTTCCT GGGGGTACAT CGCCGACGCA ACCCAGGTGA GGAGTCGAGT CCTGGCGCTT
CTACCCTCAA ACCATACACT GCACAACAAT GTCACCAGTT TCATACCCAT GACGGAATCT
GAACAAGAAG AAATATGCCA AAAGCCACCC GGAAGCGGAC CGGAGCAAGA ATTGGGCTGG
AAACTGATGC AGCGTGTTGT CGTCAATGCG CCCGAGCCGA GGTACGCCAA CGAGTCTGCA
GTCATTGTCG CCACAATCTG CAGTCATCGT CACCAAAGAG TCTCCAAGCA GTCATCGTCA
CCAACTCGTC CTCCATCTCA GTAAGCCACC ACACAGAAAC AGCAGCACCC AAAATTCTTT
GTGTCGTCTA CACGTATGAT GCTCATCACG ATCGAGTTGC GGCGATTGGT GATACCTGGG
GTTGGCGCTG TGACGGCTTT TTGGCCGCCT CCAACCGAAC TGTTCCGGAG CTTGGGGCTG
TAGATTTGCC CCACGTTGGA CCCGAAGCTT ACGGCAATAT GTGGCAAAAG ACGCGTTCTA
TATTGGCGTA CGTGCACGAA CACTACATTG CGGAGTACGA CTATGTGCAT GTGGCAGGAG
ACGACACGTA CGTGATTGTG GAAAATTTGA GAAATTACTT GGAGTTTACG GTAGAGGCAA
AACACGGTCG TGGCAAAGTA CCATTGTATT TGGGTCAGCG TGTTTTTTCT GGAGGTGGTT
ATACATTTGT TGGCGGCGGG CCGGGGTATA TTTTAAATCG CTTGGCCTTG CAGCGTTTCA
TTAAAGAGGC TCTGTCAGCA TGTCTGGCTA ATCAGCAGGA AGCAGCCGAA GACCGTTCGC
TTGGATATTG CTTCAAAACC TTGGAAATTA CTACGGAAGA TACGGCGGAT GCATTTCATC
GGCAAAGATT TCACGGTGTG GATCCATACT TTTTGGCGAC AAAGAATCCA CAGAAAGGCT
TCTGGAAACG GTTATACAAG TTTTGGGCCC GCGAACATGG GTACAAATGG GGCATTGGCT
TGGTGTCACC ACAAACTGTA ACTTTTCATC TCATAAAGTC GCCGATTTGG ATGAAGCGCA
TGCATGCTAT GCTCTACCAT GCCTGTCCGA CGGGTACGGC AATGGGTGAT CTCCTTCCCA
GACCTACGAA GCTGGCGAAT ATTTCCTGA
 
Protein sequence
MTQMKAGAIA SRRRRHRAPS FLVPLLGGLV TYSLHINSSF SFTWQHNADS QEFSMQLATV 
NSTAAKQLQS AQIHNITRSG LQYGTATGQN HSRHSKTAES PFQSLQTSLE SQRLTPIATS
STVRGNRSVT DLPYADARDE DGSWGYIADA TQVRSRVLAL LPSNHTLHNN VTSFIPMTES
EQEEICQKPP GSGPEQELGW KLMQRVVVNA PEPSHCRHNL QSSSPKSLQA VIVTNSSSIS
VSHHTETAAP KILCVVYTYD AHHDRVAAIG DTWGWRCDGF LAASNRTVPE LGAVDLPHVG
PEAYGNMWQK TRSILAYVHE HYIAEYDYVH VAGDDTYVIV ENLRNYLEFT VEAKHGRGKV
PLYLGQRVFS GGGYTFVGGG PGYILNRLAL QRFIKEALSA CLANQQEAAE DRSLGYCFKT
LEITTEDTAD AFHRQRFHGV DPYFLATKNP QKGFWKRLYK FWAREHGYKW GIGLVSPQTV
TFHLIKSPIW MKRMHAMLYH ACPTGTAMGD LLPRPTKLAN IS