Gene PHATRDRAFT_39052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39052 
Symbol 
ID7194729 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp219154 
End bp221511 
Gene Length2358 bp 
Protein Length481 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183051 
Protein GI219125573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCGA GATGGGACAA AGGGATCTGT CTTGCAGCCT GCTTGATTGG CTTGCTACAA 
TCTCTCCCGC GACTTGCCTA CGGATTCTAC ACGGAGTCGC TTCCGCCTCG ATACCAGCAC
TCGCCATCAA AGCGACCCAC AGAACAACCG CTGCGTATAC GGCGGCGATC ATCAAATGCG
CTCAACGACA GAATTAATAG ATTTACTGCC CCTGGCACGG CTATCGGGAT TGCCGTAGAG
CCAACCCGAG ATGATGCTCC CAAGGCAACC TACAAACGCA ACAGCTCACG GAAAGGAGAC
CGATTCGCAC AAACGTCACT TGCGCGAAAT CGGAAACGCA CGACCCTGCA AAAGACTCCC
GAGCAGATCG AAACTCAATT ACAGTGCGCG TTACAGCAGT TGCGAATCTT CAGCCAAAGT
TTGCCGGATC GGGATGCTCC CATCGACATG TTATTGTTTC CAACCGTGCG TGAGTGCAAC
GCAGCTTTGG CGGCCTTCGG CGACGCCCGG GAGCTCTTAA GGGCTCTCCG TTTATTTGGT
AAAATGCGTA AAGCCACTGC TTTACAGGAA CATCTGCGGG AGCGGATTAA CTTTGCCTGG
CCCGTTCCGG TTCCCACGCT GGTCACCTAT TCGACACTCA TGTCGCGCGC CGTCAAAGCG
CAAAAACCAC GCGTTGCCCT GCGCATGTGG AATCTCAAAG GCACGGACAT TGTTCCAGAC
GTTAAGGCTG CCAACATTCT CATGAATTGC TTCGCCAAAC TTGCCGATGT CGACAAAGCC
CAAGATTTGT TGTCTCAGAT GAAATCCGGT GAGGGTCGTA TAGTCCCACG CATGACGCCC
AACTTGGTTA CGTACAACAC TCTTTTACAC GCGTGTCAAA AGGCAGGAGC CTTGGACGCC
GCTCTGGTTG CCAAGGCCGC GTTGGACGAA TCGGGGTTAA TACCGGACGC CCGTACGTAC
ACGTCGCTGA TCGCCACCGT CGCCCGCAAC CCAGCGCAAT TCTACGGACA AAACGATCCG
AGTTTGGCCT TTGCATGGCT CCAAGAAATG ACGGACCGCA ACGTGCGTCC AAACGGTATG
ACGTATTCTG CATTGATTGA CGTGTGTGGA AGGTGTCACC GGAGTGACTT GGCACTGCAG
GGATTGCGGA TTATGTTGCG CCAAAAGGCG CTGGAACAAA AATCCTTGCC CCCGTCCGAG
GTATCGTCCT ACTCGTTGCA CAGCGAAGTG GGAGCCTGGA CGGCAGCTAT TAACGCTTGT
GGCAAATCAG GACGCCTGGA AACAGCAATA CGGCTGTTTT ATGACGCCAT GCCGAGCTTT
GGATGTGAAC CCAATACCGT GACGTGTGGG TGTTTGACAG ACTCTCTGCT ACGCGCCGGG
CGGACGGCGG AGACCTTGGA CGTGTTGCGC TTCATGAAGA TACAAGGCAT TGCTCCCAGT
GAGGTCATGT ATACATCTCT TATCACCCAC GCCGAGCGAC TGGTCAAAAT TGAAAATAAA
CGCATGTACA GTCACAAAAT GGAACAAGCG GAGCAAAAAC TTCTCGACAA ATCGGGGGAT
ACGAAAGCTA TTGAAGTGTA TACCGAGCTT ATGCAATTTT TAATTGACGG GAGCACCAAC
AGAGCAAGCA CATCGAAATA TAGTCCCGAA AGCGGCTCGA AAGACGACAA TTCAAATACT
CTCTTGCTCA AAGTGTTCTT GGTCTTTCAA GAAATGAAGA CAGCTGGAGC CGAACCCGAC
GTGGCGTGTT ACAATGCTCT CCTCCGGGCA TGTGCCAGGG CGGGAGACTT TGTGCGCGCT
CAGGACGTGC TGGCTCAAAT GCAAGCGACT GATTTATCGC CGAACGATAA TTCCTGGCGT
GAATTATTGC GAGCCGCTGC CAAGATTGGA CGAAGTGATT TAGCGGAATC AATTTGGAGA
CAGGCATTAG TGTACGGTAA TAGAAGGAGA TACACTGATG AACCAGAAAC GAAATGGATG
CCGACCTTGA AGTCATTTGC CGCTTTAGTA GCATCTTATA TGCGCGAAGC GGTCGACTCA
TCGAAAGCCG CTCAAATGCG ACTGTTCCGT CGTGCTGTGA GTCTCTACGA GGCTGCATTG
TACGGCGATG ACGATTTGGG TATGAGTCGG CTAGATGTGA ATGAGCTATT GGACAGCCAG
CGCACAATGT TACTGATTCT TCAAGCTACA GTCGCGTTGG AAGCTTTGAT CGTCTCAGAC
GGAACGGATG AGCGTCGTGA GTTGCGTTCA ACCGCTGTTT CTATATTAAA GCTGGAATCC
TGTCAACGAG TTCAAACCCA TCGACTTTCC TGGACAGCTT TGGAAGCCTA TGATACAGCT
CGAAAATGGC AGGTCTAA
 
Protein sequence
MAPRWDKGIC LAACLIGLLQ SLPRLAYGFY TESLPPRYQH SPSKRPTEQP LRIRRRSSNA 
LNDRINRFTA PGTAIGIAVE PTRDDAPKAT YKRNSSRKGD RFAQTSLARN RKRTTLQKTP
EQIETQLQCA LQQLRIFSQS LPDRDAPIDM LLFPTVRECN AALAAFGDAR ELLRALRLFG
KMRKATALQE HLRERINFAW PVPVPTLQKL LDKSGDTKAI EVYTELMQFL IDGSTNRAST
SKYSPESGSK DDNSNTLLLK VFLVFQEMKT AGAEPDVACY NALLRACARA GDFVRAQDVL
AQMQATDLSP NDNSWRELLR AAAKIGRSDL AESIWRQALV YGNRRRYTDE PETKWMPTLK
SFAALVASYM REAVDSSKAA QMRLFRRAVS LYEAALYGDD DLGMSRLDVN ELLDSQRTML
LILQATVALE ALIVSDGTDE RRELRSTAVS ILKLESCQRV QTHRLSWTAL EAYDTARKWQ
V