Gene PHATRDRAFT_16073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_16073 
Symbol 
ID7198316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp399655 
End bp401076 
Gene Length1422 bp 
Protein Length474 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184362 
Protein GI219128317 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.760064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTCG ATCGCATGAC GGAAATTCAA GCCAAAACCT TCGATGCCGC GTCTTTGGGA 
AAGGACGTGC TCGGGCGCGC CCGCACCGGA TCCGGCAAGA CCGTCGCCTT TCTCCTACCC
GCACTGGAAC GACTCTTGCA GGACAACAAC AACAAAAGCA ACAAAAAGTC GACGCGCATG
CTCGTCTTGT CGCCCACACG AGAATTGGCG CAACAGATTG CCGAACAAAC CCGGCTCTTG
ACGGCCCATA TGCCCAATAT GTCGCACCAA GTCATGGTGG GGGGAACCCC CAAGCCCAAG
GACGTCTCGG CCATGAAGCG CAAGGTACCC ACCATTATCA TTGCGACTCC TGGACGACTA
CAGGATCATT TGGAATCCAC CGTCGTACAC AACACGCCTT TTAAGGATCT CTTCCGGGAA
CTCGATGTGC TCGTTTTGGA CGAGACGGAT CGACTCCTCG ATATGGGCTT TCGTCGAGAA
ATCGACAAGA TTATCAAATA CCTCCCGCGC AACAAGCAGA CGCTCTTGTT CAGCGCCACC
ATACCGGAAG ACGTCAAGCA CGTCATTCGA CAAACCATGC GCGACCCCTA CATCACGGTG
GATTGCATAC ACGACGATCA GGCCGAATCC TCCTCCCACA CCAACGCACA GGTATCGCAA
GCTCACGTCA TTCTCCCGAC CAACACCCGC ATGGCATCCG GCACGGTAGA CATTATCCGG
AACATTCTCG AAAAACAACC CCACTCGAAA ATTGTCGCCT TTTTCCCCAC CGCCAATCTT
GTCGCCTTTT ACGCCTCGCT CCTACGGGAC GTCCTCGAAA TCCCCCGCAT TCTCGAAATA
CACTCGCGCA AATCACAGTC CCAACGCGAA AAGGCCTCGG AGAGCTTCCG CAAAACCAAC
CACGGCTGTT TGCTCACTTC CGATGTGAGT GCCCGTGGAG TAGACTACCC CGACGTTACG
CACGTCTTGC AGTTCGGCGT GGCCGATTCC CGCGAATCAT ACATTCATCG CCTCGGACGG
ACCGGACGCG CCGGTAAACT CGGACAGGGC ATCCTCGTCC TCACGGACGT CGAACGCGGC
TTTCTACGGC ACCTGAAGGG ACTCGATATT CCTGTCCACC CGGAACTACA AGCCATTGTG
GACGGGCCCA CGGTCGAGTC GCAGCAGGAC CTTGCGCCCG TCTGGGCATC GATCGGTTCG
GGACGGAACG CGGATTTAGC CCTCAAGGCC ACCAAGGCCT ACGTTTCCGC ACTGGGATTC
TACAACACCC ACCTCAAGGC TCGCTGTGGC GTCAAGGGTA CCGACGCTTT GGTCGCTTTT
TGCAACGCCT TTGCCTACCA GGTGGGATTC ACGACGCTGC CCCCGATTGA GAAGAAAACA
ATTGGCAAAA TGGGACTCAA GGGTATTCAA GGTTTGAACG TG
 
Protein sequence
MGFDRMTEIQ AKTFDAASLG KDVLGRARTG SGKTVAFLLP ALERLLQDNN NKSNKKSTRM 
LVLSPTRELA QQIAEQTRLL TAHMPNMSHQ VMVGGTPKPK DVSAMKRKVP TIIIATPGRL
QDHLESTVVH NTPFKDLFRE LDVLVLDETD RLLDMGFRRE IDKIIKYLPR NKQTLLFSAT
IPEDVKHVIR QTMRDPYITV DCIHDDQAES SSHTNAQVSQ AHVILPTNTR MASGTVDIIR
NILEKQPHSK IVAFFPTANL VAFYASLLRD VLEIPRILEI HSRKSQSQRE KASESFRKTN
HGCLLTSDVS ARGVDYPDVT HVLQFGVADS RESYIHRLGR TGRAGKLGQG ILVLTDVERG
FLRHLKGLDI PVHPELQAIV DGPTVESQQD LAPVWASIGS GRNADLALKA TKAYVSALGF
YNTHLKARCG VKGTDALVAF CNAFAYQVGF TTLPPIEKKT IGKMGLKGIQ GLNV