Gene PHATRDRAFT_43519 details

Gene Information

Locus tag: PHATRDRAFT_43519
Symbol:
ID: 7197560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 701657
End bp: 702960
Gene Length: 1304 bp
Protein Length: 361 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002177667
Protein GI: 219111831
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GATACCAATA TCAATCTCGC TGTACGAGAG TGATCAACAG GGGCTCCTGA AATTTGGTTT 
CCCGGAAACG TTGAGAGAAC AATCCCATCT TTCATCTATC GCTTGTCTGG ATGTGGATAT
CAGCTGCAGA CTCGTCGTAT AGTAGTGAAT CTATCTAGAC ACAAGTTGTA TGCAAGAGTA
ATGATCAGGC AATAATGATA TCGGATGATT TTACGGTTTC GTTTAATGCA CGGGATTCTA
TTCTCTATGC TCTGTCGATT GGGTTCGGCT CCTCCTTGGA GCGATACGAT GAAGATCGGC
GCTACGTATA CGAAGAAGAT ATCAACTTTG CAGTCGTTCC AACATTCGCG ATCACCTTTA
CTTTTTGGGC CAATCAATAT CGGAGATCAA TTGGTGACAT TCCGCCATTT CCTCCGCCTC
TCATGAGTTC TGCTGGTGTT TTACCCCAAG GATGCCTTCG AAATGGCGCA TCAATAGATG
ATTTACCGAT CATTCAAACC GAAATTTCTG TCGTTTTCCA GAACGCGCTT CCTGTCCCAA
AAAGTGGTCA AACAGAACCG ATGCGAGTAA GCCAGTCCTT TGTGTCCGTG TCTCCGAAGT
CCATCGGCAC TTTCGTGACC ACAGAAACCA AAATTACGAA TAATTGCCAC ACTCTCTGCA
CCATTACGTC CACGGCTCTC GTTTTGGGCG TACCAAGTAG TCATGTTAAC CCCATGCAAC
CCACGGACAT GATTCGGGAG GAACAACACC CGTCAAAAGA TGCTCTCCAC GAGCTTTTGG
TCGAATGGGA TTATACTGTG CCTCCCAATC AAACACTTCT ATATCGGCAG ACCAGTGGTG
ATTCCAATGA AATTCACGTC AATCCGGATG CTCTGCCAGC CACACTAGAG AAGCAAGCTA
GTAAAATTCG CGTAGATCCT GATTCGCAAC CTGACGTACA GAAAGAACAA GACCGTGACG
ATTCTAGTAG GCGAAAACTT CGTTTACATG GGCTCAGCAC TCTAGGAATC GCAGTACGAG
CTTTGATACA CTATACAGAA GACAACTACC CAGGTTCATC GCTTCAAGCT GTCAAGGCAT
GTTTCACATA TCCTGCGTTC GTGAATGACC GTATCACTGT AAAAATTTCG GGAGCCAAGA
ACGATTCATC CTTACAGTTA GGCAAGAGTG TATTTACTTT TCTAGTACTG AACAAGACAA
GCGGTAAAGT CTTATTGAAA AACGGTTATG CTGAGTTTGC CTGGAACCGT TCAACCTTAC
AGCAGCAATC AAGGCTGTAA GCCTGAGCAA TTTATACCCT AAAG
 
Protein sequence
MISDDFTVSF NARDSILYAL SIGFGSSLER YDEDRRYVYE EDINFAVVPT FAITFTFWAN 
QYRRSIGDIP PFPPPLMSSA GVLPQGCLRN GASIDDLPII QTEISVVFQN ALPVPKSGQT
EPMRVSQSFV SVSPKSIGTF VTTETKITNN CHTLCTITST ALVLGVPSSH VNPMQPTDMI
REEQHPSKDA LHELLVEWDY TVPPNQTLLY RQTSGDSNEI HVNPDALPAT LEKQASKIRV
DPDSQPDVQK EQDRDDSSRR KLRLHGLSTL GIAVRALIHY TEDNYPGSSL QAVKACFTYP
AFVNDRITVK ISGAKNDSSL QLGKSVFTFL VLNKTSGKVL LKNGYAEFAW NRSTLQQQSR
L
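
Consistency check (illustrative sketch)

The length and GC content fields in this record can be checked against the coordinates and the blocked sequences. The following is a minimal sketch, not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length (702960 - 701657 + 1 = 1304). The function names gene_length, strip_blocks and gc_fraction are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def strip_blocks(blocked: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    # str.split() with no argument splits on any run of whitespace,
    # so both the spaces between blocks and the line breaks are removed.
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = strip_blocks(blocked)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)
```

Calling gene_length(701657, 702960) returns 1304, matching the Gene Length field. Pasting the gene sequence into gc_fraction, or taking len(strip_blocks(...)) of the protein sequence, allows the GC content and protein length fields to be compared in the same way.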