Gene PHATRDRAFT_50461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50461 
Symbol 
ID7199312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp133490 
End bp135609 
Gene Length2120 bp 
Protein Length429 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185385 
Protein GI219130465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.450436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACAGTCAAT AAACACCAAA TTCATTCCGA CGGATCTTTG CCAAATTTCA AGACCAACGA 
ATTCGTTACA ACAACCTTTT TTTAAACTAG AGAATAGTTT GGAGCTGCGG TGAGTGGATC
CTGGGAGGTA AGCGAGCCTT GTTCATATAG ACCATATTCG AGCTCTTCAG GGATATCGTA
TCAATCGCGA CTGAGTTTCT GCTGCTTGAA AAGACCTGTT CCACTAGTGT AATTATTGGT
TCGGTCACTT TGGAACCGAT ATACCTTCAA TCCTCGTTTG ATTTGGGAGT GAGAGGTTGC
AAAGGGGTCT TGCTTTCAAG ACGGAAGTTT CCGAACACCC GATCTCCATC GTCTAGCGTT
CCCGACCGTT CCTTGTCGAG CGTGCCCAGT ATCACAATCG TGAATGTCTG AGTATGCGAT
ACTTATTGAA ATGCACAAAA CTATATATGC AATCCTGGTT TGATTTTATC CACCTTTTAG
ATTGTTTAGA TAAGGACAAC AAATTGCGAT TTGTCTCAGA AAGTCATTCT CTGGGCGTCC
GTACCACATC TCCAGTTGTC TCGCACGCTT CTCACCACGA TGAATTTTCC TCCTAATTTC
TCTAGTCCTT TGTTTCTACG ACTCTTAGAA TTTCCTGGCG AGCGCTTCCA TTTTGGTAGT
GCTTTTGGTA TTGCACGCTG AGCAGTAAAT CGTTTCGGTG TGTTCATGGA TCAAAGAGAG
GTCGACGAAG CCGAAGCGCA GCTTGCTCGA GCTCTGGGCC TGTCGAATTT CGATTCCTCC
TCGAACGATT CAAGGGATCC GGAACAGCAA TTCATCCCCG ACCCCTACGC CGAACAGGTC
GATCTATCAA AGCTGCGTGG CGTGGCCAGT GGTGACAATA AGAAAGTGGA CGACGATTCC
GAGACCGATC TGGAAGCCCT CGCCGAACTG GTTCCCGACA TCAAGTCAAG ACGCAAGAAT
ACGGATAGCA GCGGTATTGA AGTTGGATCC CATTCACACA TTCCCATGCC TGCACAGGAT
CAGTACAAGT CTTCTCCGGC ATTAGACGAA GTGATCGCAA GGGCAGATTG CGCGCCTCCT
CCCTCGGACG AGGATTCGAC CGACGGCCGT AGACTTAGCA TGCCATCGGC CACCGATTAC
TTGAAACATA CGGTATCCAC GCAGGCGAAC GGGTCCCGTA ATGTGGATGA CGTTTCGAAA
CTAAGTTCGA TGAAAGGCGG TAACAAGGAC AGGACAGGGC ATGTTTATAA GCCAAGCAAA
GGACCACGTG TACCTGCTGA TGAGGATGAA GAATACGACG AGGACACTCT TGGAGAATAT
CCGAGGTCGG AGCACGTGCT GGGGGAGGCC GAAAATGTAT CTAAGGACTT TATGAATCCG
TCTTCAGGGC AGTCTATCTC CACCGCCACA CAGACCCACG AAATGCGCAT GCCCATGTTC
CTCCCGACCT TCAAACCTGC TACGGGCTGC ACCAATGCCT CAGACTTTGT GGTGCGCTGC
TTCGTTGCCC GTCTGCGATC GGGCATTACA GTGGTAAAAC ATGGGCGATC CAGGTGGTGC
AAGTCGCGAT TGCGAGTCCT GCACATTCAT CCGGACGGTC GATCCCTCAG TTGGAGGCCT
GCTGAAGGAG AGCCCACCAC AAATAGACGC CCTCCCAAGC TCGACTTGAG TACTTGCCTC
GAAGTCCGGC ACGCCTGGAG CCCTGATCCT CACAATCCTG TCTACACAGG CACACCTATT
CTGCGACAAA AATGCGAAGC TGCGAATGCA CACAAGTCCT TTGCGTTGAT TTTCAAAGGT
CGCACCGTCG ACATCACGGC CGTCACTGCG GATCAGTGCA AGGTGTTGAT GGAAGGCTTT
TCGGCTTTGT GTTTTCGGTT GCAGGTGGCT AATCTTGCTG GTCGCAAAAA AACTCGGCCC
ATGCCGGAAG AAGATGGTAT CAGCACAACG GCCAGCAACA CTCTGACCAA CAATTCCTCG
GCTCCCCGTA GATAATATGC CGCACTTATA TTTGCACCAA TAGAGAGGCG AGAGGCAGTG
TCCTTCTCGC TACTCTCTAT GTATGGAAGC TTACTGTTAA TTCAACTCGC ATATTTTAAA
AAAACTTTCT ACCATGTCCT
 
Protein sequence
MDQREVDEAE AQLARALGLS NFDSSSNDSR DPEQQFIPDP YAEQVDLSKL RGVASGDNKK 
VDDDSETDLE ALAELVPDIK SRRKNTDSSG IEVGSHSHIP MPAQDQYKSS PALDEVIARA
DCAPPPSDED STDGRRLSMP SATDYLKHTV STQANGSRNV DDVSKLSSMK GGNKDRTGHV
YKPSKGPRVP ADEDEEYDED TLGEYPRSEH VLGEAENVSK DFMNPSSGQS ISTATQTHEM
RMPMFLPTFK PATGCTNASD FVVRCFVARL RSGITVVKHG RSRWCKSRLR VLHIHPDGRS
LSWRPAEGEP TTNRRPPKLD LSTCLEVRHA WSPDPHNPVY TGTPILRQKC EAANAHKSFA
LIFKGRTVDI TAVTADQCKV LMEGFSALCF RLQVANLAGR KKTRPMPEED GISTTASNTL
TNNSSAPRR