Gene PHATRDRAFT_46493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46493 
Symbol 
ID7201826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp503811 
End bp506227 
Gene Length2417 bp 
Protein Length787 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180844 
Protein GI219120200 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0749134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTTTCTGTT CTTGAGGCCA CATCCACTTG AAACCTTGCT TGGTGTTCAT AGCATGGGAC 
CCAACGCGGC CGAAGACGCC GAGGCAACGG CTTTATTGCC GAGGCGAAAA GGGAACGAGA
TCGGTTCCTA TCAATACCCG ACAGCCCCTT CGTCAACGCT GGAAGAGGGC CATTACTCGT
CGCCTGAGAG TAATGGCAGC AGAAGTGATG GTAGGGATCA TGCCGTTCAT TTTTCTTTTA
CGAGGCGCGA CATTTCGAAC AAATTTGTTG AAGAATCCTG GTGTAATCGT TTGGGAAGGT
CATGCCTTTC GTGGACTTTG CTCCTGATCT TTTTCATTCT TGTTTTTGAA GGATGTCTTA
TATACCTTTC TTATCGAACG ATGTCTCCAG CGGCCGTGCC AATGTTAAAT CTGTATGATT
ATATTGTTGT GGGTGGTGGT CCCTCTGGTA TAATTGCAGC GACAAAGCTT GCACAATCCT
TTCCGACACT TCAGATATTA CTGCTCGAGT CCGGGACCGA CAGTCAAAGT TCAGTTCTCA
AACAACAATC TATACTAAAA GAAGGTGCAA CAGTCTCCGC GGCCGAATCT GGTAGCACAC
TTTGGCAAGA GGACGCTTAC CAACTCAACA AATTTGACGT GCCACTACTG TGGAGCGGCG
TCGCAAGCAG TCGAGGTAGA CGGGACGTCC TACATCTGCA AGCCCCGTCT TGGTCCTCCT
CGCATCACTG GCCCATAGAT AAAACACTTA TGGGGCGTGG TCTAGGCGGA TCCGGGCTGC
ACAACGCAAT GATTTACGTC CGTTCGCTGC CGACCGATTT GGAAGCGTGG AATGTTACGG
GGTGGACCTA CGACGATATT CTACCCCACT ACGTGGCATT GGAGCAATAC GTAGAGGATC
ACATACCGTC ACAGCCTTTT TGGACAAACG ATCAAGGATC CACAATTTCG AAGGCAAACT
GGCGCGGTAC TACTGGTCCG ATACGAACTA TACCTGCCGG TAGTGCGGTG GACGCCTTAG
CACCGCTCTT CGTACAATCG GCAGTTATAA GTGGCGAACG GTTAGCCAAG CGTGGCTTCA
ATCACCCTAG CCCGGCGGCT CGTCTTGGTG CAGGGTACTA CGAATTCAAT ATTCGGCATG
GTGTTCGCGA CTCGGTTGCG CACGCCTTGC TCGGAGGACA TAAGGCAGTA CCAAGGAATC
TGATAGTTCG TACAGGTCTG ACTGTTACGA GAGTGACAAC CAAGCCGAGG CGGAACGAAG
TACCGCGAGT GACAGGCATA GAGTATTTTC ATAGTGCAAC TGGACGGATG GGGAAATTTT
TGCTGCGTTC GGATGATGTT TCCGAAGTTA TCCTGGCGAC GGGTGCCATT ATGACTCCTC
AATTGTTGGC CAACACTGGT ATAAGACCTG GAGGATCTGT TGTACACCTT CCAGGCGTGG
GTCGGAATTT GCAAGATCAC CCCGTAGTGG CACTCAAATT TAAACTGGTC GCAGAAATGG
AGCAGGACGC CTCTTCAATT TATACTCTTG GAGATGAAAT GGAAGACTAC GTCTTATCTG
TGGCTGGTTT GGAAGATGGC CAAGCTAAGC ATAAGAAGCT CTCCAACTCA AGTCTCTCTT
TGCAGCAAGC TTTGTACAGT CGTCTTGGCA CACTGGGAAC GGCCGGATTT TCGGCGGGCG
CGTTTTTGCA GTCACCATTC GCCAAGCACG ATGTTCCCGA TCTTCAAGTG ACCGTATTTC
CTCGCGAAAT AGAGCCGCAT GTGACTCGAA AACAAAACGC CAACGAACGA GCCCAAATGC
GGTGCCGGTC TATGCTGATC ACGGTCGCGC TACTACAACC GGACGCTCGG TACCAGGTTG
AACCACTGTT GTCGGATTTG ACTTCAGCCA ACGAAATCTT TGAGCAGACT GCTGAGACAG
AGCGATCAAT GAACGCATCT GAATCGTCCG TTCCGTTAAC ACATTATCTT GGATACAATC
TGCCATCCAT TGAGCTGCCG GCAGGCCGAT CAGAATATTT GTCTAAACGA GATGTGCGAG
TATTGGCGTG GGGAATAGAG CGTGTTCGTG CAATCCAAAA GATGCCACCA TTATCGCAGG
CGACCGGCGA TGAGCTGGTT CCTGGTGCCG AGCTAGTCGG TGAGTATTTG GAAAATCATA
TTCGGGTTGA AAGTATGCCC AACAGTCACT GGGTCGGGTC GACTAAAATG GGCCCAGACA
GTGATACTTT GGCTGTTGTC AACGAGCGAC TAGCCGTACG CGGAGTACAA GGACTGCGGA
TTGTGGATGC CGGGGTTATT CCTCAGGTTC CGAATGGCAA TACGCACAGT ACGGTATGTG
TTGTAGCTAG TCGTGGCGCC GAACTCATTG AGCAAGATCG ACGGAAGGCA AGCCAGCAAT
CCAATAATCC AAACTGA
 
Protein sequence
MGPNAAEDAE ATALLPRRKG NEIGSYQYPT APSSTLEEGH YSSPESNGSR SDGRDHAVHF 
SFTRRDISNK FVEESWCNRL GRSCLSWTLL LIFFILVFEG CLIYLSYRTM SPAAVPMLNL
YDYIVVGGGP SGIIAATKLA QSFPTLQILL LESGTDSQSS VLKQQSILKE GATVSAAESG
STLWQEDAYQ LNKFDVPLLW SGVASSRGRR DVLHLQAPSW SSSHHWPIDK TLMGRGLGGS
GLHNAMIYVR SLPTDLEAWN VTGWTYDDIL PHYVALEQYV EDHIPSQPFW TNDQGSTISK
ANWRGTTGPI RTIPAGSAVD ALAPLFVQSA VISGERLAKR GFNHPSPAAR LGAGYYEFNI
RHGVRDSVAH ALLGGHKAVP RNLIVRTGLT VTRVTTKPRR NEVPRVTGIE YFHSATGRMG
KFLLRSDDVS EVILATGAIM TPQLLANTGI RPGGSVVHLP GVGRNLQDHP VVALKFKLVA
EMEQDASSIY TLGDEMEDYV LSVAGLEDGQ AKHKKLSNSS LSLQQALYSR LGTLGTAGFS
AGAFLQSPFA KHDVPDLQVT VFPREIEPHV TRKQNANERA QMRCRSMLIT VALLQPDARY
QVEPLLSDLT SANEIFEQTA ETERSMNASE SSVPLTHYLG YNLPSIELPA GRSEYLSKRD
VRVLAWGIER VRAIQKMPPL SQATGDELVP GAELVGEYLE NHIRVESMPN SHWVGSTKMG
PDSDTLAVVN ERLAVRGVQG LRIVDAGVIP QVPNGNTHST VCVVASRGAE LIEQDRRKAS
QQSNNPN