Gene PHATRDRAFT_37749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37749 
Symbol 
ID7202290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp861848 
End bp863587 
Gene Length1740 bp 
Protein Length579 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181819 
Protein GI219122993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCCGA CACGGACGAG TATCCTGCTG TGTGTCGTGA TGTTGTTGTC GAATGCACGT 
GGGGAACACA GAACCGCCTG GCAGGCTCTG CGTAGCAGGT CTGTGCGTCC GCAGCCCGTG
CCATTGCTTG CGCCACCACC GTCGGCACCA TTCCGATTCT CGTCTGGCGC AAAGTCGAGT
ACGATTCTCT GTGCTAAGAA ACCAAACACA AAAGCCAAGC CGTCCGTTGC ACCTCTCCAA
TCGTTGTCGC GGAAAAATCG CATTCAAAGC GTATTGGATT GGGCACAAAG AGCGGACGTT
CAAGTAAGCA AGGAAATAGC GTTGGATTCT CGAGTGGCCG AGTACGGGCT CGGCTGGTAC
GCCTCCACCA ATATTCCCAC CAATCAAGTT TTGCTGAGTG TGCCCTCCAA TCGAGCCTTG
ACAGTGGAAA TTCCCGGTGA GGGACCGGAC GATCGCTCCG TTCTGGACTT GGTGGCGAGC
TCGGACAGTG GCAGCAAGAC AGAGGTACGG GCCTTGCCCT GGTTTGTGCA AATGAGTCTG
TACATCTATA AATTGGACCA AGTCGATGCG GACAAAGAAG GTGTTGATAT GCGCCCCTGG
TTGGATTCGC TACCGAGGTC TTTTGATACC GTCATACATT GGTCCGAGGC AAATCGGCAA
GAGTTACAGT ACGATTCTAT GGTAACTGCC GTGGCCAGTC AAGAACAAGA TTGGAAACGG
TACTACCAAT CGCTCTTGCA AGCTGGAGCC TCATCGTCGT CCTTGACATG GGAGCAGTTC
CTGTGGGGTT GTGAGATTGC TCGATCACGA GCCTTCTCCG GAGGATTTAC AGGATCCGCC
TTCAATCCAG GAGTATACGC CTTTACGCTC TTGCTCGTCA CAATCTATGT GGGTCTGGGT
GTGGGTAGCC TCGAACAAGC AGCCAACGGA GCTGGTGTGG TCTTTTCCGC AAGTATACTC
AAGGACTTTG TGTTGCCCAA ACTCTTCAAA AAGAGGCGAT ACGTAATTTG TCCCATGATT
GATATGGCCA ACCACCAGTC GGTTAAATTT GCTGGCCAAG TCTCCTTTGA GTACTTTGCT
AATGCTTACA GTTTAGCCAC GGATCAAGCT ATTCCGTCCG GTGACGAAGT TTACATTTCC
TACGGGCCGC GATCCAACGA TCAGCTATTG CAGTACTACG GATTTGTTGA GCGCAACAAT
CCAAACGATG TGTATGTCAT GCCACCTCTA CGAGAGTGGG ATATTGAAGC CTTGGAACGG
GCCACGGATC GCAAGTTTGC GGTGGGACGG TTGGAAAAAC TCAATCGTGC CGGATTGTTG
GGGAGTGCAA CGACGGTACT TTCAGACAAA AAGTACGACG AGACGGAGGT TGCCAACGCC
AATGGGGGCG TTGTGATAAC GCGCGTGTTG GGCCTAGACC CGGCCATTCT TCAAGCCTTG
CGAGCACTCG TGTCGACAGA GGACGAATGG AATGCCGCGG GCCAAGCAGT CGGCAGTTTT
GCGGAAGAAG GGTCGGGCGG AGCCGCCAAC GAGGCAGCCG CTCGGCTAGC GGCGCGAACG
GCGGTCGGAA TGGAGCTCCA ATCAAAAGAG ACCACCCTGC AAGAAGATGA AGCCCTACTC
CAACGAATGG ACACTGTGAA AAGTATGGAT GCTAGCAGGG AAGAGAAATT GGCGGTCCAA
TTTCGGATCG AAAAGAAAAA GTTGTTGTCC GAAACGCTGG ACAAGTTGTC AGTAAGGTAA
 
Protein sequence
MWPTRTSILL CVVMLLSNAR GEHRTAWQAL RSRSVRPQPV PLLAPPPSAP FRFSSGAKSS 
TILCAKKPNT KAKPSVAPLQ SLSRKNRIQS VLDWAQRADV QVSKEIALDS RVAEYGLGWY
ASTNIPTNQV LLSVPSNRAL TVEIPGEGPD DRSVLDLVAS SDSGSKTEVR ALPWFVQMSL
YIYKLDQVDA DKEGVDMRPW LDSLPRSFDT VIHWSEANRQ ELQYDSMVTA VASQEQDWKR
YYQSLLQAGA SSSSLTWEQF LWGCEIARSR AFSGGFTGSA FNPGVYAFTL LLVTIYVGLG
VGSLEQAANG AGVVFSASIL KDFVLPKLFK KRRYVICPMI DMANHQSVKF AGQVSFEYFA
NAYSLATDQA IPSGDEVYIS YGPRSNDQLL QYYGFVERNN PNDVYVMPPL REWDIEALER
ATDRKFAVGR LEKLNRAGLL GSATTVLSDK KYDETEVANA NGGVVITRVL GLDPAILQAL
RALVSTEDEW NAAGQAVGSF AEEGSGGAAN EAAARLAART AVGMELQSKE TTLQEDEALL
QRMDTVKSMD ASREEKLAVQ FRIEKKKLLS ETLDKLSVR