Gene PHATRDRAFT_20899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20899 
Symbol 
ID7201858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp802767 
End bp804647 
Gene Length1881 bp 
Protein Length496 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180899 
Protein GI219120316 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGTCT TTCATTTGAC TCTGAATTTG ATCAGCGCGT TCCTCTATTG CATGAATTAC 
TACATTGTTG AGCCATCTTC AACAATGTAC GTCAATCGAC TCGGTGCGCA CGACGCCATG
TCAGGGACAC TTATTGGTAT GATGCCCCTG GCTGCCTTTG CTTCGAGTTT ACCCTATAGT
ATTTGGACCA ATCGATCTTT TCGACAACCC TTCATCGCGA GTGGCGTCCT ACTAATTTGC
GGCAACCTGC TGTACAGCAT GGCCGACCGG TTTCAACGTA TCGATATTGC GCTGGCTGGA
CGATTTATTG CCGGTCTTGG TGCTCCAAAG TGCATCATTC GCAGGTATAT GGCTGATACG
ACTCCACTCG CTCTGCGCAC ATCTGTCAAT GCAGGATTTG GTATGGTGGT TGCCGCGGGG
TCCGCCATGG GACCGGCAAT GGCCGTTATG TTGAATCGCA TCGAATATAC GGTGGCCTTT
CCCTATATTG GCGTCATTTC CTTGAACGGC TTAACTCTGC CTGGATACTT TATGGCATCA
CTCTGGCTGA CCTTTACAGT CATTGTACTG TTGACTTTCG AGGAACCCGA TCGAGAAGGA
TTGGAGGAAC AAAAGTTGCT GGAGAGTCAA GGGGATATTC TCGTTAGTCC GACAAATCGC
TCGACCACCG ATAATTCCAC TTCGTATCGA AACCAGTACA ACGGTAGTAT CATAGGTCAC
TATGGCAAGT ATTCTGAAAT GCACGACAGC GACGATCGAT CGTTCGAGAT TAAATCCCAG
CAGCTGTCAC AAGATATTCC GCTTGGGTAC GAGTTACCCG AAGACTCGAC TTTTTGGCAC
AGAATCCATT ACTTTTTTGC TCTCATCACA TGGCCGGTTC GCCTATGTCT GGGTCTGCTC
TTTTGCAAAG TTTTTACAAT TGAAACCCTT GTTAGTGCAA CATCGGCGTT GTCCAAGAAT
CGGTACGGGT GGCAAGTAAA CCAGGTTGGA ACACTAGGGT TCATCATTGG TTGTTTGGTC
ATTCCATTTT CCATCTTGGT GGGAAGATTG TCTATGTCGC ATCAGGATCA CGTTTTGATG
CTTTGGTTGG TTGGCACGGG GTGTTTGGGC ATGTTTCTCT TAATCGATCT TTCCGATTTG
GTCGAAACGC AAGATCGGCA CTACAACGAA GGCCATCCAT TGGCTGTTGG TCCCAACCGA
TACATTTGTG GCTACTTTTT GTCATATCTG TCAATACAAT CCTTCGAAGG AGTGATCGGC
TCGACGTTAA GCAAAGTGAT TCCGACTGCA CTGGCCTCCG GAACAATAAA CTCGGGACTA
TTGGCCACCA TGGTTGATAC ATTTGGTCGT GCCTGCGGAG ATCTCTTTAT CTCTGCTGTT
GGCTTTGTTA ACCTACGCCA GCTCATGAAT TTATTGTTCA TACCTGGTTT TGCGATTATG
CTGATCTGTT TCGTCGTCAT CGAACGATTC CGAGACTTAC TGTCAGTGTA AAGCTGCAAC
CACGACGAAA ACACAATGAA CACCTGGTAA ATCACCAACT TTAAGGTCTT TTACCCAGAA
AATGCATATC TGTTCAAAAA TTCAGATGGT GGTCAGACTT TCTGCTTCCG AACGAAACAT
GCCCCACAGG ATAGATCCGA TGGCCTGCAG AGCCATGCCT GTTTCTTCCA CATCTTGGAT
TCGGGAACGA CTCAAACCAT TTCGACTCAC CATGACGGTC GAAGTTGCCG TGACTAACGA
CTGAGCTACA TGAATGCTGC GTTGTTATCA TTTGCAATCC ACTTACAATG AATGCAACAC
CCTGACAGTA CATGATTATT GAGGAAATGA GATGTCTGCT TAGAAAATAC CGAAAATAAA
TCTAGAGAAC AGTTAGAAAA T
 
Protein sequence
MQVFHLTLNL ISAFLYCMNY YIVEPSSTMY VNRLGAHDAM SGTLIGMMPL AAFASSLPYS 
IWTNRSFRQP FIASGVLLIC GNLLYSMADR FQRIDIALAG RFIAGLGAPK CIIRRYMADT
TPLALRTSVN AGFGMVVAAG SAMGPAMAVM LNRIEYTVAF PYIGVISLNG LTLPGYFMAS
LWLTFTVIVL LTFEEPDREG LEEQKLLESQ GDILVSPTNR STTDNSTSYR NQYNGSIIGH
YGKYSEMHDS DDRSFEIKSQ QLSQDIPLGY ELPEDSTFWH RIHYFFALIT WPVRLCLGLL
FCKVFTIETL VSATSALSKN RYGWQVNQVG TLGFIIGCLV IPFSILVGRL SMSHQDHVLM
LWLVGTGCLG MFLLIDLSDL VETQDRHYNE GHPLAVGPNR YICGYFLSYL SIQSFEGVIG
STLSKVIPTA LASGTINSGL LATMVDTFGR ACGDLFISAV GFVNLRQLMN LLFIPGFAIM
LICFVVIERF RDLLSV