Gene PHATRDRAFT_20905 details


Gene Information

Locus tag: PHATRDRAFT_20905
Symbol:
ID: 7201865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 829084
End bp: 830684
Gene Length: 1601 bp
Protein Length: 332 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002181081
Protein GI: 219120696
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTTGTGA CAGACACCGT AGAACAGTTC CGAACAAAAC CCTAATCATG ACGAGTCGTT 
GTAGAGTGCT ATTCTCGAGG ACGCTGGGTC CTCGGTTTCC AAAGTACGTC GGTAGTAACC
GTCGAGCCCT GTCGATGGAT GGCAGTGGCG GCGCTCACAA ACCGGACAAG GACAATGCCA
TCCCCAGCAA CGGTCGCAGT GATCCAACGG CAGGAGTGGA ACCTCCTTCC ACCTACAAGA
CTAGACTTCG GCAACTTATG GGAAAATCAC TCCAGCCACC AGGTACGTGA GATTTGGATA
CCTTTCCGTC AATCGGAACC TTTCGTGTCC CCTGCACTCC TGGCATACGA CACACACCGT
TAGATATACA CAAACATTCT TGGACGTGAG TACTGACGCA TTTTACTGTG GTCTCGGTTG
CTTTACTCGC TCTACTTTAC ACACACACAC GCGCAGCCGG ACGCTTGTGG GCACCCGACG
TTCTCAGCGA CGATGAAGAC GACGACAACG ATTACCGCCA ATTCATTCCG CAGATTGCCT
TACACGAAGG AGACGGCCGC GCACGTAAGC GTGTTTTGGT CCTGTGCACC GGTGGAACCC
TCACCATGGC TCCGGATCCC AATCAAGGTG GGGCGCTGGC TCCGGTGGAA GGGGCGCTGT
CCCAGTATAT GCGGGAAATG CCCGAACTAC ACGCCGACAA CATGCCAGAA GTGGTCCTAC
ACGAGTACTT CCCCTTTTAC GATAGTTCCG ACCTAGGTCC GGCCGATTGG GCCCGTCTCG
CCCACGATAT TCGCGCCAAC TATTTGCATT TCGACGGTTT CGTCATTGTC ATTGGGACCG
ACACCATGGC CTACTCGGCC ACCGCACTTT CCTTCATGTT GGAAAACCTC GGTAAACCGG
TCATCTTCAC GGGCAGTCAA ATACCCATGT GCGAACCGTA CAATGACGGG AGACGCAACC
TCATTATGGC GCTTATTTTT GCCTCGCGCG ATACCATCAA CGAAGTCAGC ATCTTTTTTC
ACGATCGTTT GTTGCGGGCC TGTCGCGCTA CCAAGGTCAA TACCCACCGC TTGTTGGCCT
TCGACTCGCC CAATCAGGAT CCACTCGCTA CCATTGGTAT CACCATTGAC GAAAACGAGC
ATCTCGTATT GCCGCCCGCC AAGGGCGCTC TGCGGGTGCA CTCACGGATG GACACCAGAG
TGCTGACTAT TCGACTCGTC CCGGGCTTTG ACGACGCCAT GATTCGAGAA ATGATCAATC
ACAATCTCCA AACGAACCTG CTCCGTGCAC TCGTACTGCA ATTGTACGGC ACGGGAAACA
TTCCCTCCGT TAAGGAAAGT TTCATTCAAC TCTTGGCCGA CGCCTCGGAC AAGGGCATCC
TGGTCGTCGC TTCGACCCAG TGTTACACCG GCTCCGTCAT GATGGGACAT TACGCCGTCG
GTAAGGCGTT GGAAAGTGCC GGTGTCGTCA GCGCCGCCGA TATGACGCAA GAAGCGATTG
CCTGCAAGGT AGGATATCTG TACGGACGAG GTGACCTCTC GCACGCCGAG GCGAGTAATC
TTATGGGGGT TTCCCTGCGG GGAGAAATAA CCCCCCAAGA A
 
Protein sequence
MAPDPNQGGA LAPVEGALSQ YMREMPELHA DNMPEVVLHE YFPFYDSSDL GPADWARLAH 
DIRANYLHFD GFVIVIGTDT MAYSATALSF MLENLGKPVI FTGSQIPMCE PYNDGRRNLI
MALIFASRDT INEVSIFFHD RLLRACRATK VNTHRLLAFD SPNQDPLATI GITIDENEHL
VLPPAKGALR VHSRMDTRVL TIRLVPGFDD AMIREMINHN LQTNLLRALV LQLYGTGNIP
SVKESFIQLL ADASDKGILV VASTQCYTGS VMMGHYAVGK ALESAGVVSA ADMTQEAIAC
KVGYLYGRGD LSHAEASNLM GVSLRGEITP QE
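
The reported gene length follows from the start and end coordinates when both are treated as inclusive, and the sequences above are displayed in 10-letter blocks that need to be joined before any length or GC calculation. The following is a minimal sketch of those checks, not part of the original record; the function names (`inclusive_length`, `clean_sequence`, `gc_fraction`) are hypothetical helpers introduced here for illustration, and the inclusive-coordinate convention is an assumption that happens to match the listed 1601 bp.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def clean_sequence(blocked: str) -> str:
    """Join a sequence shown in spaced 10-letter blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if the sequence is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Gene Information table above.
print(inclusive_length(829084, 830684))  # 1601
```

To check the record itself, pass the full gene sequence text to `clean_sequence` and compare `len(...)` with the listed gene length, and `gc_fraction(...)` with the listed GC content.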