Gene PHATRDRAFT_21996 details


Gene Information

Locus tag: PHATRDRAFT_21996
Symbol:
ID: 7203106
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 375336
End bp: 376677
Gene Length: 1342 bp
Protein Length: 417 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182210
Protein GI: 219123810
COG category:
COG ID:
TIGRFAM ID:
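
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which matches the listed 1342 bp. The function name `inclusive_length` is hypothetical and chosen only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = inclusive_length(375336, 376677)
print(gene_length)  # 1342

# Nucleotides needed to encode the 417 aa protein, excluding the stop codon.
coding_nt = 417 * 3
print(coding_nt)  # 1251
```

The 417 codons account for fewer nucleotides than the full gene span, which is consistent with the record marking the gene as spliced.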


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACGAACAAA GCAGTATCCC ATATTATTCG TACCGAAATG CAGGCTGTGG CTCCTCTGAC 
TGCTGTACAG ATTCCGTGTT GCCTTTGTGG CGTGCTTACT GCGCCGAACG CAGCGAATCA
GTGCGCTTCG TGTCTGGCAC AGGAATTTGA TCTGAAAGGC CGCTTGCAGC GGGGTCCATC
GGGGGCACCA TTTGCGACAA CTTACCAGTG CCGGGAATGC CGTCGGTTCC GACGGACTGA
AAAGCACCAC GAACACGCCG GACCAGAATC CCCGGAGCTA CTAGCGATTT GCTTAAAAGC
AATTCCGGCG CTCCAATCGA CGGCTGAACC CCGCTTACAT TTGATCGATG CTGGTTGGGT
ATGGACGGAA CCACATTCCA TGCGGTGGAA GGTACGATTG ACGGTGAGGA CCGAAATTCA
GGCCGTGACG GTCCAACAAC GTGTAGTCGT TGAACTCCAC AATGCCTTTC GGCAGTGCAA
TGATTGCAAT CGTGAGTTCA CCAATCGTAC CTGGCAAGCG CTCGTCCAGC TGCGTCAAAA
ACGCTCAGAC GATGCGCCCA AGAAGGGACT GACGGCCTTG GAAATGGCGC TGGCGAAGAA
TAAGGAAATT CGAAAGCATG TACTCAAGAT CGATGCCGTA CGGAATGGTT TTGATTTCTA
CTTTCTGTCG CTGTCCTACG CCCAAGCGTT CAGCGCTTAC CTACAACGAG TTGGTCCGAT
GCGCGTCAAG ACCAGTAAGA AGCTCGTTTC ACAAGATTTT ACGAACAACA CAGCGAATAT
GAAGTACACG GTAGTTTGTG ATTTGGTACC ATTTTGCAAA GACGACTTGG TTTTGATTAA
GAAGGGCGCT AAAGGAAAAT TGTCGGGACG CTTAGCACTC GTGACGAAGG TGTCGAGTGT
CGTACATTTG ATGGATTCAT CTCCGAAACG GGAAGCGTTG CTCGATAGTC AAATGGAGCT
GTCGCCAGAC GCGTATTACA AACAAGAGAA GCTGTACACA ATTCTACAAG CGTCGAACCG
AACTATCCCA TTTGTGGTTT TAGATGTTGA CTTGTGTCAG CACGATGGGG GCGCAATGGA
TGAAAGTGGC CAGCCACTAT ATGCTGGAGT AGAAAATAGC GTGGAGAAGT ACTGTTTGGC
CGATGTTCAG GTTGCTCGTC AGTCGGATTT TGGGGTCAAT GATGAAGTTT TCAATTGTGT
CACGCATCTA GGCCATTTGA TCAGACCAGG TGATGTCGTC ATGGGATACG ATTTAGTTGC
AACGGTGGGT GGTGACTGGG AGGTCGAAGA GTCCCTCCAC AACAGTTTTG TACTGCCGGA
TGTTGTTTTA GTCAAAAAGA TC
 
Protein sequence
MQAVAPLTAV QIPCCLCGVL TAPNAANQCA SCLAQEFDLK GRLQRGPSGA PFATTYQCRE 
CRRFRRTEKH HEHAGPESPE LLAICLKAIP ALQSTAEPRL HLIDAGWVWT EPHSMRWKVR
LTVRTEIQAV TVQQRVVVEL HNAFRQCNDC NREFTNRTWQ ALVQLRQKRS DDAPKKGLTA
LEMALAKNKE IRKHVLKIDA VRNGFDFYFL SLSYAQAFSA YLQRVGPMRV KTSKKLVSQD
FTNNTANMKY TVVCDLVPFC KDDLVLIKKG AKGKLSGRLA LVTKVSSVVH LMDSSPKREA
LLDSQMELSP DAYYKQEKLY TILQASNRTI PFVVLDVDLL ENSVEKYCLA DVQVARQSDF
GVNDEVFNCV THLGHLIRPG DVVMGYDLVA TVGGDWEVEE SLHNSFVLPD VVLVKKI
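
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or GC content check. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation is a plain G+C fraction over A/C/G/T bases, which may differ from how the database derived its rounded figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of a cleaned DNA string."""
    counted = [base for base in dna if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    return sum(base in "GC" for base in counted) / len(counted)


# First line of the gene sequence above, used only as a short demo input.
first_line = "CACGAACAAA GCAGTATCCC ATATTATTCG TACCGAAATG CAGGCTGTGG CTCCTCTGAC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Pasting the full gene sequence into `clean_sequence` and checking its length against the listed gene length is a quick way to confirm that no blocks were lost when copying.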