Gene PHATRDRAFT_50413 details


Gene Information

Locus tag: PHATRDRAFT_50413
Symbol:
ID: 7199220
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011697
Strand:
Start bp: 295377
End bp: 297696
Gene Length: 2320 bp
Protein Length: 555 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002185304
Protein GI: 219130296
COG category:
COG ID:
TIGRFAM ID:
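
The reported gene length matches the coordinates if Start bp and End bp are read as 1-based, inclusive positions on the replicon. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the arithmetic, using a hypothetical helper named `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(295377, 297696)
print(gene_length)  # 2320
```

The 2320 bp span is longer than the 3 x 555 = 1665 bp needed to encode the protein, which is consistent with the gene being marked as spliced.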


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATCGTCCCCG GCCCCTCACG GTCAATGCCT TTTACGCTTA AAAGGGTAAG GCCCAAGGCA 
AGACGCAACT CCTACCTCGC TAGTTGTTAA AACGTCGAAT GCGTGGTGGT GGTGTACCGA
CAATCCCTTG CTAAAGGTTT CGTTCGTTGC TCGAAGCTAC CGGATCGCCA TCCTCTTGGT
TGCGTCGACT CCGAGCCACC AATTCTCAGC ACACAGTCTC TCCGACTCTT ATCCAACACA
CACCTCTGGT GTGGGTGTTG GTGTGTGTTT CTGTGGCTGT ACCTACGAAC GTATTGAATT
ATATATAGTC CGTGCGCTCG CTACCGACAA TCACAATGAC CACAATGACG ACTCAGTCCA
GTGGGATGGG AGTGAGGATA TCACCCTTGT CTCTTTTGAC GCATCTGCTC GTGGGTCTCG
CCGGTCTCCA ACTGGGTCTC GTCATTGGGG GATACACGGT GATCCACACG GGCAGCGACG
CTGCAACTAC CGTCACCACT AGTGGTGATA GTAGTAGTAG TAGTAGTAGT ACTGGTGCCA
ACGCCGACGC CGTTGTTGGT ATCCCGTGTG TGCGAGACAC CACCGACGCT ACGATTCGCG
AATTACGCCT GAAACTAGCG CACGCCCTCG CGGAGGCTTC CGCCGAAACT ACCCTGCCCG
AGACTCTTCA GGGGTTCGTG GCCGGGATGG GACGCGTCGA TCGCGACGAA TTTATCCAGC
GATTCGATGT CGGTGTGCCG TGGGATGCCA AGCAACCCGG CAATAAGGAA GTTCTCCTGC
TGTACAATCA CCCATTGTCC CTGCCCCGTA ATGGAAGCAG CGACGCATCG TCCAACACCA
TTCCTCGCTA CGAATCGATC AACGACGCCG TCCAGAACTG TCACCAAATG AAAATAGTGC
TACAGAAACC GAATCAACCC AACGTCTGTT GGGCCGTCGT GGGACAGTGG GATTCGCACC
ACGTCCAGAA ACTCATGCGA CTGCCGCCAG AACATAGTTT GTCGACCAAA AAGGCTGTCT
CCGAGGGATT CCCGCTCCGC TACGTCTCTC GCAACCATTT GCCCAACGGT CGCCAGACGA
GAGTTCCCAC CGGTACTTCC TACTTCCCCA TCCTACACGA CTATCTCGGC AAACTGGAAG
CCACGCTCGC CCGCCTCAGT CCCGTCGCCG CGCAAGCGGC GGACCCCCAC AACTCTCTCG
TCGTGCTCGT TTGCAACCAC GGACAATCGG AACTCTTGTG GAATTTCGTT TGTGCCGCGC
GATCACGGTC GCTAAATCTC GCGCACGTCC TCGTCTTTGC GACCGACTCG GTGACGTACG
ATCTCGCGGT CGCCATGGGT CTGCACGCAA TGGACGTCCA GAACGCCTTT GGTGACATGC
CCACGGTTGC GGCCAGGCGC TACGGTGACG CCGCCTTTAC CGGAATGATG ATGAGCAAGG
TCTACGTCAT GCATCTACTC ATCACTCTAG GATACAATGT GCTCTTCCAA GATGTCGACG
TGGTGTGGTA CCAGGATCCC CTCGCGTATT TCAATAACGA CCAGTCCGAT TTTGATTTGT
TCTTTCAAGA CGATGGAGCG CATTGTAAGT GTCGCGCATC CTGTATTTTT GTGGATGGTT
GTGCGGAGGG TTGTTGTCGA ACGGCAAGCA CTATGACTTT CACGAGGGCG AAGCGCTCCG
GTGGTCGTTG AATACGGTCT TCCTGTCCGG TTTGGTGTAT TCGAATTTTT GTAGGCCCTC
CACCGTGTGG TCGAACCTTA GCGTTGCTAA CCTTTTCTTT CACACCTATT TTGTTAGCTC
CACGGTATGC TCCGTACAGT CCAAATACGG GCTTATACTA TGTCCGCCAC AACGAGCGTA
CGGAGTTTTT CTTTTCCACG TTGGCCCGCA TGGGTGATTT GATTCAAGCT TCGGGTTCGC
ACCAATCCGC TCTCAACGCG CTTTTAGCCG AACACGCATC CTGGCGCGGA CTCAAGGTTA
AAGTCTTTGG GACCAGCACG GACCAGGGTG AACTTTTTCC CGGTGGGTAC CATTTTCACC
GACGCAAAGG CTTCATGAAA GAAATGTTAG TTACCAAAAG TGTACACCCG TACGTCTTTC
ACATGTCATG GACGATGAAT AGTATCAACA AACAGCGATT CTTTCGACAG ATGGGAGACT
GGTTCGTCCA AGAAGCTTGC ATTGGCAAAA CTCTGGACGA TATTGGGACG GACGATTCGC
CGGACGTTGT ATCTAAATGC TGTGCTGCCG AGGCGCTTGT ATCCTGCCAC TATAGCGACA
AGCCGAGTAT TATACCGTGC AAGGACAGTC CTTCGATTGA
 
Protein sequence
MTTMTTQSSG MGVRISPLSL LTHLLVGLAG LQLGLVIGGY TVIHTGSDAA TTVTTSGDSS 
SSSSSTGANA DAVVGIPCVR DTTDATIREL RLKLAHALAE ASAETTLPET LQGFVAGMGR
VDRDEFIQRF DVGVPWDAKQ PGNKEVLLLY NHPLSLPRNG SSDASSNTIP RYESINDAVQ
NCHQMKIVLQ KPNQPNVCWA VVGQWDSHHV QKLMRLPPEH SLSTKKAVSE GFPLRYVSRN
HLPNGRQTRV PTGTSYFPIL HDYLGKLEAT LARLSPVAAQ AADPHNSLVV LVCNHGQSEL
LWNFVCAARS RSLNLAHVLV FATDSVTYDL AVAMGLHAMD VQNAFGDMPT VAARRYGDAA
FTGMMMSKVY VMHLLITLGY NVLFQDVDVV WYQDPLAYFN NDQSDFDLFF QDDGAHSPRY
APYSPNTGLY YVRHNERTEF FFSTLARMGD LIQASGSHQS ALNALLAEHA SWRGLKVKVF
GTSTDQGELF PGGYHFHRRK GFMKEIINKQ RFFRQMGDWF VQEACIGKTL DDIGTDDSPD
VRQAEYYTVQ GQSFD
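
Both sequences above are printed in space-separated blocks of 10 characters, 60 per line, so the whitespace has to be removed before they can be used. The sketch below shows one way to do that and to compute a GC percentage for a nucleotide sequence. The helper names `join_blocks` and `gc_percent` are hypothetical, and how the record rounds its reported GC content is not stated.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a blocked sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from the record above.
first_line = "ATCGTCCCCG GCCCCTCACG GTCAATGCCT TTTACGCTTA AAAGGGTAAG GCCCAAGGCA"
seq = join_blocks(first_line)
print(len(seq))  # 60
```

Applying `join_blocks` to the full gene sequence and to the protein sequence gives strings whose lengths can be checked against the Gene Length and Protein Length fields.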