Gene PHATRDRAFT_51970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_51970 
Symbol 
ID7201046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp842786 
End bp844453 
Gene Length1668 bp 
Protein Length555 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180331 
Protein GI219119129 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0997978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTCAC GACTTCCCGT TACGGGCGTC GAAGTCGTTC GCGATACTAT TCGATATTCT 
GGGGTATCCA CTCCGATTTC CGAACGGGAA GGCTTGGGGA TTCGAGGCTT GGTACCCGCT
GCCTTTCTCC CACTCGAACT CGACGTCGAA CGATGCATGT TGCAAATGCG GTCCAAAGAA
TCGCCACTGG AAAAGTACAT TTATCTGCAC AACATTCAGG ATGTCTCGGA ACGTCTCTTT
TATGCTATTC TTTGCAAGTA CACGTCCGAA GTCATGCCGT TGGTCTACAC ACCCACCGTC
GGTGAAGCAT GTCAGAACTT TTCGGCCATT TATCGGGGTA CGCTTCGCGG CATGTACTTT
TCGCTAGAAG ATTCGGGCAA GATTCGTACA CTCCTCGACA ATTGGTTTAC CTCCAAGATT
ACTACAATTG TTGTCACGGA TGGTGAACGG ATTCTAGGAT TGGGTGATCT CGGTGTCAAC
GGTATGGGTA TTCCCATTGG GAAATTGGCC CTGTACACGG CCTGCGGTGG CATCGATCCG
GCCAAGGTCT TGCCGGTACA CATTGACGTG GGCACCAACA ACGAGGAAAA TCTGAACGAT
CCGTACTACC TCGGTCTTCG ACGGCCTCGG GAGCGAGGGC AAGCCTACGA TGATTTGATT
GCCGAGTTCT TTGAAGCTGC TCAGAACAAA TTCGGAGCCA ATGTGATGAT TCAATTCGAG
GACTTTGGTA ACTTGAACGC CTTCCGGCTA CTAAGTGCGT GGCAAGACAA GGCCTGCACT
TTCAATGACG ATATTCAAGG AACGGCAGCC GTGGCTCTGG CCGGTTTGCT TGCTTCCAAC
CGACTCACTG GCAAAGACTT GATTGATCAC ATTTTTTTGT TTGCCGGCGC GGGGGAAGCC
GGGACCGGTA TTGCTGAACT ATTGGCGCTC GCCATTGCCG AGAAGGGCCA CTTACCAATT
GAACAAGCTC GGAAGAAGAT CTTTCTCGTC GATTCGAAGG GTTTGGTGAC CAAATCGCGT
TTGGATAGCC TACAGCACCA CAAGGTCGAT TTTGCGCACG ATGTGGACGA CTGCCCAAAC
TTGTTAGCAG CAATCGACAT GCTCAAGCCT ACCGGATTGA TCGGTGTATC CGCCATTCCG
AATTCGTTTA CGAAAGAAAT TTGCGAAAAC ATGGCTGCCC ACAACAAAAT TCCGGTCATT
TTTGCGTTGA GCAATCCTAC GTCCAAGGCG GAATGCACGG CGCAAGAAGC CTATGAATGG
ACCGATGGGC GTGCAATTTT TTGCAGCGGC AGTCCATTTG ATCCGGTGAC GTTGCAGGAT
GGGCGCCAAC GTGTCCCGGG GCAGGGCAAC AACGCATACA TTTTCCCGGG CATTGGGCTT
GGCGTATTGG CGGCCGGATC TACTCGCATT ACAAATTACG ATATGCTGTT GGCAGCGGAA
ACGTTGGCGG CGGAAGTAGG TCCCGAAGAG TTGGACGTCG GTTGCATGTA TCCTCCACTG
TCTCGGATCA GACAGGTTTC GAAAAACATT GCAATCGCCG TTGCCAATCA GGCGCACGAA
ACGGGAGTAG CAACCGAGCA GAGACCGGTG GATATGGGAA AGTACGTGGA ATCACTCATG
TACGATCCAT TTGAGGAGGT TGACGTTCAC TTGGGATCCA AGAAGTAG
 
Protein sequence
MVSRLPVTGV EVVRDTIRYS GVSTPISERE GLGIRGLVPA AFLPLELDVE RCMLQMRSKE 
SPLEKYIYLH NIQDVSERLF YAILCKYTSE VMPLVYTPTV GEACQNFSAI YRGTLRGMYF
SLEDSGKIRT LLDNWFTSKI TTIVVTDGER ILGLGDLGVN GMGIPIGKLA LYTACGGIDP
AKVLPVHIDV GTNNEENLND PYYLGLRRPR ERGQAYDDLI AEFFEAAQNK FGANVMIQFE
DFGNLNAFRL LSAWQDKACT FNDDIQGTAA VALAGLLASN RLTGKDLIDH IFLFAGAGEA
GTGIAELLAL AIAEKGHLPI EQARKKIFLV DSKGLVTKSR LDSLQHHKVD FAHDVDDCPN
LLAAIDMLKP TGLIGVSAIP NSFTKEICEN MAAHNKIPVI FALSNPTSKA ECTAQEAYEW
TDGRAIFCSG SPFDPVTLQD GRQRVPGQGN NAYIFPGIGL GVLAAGSTRI TNYDMLLAAE
TLAAEVGPEE LDVGCMYPPL SRIRQVSKNI AIAVANQAHE TGVATEQRPV DMGKYVESLM
YDPFEEVDVH LGSKK