Gene PHATRDRAFT_40650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40650 
Symbol 
ID7198573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp67607 
End bp69355 
Gene Length1749 bp 
Protein Length582 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184638 
Protein GI219128897 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGACA ATCTTACATC CCTCCCGACC AACGGCGAGT CGTTGCTACC GACGGACGTT 
CCCACCGATC GTCCCTTGTC CATTGGCGTT TTTGGCGGTT CCTTCAATCC CATCCATCTG
GGACACGTGC TTTTGGCCAT TACTACACAG CAGACCAAAC CGGTGGATCA AGTAGTATTG
GTACCCGTCT ACAAACACGC CGTCAAGCGT GACTTATTGC CCTTCGACGA TCGGGTCCGT
ATGTGCCGAG CCGCCGTCGG ATCCTTCGGT CAGCACAATC GCGCCATTGT GGTATCTACC
GTGGAACGCC GCGTAGGTGC CTCCAACGGA GCCATGCTGC GAGCTCTCCA ACAAGAATAC
CCCGAGGGGA CCCGCTTTTG GTGGATCTGT GGCGACGACT TCTTCCGATG GATGGAGCGA
CCCAAGGGGC TCGAAACACT CGCGCACGTT TCGGGATTGA TCGTCCAGCG ACGTCTCCAC
AAACGCGCCA ACGGACAACT CTTTCAGGAA GATCTAGACG AAGCCCGCGT CCGCGCCAAA
ACCCTACAGC TCGATATCCA TTTAGACTTT ATTTACGGAG AGTTGCCGCA CTTTTCGTCG
ACCCTCGTCC GGCAGGCACC GGGATCCTGG CGCTCCTTTT TGCCGCAAGC AGTCGCGGCC
TACCTGGACG CGCGACCGCA TTTACAGGAA CAGTTGTTGG CCAATCTACA AGCCGACGCC
ACGGCAGAAG CCGCGCAAAC CGTATTGTCC GGCGAACAGC CGGTGACCGC GGCTTCGAAC
ACCGCGTGCT CCACAACCAC CACGTCCGCG GCAGCCTTCA AACAAGCCGG GTTGTGGGTC
ATGCGGGGGC TTGATATGGT ACACATGCTC CAGTACGAAC GTGGCATGAC CGGACTGCGC
CTTTCCACCG GAACCACGCA AAAATACCAA CAAGAAACCC TTGAAGAAGT CCAACGCAAT
ACGGATCGCG TATTGCGAGA AATTCTCGAC GCCCACGCGG AAAGCGACCA GCTCGTACTC
CTCCCCGCCG ACGATGACTG GCCCGAAGTC CAAGCCCTCG CGGCGGAACT CCAGCAAGTC
CCCACTTGGT TGACGCGCGA CCGCGCCACA CTCGCTCGGC GCCAACTAGT TTTGACGGCC
ACACCCGGCG TCGAAGGCTG GGCGGCACGC TATTCCCTCG TGGAAAAGTT CCACGCCCGC
CTCGACCTAC TCACCCAAAG CACCGTCCGC GCGCTGGTAG AAATCCGTGC CAACCTCGCG
GTGGCGCAAG CCCAACCCAC ACCGTCTCGG AGTGTACCGG AACTTTTACG TTCCTGGTGT
CAGGGCAAGG AAGCCCTGGG ACGACTGCGT GCGTTTGTCT GTGCCGGCGG CCCGGACGCC
TCCACCCTGG TGCGCGAATC ACTCGCCACC CGACAACAGC TGGTACGCGT GATGGAAGCC
AAGGATCGGT GCATTGCGCG CGTATTGATT CTGGAAGCTG GTGTCTCGAC CCGATTGGCC
GCCCCCGATG CTTTGCACCG GATGCTGAGC GAAGTGACCA AGGCGGAATG GTCCCTCATG
GGCTGCTGCA GTTACGTGGC CAACCAACCC GGATCCATCC AATTGGTCCA TCAGCAGCTC
GCATCCGGTA GCGCCCCCTC CGACGAACCA TTCTCGGTCC AACACTTTTT TGAAGCCTCT
AGTACGGCCA TTGATTTTTT GTTAACCTTC GCCAAGGCCT TGGCTGCGTC CGCCTGCTCG
ACGTTGTAA
 
Protein sequence
MEDNLTSLPT NGESLLPTDV PTDRPLSIGV FGGSFNPIHL GHVLLAITTQ QTKPVDQVVL 
VPVYKHAVKR DLLPFDDRVR MCRAAVGSFG QHNRAIVVST VERRVGASNG AMLRALQQEY
PEGTRFWWIC GDDFFRWMER PKGLETLAHV SGLIVQRRLH KRANGQLFQE DLDEARVRAK
TLQLDIHLDF IYGELPHFSS TLVRQAPGSW RSFLPQAVAA YLDARPHLQE QLLANLQADA
TAEAAQTVLS GEQPVTAASN TACSTTTTSA AAFKQAGLWV MRGLDMVHML QYERGMTGLR
LSTGTTQKYQ QETLEEVQRN TDRVLREILD AHAESDQLVL LPADDDWPEV QALAAELQQV
PTWLTRDRAT LARRQLVLTA TPGVEGWAAR YSLVEKFHAR LDLLTQSTVR ALVEIRANLA
VAQAQPTPSR SVPELLRSWC QGKEALGRLR AFVCAGGPDA STLVRESLAT RQQLVRVMEA
KDRCIARVLI LEAGVSTRLA APDALHRMLS EVTKAEWSLM GCCSYVANQP GSIQLVHQQL
ASGSAPSDEP FSVQHFFEAS STAIDFLLTF AKALAASACS TL