Gene PHATRDRAFT_31492 details

Gene Information

Locus tag: PHATRDRAFT_31492
Symbol:
ID: 7196677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 206169
End bp: 207809
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002176545
Protein GI: 219109581
COG category:
COG ID:
TIGRFAM ID:
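
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 206169
END_BP = 207809


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1641
print(protein_length(length_bp))  # 546
```

Both results agree with the Gene Length (1641 bp) and Protein Length (546 aa) fields.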


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTTAG TGGCGCAACA GCTGGAGACG CACAAGCGAC TGCAAGCCGC GCGACAGGAA 
CTATGGAGAC TATATCTGCC AAACTTTCTG TGTCCCTACC GCGAGTTCGT ATTAGACCCT
AGCCTTCAGT TCTGGACTAT CGTTGTTACT CTCGCTTTAC CCAAACGCTG GGATTTTTCT
ATCTCTCGAA TTTCACAGAC GTTGTTCATA CGTCCAACTC TTGCTACTTT GCGATTCTTC
TCCAAAGACC CTGATGGCGA AGATCTTCTG GTCACCGCTG GCAAAGAGCT GCTGCACCCA
GTTGCGCCCC GTCCACTGCA TTTTCAGCCT CCCTCCGACA TACAGTCGGC TCTGTCGGGT
GTTGCCGCGT TTGTCCTGAG TGTCGTCGCG GACGTGATTG GTGCATTTAT GGATCCCGTT
AAAATGAAAA AATGGTTTTC CGCAATGTCG GCCTTCCGGG CGTACTTACA AGCATCCGGT
GTAGGTGCAG AATTGGAGGA ATCCCTCATC AAACCTCTTT GGAGAGGGCG GCTTCTAGAC
AACCTAAAGA TTCTCAATGA CTGCCAAGAA ATCTTAGACG AGGACCGTAC GAAACTTGCG
AATGACCTTG ATTCAGAGGT AAGCTCGGAA GACCTCGTCG TCGGTTGCAG CATGATGCGT
TTCGCCACCG CCGCGTATGG TGTTGAAATG GTTCGCTCGG CAATTGATCG CGAAGCGAAT
TACGAACATG TCAACAGTGA GCGAAAGGCC ATTGCATTTC ACTGCAATAT CCCAACCGAG
GACGTCAAGT ATATTTATAT CCAGCCAGGA GACGAAATGC ACACGATGCG TCATTTCATT
GCGGTCGATG AAAAGACCAA ATCCGTCGTT CTCGCCATCC GGGGAACGTT ATCAATTTCC
GGTGCCTTGG CGGACATGCA AGCTATGGAT TTTGATTTTT GTGGCGGCAA GGCACACATG
GGTATAGCGG AACAAGCCAA TTTACTTTGG CAGAAAACAG GACAACGCCT CCGCAGGATC
GCTTCCGCAT ACTCGGAAGA ATACCGAATC ATTTTTACGG GACATTCGCT TGGAGGAGGT
GCCGCGTGCC TATTGCACGT GAAAGTGCAC ACAGAGAATC TGTTGCCGAC GAGACAGGTC
TACTGCTACG GCTTTGCACC CCCACCAACA TATTGCAAGG GTAGCACTCC TTCGCCAGGT
CTGGAAATGG CCGTCAAGAA CTGTGTATGC TTTGTGCACG ATAACGACTG TGTTCCACTT
CTGAGTGTGG CATCCATCCG TCGTCTGGCT TGCCTTATGG ATGCGGTTGA CAATTGCACG
GAAAATCTCT GGTTCACGAC ACGTTTCCGA ATCTTTTGGG AGTTTGTCAA GGTCCCTGGC
GATATCGTCA AAACTGTCTG CAGCGTTAAG CATGACTCGA AGGCAGTCGT TGGTGAGTCA
GCCATGGTCA TCCCAGCCCG TTGCATTGTT TGGATGAAGA AGACCTTAAG TGGACGTTTT
GAGGCCCTTG CGTGTAGTTC AAAAGCCATG GCCTCCATGA ATATCTTTGT CTGCCAAGAT
ATGATTGCTG ATCACATGCC AGAGCAATAC GAGGATGCCC TAGACAGTCT TGTAGCTAGA
AGGTTTCAAG AGCAACTGTA G
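
The gene sequence is printed in blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that clean-up step together with a GC percentage calculation of the kind summarised in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGTCGTTAG TGGCGCAACA GCTGGAGACG CACAAGCGAC TGCAAGCCGC GCGACAGGAA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full 1641 bp sequence to the same two functions gives the whole-gene value.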
 
Protein sequence
MSLVAQQLET HKRLQAARQE LWRLYLPNFL CPYREFVLDP SLQFWTIVVT LALPKRWDFS 
ISRISQTLFI RPTLATLRFF SKDPDGEDLL VTAGKELLHP VAPRPLHFQP PSDIQSALSG
VAAFVLSVVA DVIGAFMDPV KMKKWFSAMS AFRAYLQASG VGAELEESLI KPLWRGRLLD
NLKILNDCQE ILDEDRTKLA NDLDSEVSSE DLVVGCSMMR FATAAYGVEM VRSAIDREAN
YEHVNSERKA IAFHCNIPTE DVKYIYIQPG DEMHTMRHFI AVDEKTKSVV LAIRGTLSIS
GALADMQAMD FDFCGGKAHM GIAEQANLLW QKTGQRLRRI ASAYSEEYRI IFTGHSLGGG
AACLLHVKVH TENLLPTRQV YCYGFAPPPT YCKGSTPSPG LEMAVKNCVC FVHDNDCVPL
LSVASIRRLA CLMDAVDNCT ENLWFTTRFR IFWEFVKVPG DIVKTVCSVK HDSKAVVGES
AMVIPARCIV WMKKTLSGRF EALACSSKAM ASMNIFVCQD MIADHMPEQY EDALDSLVAR
RFQEQL
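
The protein sequence can be checked against the gene sequence by translating codon by codon. The Translation table field of this record is blank, so the sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, not part of any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption here.
# Amino acids are listed in codon order TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTCGTTAG TGGCGCAACA GCTGGAGACG"))  # MSLVAQQLET
print(translate("AGGTTTCAAG AGCAACTGTA G"))           # RFQEQL
```

Under that assumption, the two fragments reproduce the first ten and last six residues of the listed protein sequence, and the final codon TAG is read as a stop.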