Gene PHATRDRAFT_49902 details


Gene Information

Locus tag: PHATRDRAFT_49902
Symbol:
ID: 7198527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 247975
End bp: 249836
Gene Length: 1862 bp
Protein Length: 593 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002184758
Protein GI: 219129148
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGATG TTTTGGTTCC GCAGCCGGTA GATATTGACA TTGGTGTATC CAACATCGTC 
GACGACGACG AAGAAGGGCA ACAACATTCC GGGACGCATT CCTGCGGCAG CTGCGTGTTG
GCCTGGAACG GGGAAGTCTT TCGCGTCGTC ACCCAAACGC CACTCGGAAC GGACACTGAC
GACCTATCGG GGTCACACCA TTCCGACACT ACGCTCGTGG CCGACTGGAT ACGTCAGGAA
TTGACGCAGA CTGTCTCTAT CTGCCGTGAA GGTACCACCA TACGCAGCGT ACAACTCCAA
CAACAAATGG CTCTGGCACG CGTCCTGGAA CGTTTGGTGG ACGCTGATTT TGCCGTGACG
CTCGTCACAC CACACTCCGT ATTCTACGCA CGTGATCGTT TCGGCAAACG TTCCCTCCTA
GTACAAGACG GTCTATCGAC AGTCACGGCT GCCGGGACCG CTACCGAGAC CAGTCGTTCC
TGGAAATTGT CCTCCGTCAC TGACGGGACG GAATCGGCAG TGTGGACCGA AGTCGCACCG
GAAATGGTAC ATTGCTACTG CGTGGCTACG AAACAGTGTC TGCCCCCGCT GTCCTACCAT
CAACGCGAGA CTATGGATGT TGCACCGGCG CAAGTACTCC ACCAACCCTT GCAGGATGAG
TACGACAAAG ACAATATCCG AAGAGCGTGG ACTACTCGGC GCATCAATGA CCAATCCTTC
TCGTCGGAAC CTTGTACGGC TACATCTTTT GCTCCCTTAC CTTCACAGCC TACAAAGCTG
GAGACAGCCG TCGCCACGTT GTACGTTTTA TTGCGCGACG CGGTACGCCG GCGTGTCACG
GGTCCACGAG TGGCTGTCTT GTTTTCCGGT GGCTTGGATT CCGTCGTGCT GGCCGCCTTG
GCTTTGGAAA TTCTGTTGGA ACGCTATGAA CACACCCACG AACTCGTTTT GTGCAATGTT
TCGTTTGTAG AAGATGCCGC GCCGGGCGCT ACAGACTTTG CACGGACGGA TGCCTCAGCG
CTGCCGAGCC GTACCGACGC ACCAATACCA CCGCAAGCAG CCGATACTCG CGCAGCGATG
GTGTCGTACC GGGAGCTGGA ACGCCTGTTT CCCCAAGCGC GTATCTGCTT TTTGGCCAAA
CAGGCGACGT GGGGGGACAT TGTCCGCAAC GAAGCACACG TACGTCAATT GGTGCATCCC
CAGACGACGA CCATGGATTT GAACATTGGA ATGGCTCTAT GGTTTGCGTC TTTGCAATCC
ACGGATAGCA AGTCCCAGCA CGTTTCCGCG AAAAAAGTAC CGTTGGGTGA CGACTGTCGT
GTGTTGCTAT CTGGTTTGGG TGCCGACGAA CTCATGGGTG GCTACGGTCG TCACCGCCAG
GCTTGGAAGG ATGGTGGCAA CGAGCAACTA CGTCGGGAGC TGGATTTGGA TTTGACCCGA
CTGTGGTACC GCAATTTGGG ACGCGATGAT CGAGTGTTGA GTGATACCGG CCGAGAAGCG
CGCTTCCCGT ATTTGGATAC GGCCGTTGTG CAGTTCCTGT CGAGGTTGGA CTTGGACGTG
GTGTGTGACT TTAAGCGGCC ACCGGGAGAA GGAGATAAAC GCATTCTACG GGTTCTGGCG
GCGCAGATGT TGGGCCTGGA GGCGGCGAGT ACGGCCGTCA AGCGAGCGAT TCAATTCGGG
AGCCGGATTG CGCACGTGAG TGACAAACGA CGATTCGGTT CGCGGCGGCG AGCCTCCGGA
ACGGCCCGCG CCATCCACCA GACCGCGGTT ACCGATTTCT GAGCACCAGA TATTTCCCGA
GACTTGTGTA CAAAGTTGTT GACTGTTGAT GGGAGTACCG TAATCACCTT ACTACTATTC
TT
 
Protein sequence
MRDVLVPQPV DIDIGVSNIV DDDEEGQQHS GTHSCGSCVL AWNGEVFRVV TQTPLGTDTD 
DLSGSHHSDT TLVADWIRQE LTQTVSICRE GTTIRSVQLQ QQMALARVLE RLVDADFAVT
LVTPHSVFYA RDRFGKRSLL VQDGLSTVTA AGTATETSRS WKLSSVTDGT ESAVWTEVAP
EMVHCYCVAT KQCLPPLSYH QRETMDVAPA QVLHQPLQDE YDKDNIRRAW TTRRINDQSF
SSEPCTATSF APLPSQPTKL ETAVATLYVL LRDAVRRRVT GPRVAVLFSG GLDSVVLAAL
ALEILLERYE HTHELVLCNV SFVEDAAPGA TDFARTDASA LPSRTDAPIP PQAADTRAAM
VSYRELERLF PQARICFLAK QATWGDIVRN EAHVRQLVHP QTTTMDLNIG MALWFASLQS
TDSKSQHVSA KKVPLGDDCR VLLSGLGADE LMGGYGRHRQ AWKDGGNEQL RRELDLDLTR
LWYRNLGRDD RVLSDTGREA RFPYLDTAVV QFLSRLDLDV VCDFKRPPGE GDKRILRVLA
AQMLGLEAAS TAVKRAIQFG SRIAHVSDKR RFGSRRRASG TARAIHQTAV TDF
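
The length and GC content fields of this record can be cross-checked against its coordinates and sequence. The following is a minimal Python sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the sequence is pasted as plain text in the space-separated 10-character groups shown above. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues; uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields of this record.
    print(gene_length(247975, 249836))  # prints 1862
```

To check the record itself, pass the full text under "Gene sequence" through `clean_sequence` and compare `len(...)` with the Gene Length field, and `gc_percent(...)` with the GC content field.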