Gene PHATRDRAFT_49618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49618 
Symbol 
ID7198260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp216490 
End bp219596 
Gene Length3107 bp 
Protein Length958 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184422 
Protein GI219128442 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTTATCGTCC TAGAGTTACT GATTCAATCA CAATCGTACG ATCGTACGCA TACACAAAGC 
TCACCATGAG AGTGACAGCT GCAGTAGCCG GGTCCTTGTG TTTGCTGGGG ACGTCGGTGG
AATCCTTTGT CTTCCCGGCA CGCAAGAGTC CAAGGTTGTC CTTCCCTGCG GCAAATCCAG
TCGCCTCCTC CTTTCGCAAT GCGGAAGCTA TAGTCAGTTT GCACGAGAAA AAGAACGACA
TAGATATCGA CGATGTTACG AAAGAAGCGG AAGAAGCTTT GGCGGCGGCT GAAGCAGCGT
TAGGAGGTGG ACTTGGAAAC AACAAGCAGA CGGTAAATCC ATCCCCGAGC ACACTGTCCG
AAAGCACGCC ATTGCCATCT CCAAATAACA AAACGACGCC TTCATCTTCG CCTTCAAAAA
CGCCTCCAGC ATCTTTAAAA AAAACTCTGC CGCAGCCTTC TTCCAAAATA GCGCCTCCGC
CCTCGCCTTC TCCGCAAACA AAGCCCCCTC CCACTCCCCC TAGTCCTCGA GAGTTGGCTG
AACAAGAGGC TCGTCGTAAA AAAGCCCAAT TCTACCGCAA AGACGCTATT GCTGCGGCTG
TTGGTGCGGG ATTGCTAGGT GCAGCAGGAG GAGGCCTACT CTGGATGGAG TTTCCAGAAT
TCGGTATCGC CTTACGAGAC GCTGTCAGCG CCGACATTCC CATGTACGTG CCTCCTCTTA
TCGGCGCTGC CGTTTTGGGT GGAAGTGCAT TTAACAGTGC TTCGCAAGAT AATGCTGTGG
GCACCCTTAC CCGTGGAGTA TTTGGCAGTA CCACCAAAGC GATCGGCGGA GGGATTGTGG
GAGCAGTTAC TGGTGTCACC GGCTCTGTGA TCTCAGGCAT TGTAGCCATT CCTAAAAGAG
TGGTTAACGC GGCCGCTCAG AAAGTTCGGG AGACTACAGA CGGAATCAAG GCCATTCCGT
CAAGGGTTCA GGATGCTGCG GCCCGCAAGG TCCAGAGAAC AGCCGAGGAT ATTCGAGCCA
TTCCCACCAA AGTTTCTTCG GCGGCAAGCA GAGCCGCAGA AGAAGCCGCT CGGGAGATCA
AGGCAGCCCC CGGTCGGGTC GCAAAATCAG CGGAAGAAGC ACTGGAAAGA TCAGTGGAAG
AGACCAAGAA GAATATTAAC AGAATTGGAG AAGATATAAA AGCCTCTCCC TCAAAGGCTG
TGGATGCCTT CGAAGCAAAG GTGACGGAAG TCTTCGAAGC AAAGCCCAAA GAACCGCAGG
CACCTTTGCG TCCTCCTCCA CCTCTAGTCA GGAACGATGC CAAGTCTTCG TTCCCAAACC
TCCAGATTGA CCTGCCCAAG CTTGAGGTGC CGAAGATTCG CGTACCCAAT CTCGATGCGC
TCAAGATTGA CACGCCGAGT ATTGAGATGC CTAAGACTCC AGTGCCAAAG AAAGAGCGGG
ATGATGGCTT TGTTTTTGGT GAGGTTGGAC TCAAGAACTT TATTAACGCA CCGGTATCAA
AGGATTCCGC CCCGTCCAAA CCAGCTACGG TTCCCGAGAA GTCGCAGCAA CAGCGGGCAG
CTGCACAAAA GCGCGACCGC GCTCAAGCGA AAGTAGCATC CGAAAAGCAA CGTCGAGAAC
AAGCTAAAGT GGTGGCGGCA GAAAAGCAGC GCCAGGAGCA AGCCAAAGCG GCATCGGCCG
AAAAACAGCG TCAGGAACAA GCCAAAGCGG CATCTGTTGA AAAACAGCGA CAGGAACAAG
CCACTCGACG TGCTAAACAA GAGAAGGCTC GTCAAGAGCG GCAAAAGCGT CTTGAAGATA
TAGAAGCAAC TAAAAAGGCC AAGTTGGACC AGCGAATGCG CGAAACAGCA AAGCAGACGC
AGATTGCCGA ACAGAAGCGT TTAGAGGCGG AACGCCGAAA GAACGCATCA GGTTCGAAGC
CACGTCCTAG CTTTCAAATT CCCCGTCCCA GTTTCCGTAT TTCTCCCGAC TCTGACGCTA
GCAAACCTCG TCCCTCATTT TCGCTAGGTG GAATGCCCAA GCAGCAGAAG AGTCCCTCAG
AAAGCAAACC CCGTCCGTCC TTTCAATTGA ACCTCGGAGG GGGTAGCAGC AAAGCTGGAG
AGTCTGGTGT CAAACCTCGC CCTTCCTTCC AATTGAACCT TGGAGGCGGC AGCAGCAAAG
TCGAAAATTC CGATAGCAAA CCTCGTCCCT CTTTTCCACT AAATCTCGGA GGTCGCGGAA
GTGATGCAAA TGTTGTCAGT AAGCCTCGTC CTAGCTTTTC CCTCGGTGGA GGCGGCGGGA
CCCAATCAGG AAAGAAAGCT CCCCGCGGTG TCCCCACTAT TGTGCGCTGG AAACAACGCC
GAGATGGCGG TATAACCGGC TTCATCTACG GTTCTCCAAA TTTCGACGAT GGAGATCGGG
TGGAGACTAC AGCAATCGCA ACAGGAGATG TTGCTAATGG TGGCGTCGTT AAGACGGGAA
GTGGTTCTAG ATACTTCTTA AGTGAGACCC CTCCAATGGG AGGGAAAGCG AAAGGCGGCA
ACGATGCCGG CGCTTTGAAA TCCCTGCTCA GTGCTATTCC AGGAGCAACC ATCAATCTCT
CCCGAAGCAG GAAGACTCCA GCGGAAGTAA AGGCTGAGGA GACTTTGAAG AAGGCGGAAG
CGGCACGACC GAGAACATTT TCACTATTCG GTTTGGGTGG AGACGGGGCA GACTCCAGAC
CACCTTCCCA GAAGGAAAAG GGCTCTGGCG GAGGCAAAAA GCCCGCAGTG ACGGCACCCC
GCGGAGTTCC TACTTTGAAT AGATGGAAGA AGAATAGAGA TGGATCCGTC ACTGGTTTTA
TTACGGGATC ACCCAACTTT TCTGAAAATG AGAAAGTTAC AACATCCCCG ATTACGCAGG
GCACCGTGAA GTCCACTGAA ACTGTGAAGA CTGGAAGCGG CTCTCGATAT TTTCTTGCAT
GAATGAGTTG AAAATCTCTG AGTTCATCAA AACAACCTGT ACTATGTGTT GGAGTCCTTT
GATAGCATTG ACGATAAATT AATAGGATGA AGCAAGGGCA AATCCATTGA ATGCATAGCT
TCTACAAACG CATCCAAGCT TCAAAGCATG CATTTTAACT AAGAAAT
 
Protein sequence
MRVTAAVAGS LCLLGTSVES FVFPARKSPR LSFPAANPVA SSFRNAEAIV SLHEKKNDID 
IDDVTKEAEE ALAAAEAALG GGLGNNKQTV NPSPSTLSES TPLPSPNNKT TPSSSPSKTP
PASLKKTLPQ PSSKIAPPPS PSPQTKPPPT PPSPRELAEQ EARRKKAQFY RKDAIAAAVG
AGLLGAAGGG LLWMEFPEFG IALRDAVSAD IPMYVPPLIG AAVLGGSAFN SASQDNAVGT
LTRGVFGSTT KAIGGGIVGA VTGVTGSVIS GIVAIPKRVV NAAAQKVRET TDGIKAIPSR
VQDAAARKVQ RTAEDIRAIP TKVSSAASRA AEEAAREIKA APGRVAKSAE EALERSVEET
KKNINRIGED IKASPSKAVD AFEAKVTEVF EAKPKEPQAP LRPPPPLVRN DAKSSFPNLQ
IDLPKLEVPK IRVPNLDALK IDTPSIEMPK TPVPKKERDD GFVFGEVGLK NFINAPVSKD
SAPSKPATVP EKSQQQRAAA QKRDRAQAKV ASEKQRREQA KVVAAEKQRQ EQAKAASAEK
QRQEQAKAAS VEKQRQEQAT RRAKQEKARQ ERQKRLEDIE ATKKAKLDQR MRETAKQTQI
AEQKRLEAER RKNASGSKPR PSFQIPRPSF RISPDSDASK PRPSFSLGGM PKQQKSPSES
KPRPSFQLNL GGGSSKAGES GVKPRPSFQL NLGGGSSKVE NSDSKPRPSF PLNLGGRGSD
ANVVSKPRPS FSLGGGGGTQ SGKKAPRGVP TIVRWKQRRD GGITGFIYGS PNFDDGDRVE
TTAIATGDVA NGGVVKTGSG SRYFLSETPP MGGKAKGGND AGALKSLLSA IPGATINLSR
SRKTPAEVKA EETLKKAEAA RPRTFSLFGL GGDGADSRPP SQKEKGSGGG KKPAVTAPRG
VPTLNRWKKN RDGSVTGFIT GSPNFSENEK VTTSPITQGT VKSTETVKTG SGSRYFLA