Gene PHATRDRAFT_48157 details


Gene Information

Locus tag: PHATRDRAFT_48157
Symbol:
ID: 7203497
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011684
Strand:
Start bp: 377108
End bp: 378965
Gene Length: 1858 bp
Protein Length: 491 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182672
Protein GI: 219124776
COG category:
COG ID:
TIGRFAM ID:
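
The gene length listed above follows from the start and end coordinates if they are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


length = gene_length(377108, 378965)
print(length)  # 1858
```

A 491 aa protein plus a stop codon needs 1476 bp of coding sequence, so the remaining bases in the 1858 bp span would be intronic or untranslated, which fits the record marking the gene as spliced.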


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAAACAAAC CGCACGAACA ACGCAACATA CTCACCTACA CGAATTCAGA ATTGGGAAAC 
GGGCAATAGA GTCGGTTGAT CTGACATATA TCTGATCCTT ACCGTTTGAT ACTCCGCTCA
CACACACTCT ACTGATTAAG TCCAATCTCG GTTGCACCCT AGAATGAGCT ACCGTCTGGC
GCATCTCTCT TCCTCTCGAT TCACTTCCCC GTCGTTGGTA GCACAAAGAC GAGTACGTGG
GCTCGAGGCA GCTAATTCAA GCTGGGCAAC CCATCAATCA GTGAAAGTGT GGTTTTCCTC
GAACTCTGTA TCTTCATTGT CACTTTACAT CCACCGTCCA GTGAATTCAC GATGCACTGA
AGTAATCGTC GCCAGCGCGC TGGCTCTCTT AACTACTAGC GTCTTAGTCT TGGCGTTTCC
CACAACTACT GCTACCTCTG AAGATCCCCA TAACCACACT ATTCCTGCAC CAAACAGTCC
TGCTCAACTC GTTGTCTCTT CAATTTGGTC TGAACCCACA AAGTTGGTAT CGCACGGCGG
CGGAGGTAAC GATGGGGACG GCAAAGGGAC GAACGGTATC TTGGGCTCGC TCGTGTTCGC
CGATGTCGCG CACTCGATCG GCGAGCATTG CCCCTCGAAA TCACCGAACC GGAAGGCTTT
GTCTATATCT GCTGCTGCCA CTAGTGCTAG TACCACCGAA AATCCAGCGA GTTCTATAGA
AACAGCTACC GTAGTACAAT TGGGTGGCAA CTCTGTTACG AGAGGAAGTG TGCGTCAGAA
ACCGTACGAT GTAAGTGAAT TTGCGTCCCA GAGAATGGCT TTTCGTGTAT TCATTTATTC
ACTGAGTTGC TGACCTCTTC GTTATTCTGT GCGGTTGTTT CTTGTCCGGA CCCTGCAGGT
TTCGGTCCGC GCACTTCAAG GAGGCCGCAT GACCATGGAA GACGAATACG TAGTAGCCAA
CGGCGGCCGT TTTGCTGGTG TCTTTGACGG ACACGGGGGT GGAGGAGTCA GCCAGCGGCT
ACGGGTTAAT TTGTACAACA AAACTTGCGC CGCCCTCGCA CGCAAACAAC ACGAATTGAC
CGATGCGAGT TCGGTGCTTT CGCACGTGGC AGCCTTACGG GATGCCTTCG ACGAAATGGA
GCAGGATGTT CTGGAAGATG ATGGCTTGCA ATATCAAGGA AGCACAGCTG TGGTGGTGGT
CGTACATGAA TCGGAAGAAG GAAAGCGAAC TTTGCTGTCG GCTAACGTTG GCGATAGTCG
GGCTATCTTG TCGCGTAATC AAAACGCCGT TGATCTCACG CGAGACCACA AACCAAATGA
TGATCGCGAG AAGGCGCGTA TCTTGGCCAT GGGCGAAACA ATTGAATGGG ATCTTATAAG
CAAGGTGCAT CGAGTCCGAA ATCTGAGTCT TAGTCGAGCG ATCGGCGATC GGTACGCCAA
ACCCATCGTA TCTGGGCAAG TCGAAATTCA ACACTATCCT GTGCAGGAAC AAGACGATGA
ATTCTTTTTA CTCGCTTCGG ATGGGTTGTG GGATGTCATG ACGAGTCAGG ATGTTATCTC
TTATGTGCAT AGACAGATGG AACAGGAATT AGATCGAGAG AGCTTACACA AGGATGATCG
CGAGAACTAC AAACTGGTAC TCCGGAGGAA TATGGCGAAG TTCGTCGCCC GCGAAGCGAT
GCGCCGTGGA TCAGCCGACA ATGTTTGCGT TCTCATGGTG TGGCTGAATG ATATGGGGTT
GCGATGAATT TAGATTGTGA ACCTCCTTAT TTATGCACTT GCATGTAAAC TGAGTGAGTG
CTGCTATGCT TTTGCGATCA CTAGTAATAA TATTCGATGT ATACTAAAGA GAGCGATC
 
Protein sequence
MSYRLAHLSS SRFTSPSLVA QRRVRGLEAA NSSWATHQSV KVWFSSNSVS SLSLYIHRPV 
NSRCTEVIVA SALALLTTSV LVLAFPTTTA TSEDPHNHTI PAPNSPAQLV VSSIWSEPTK
LVSHGGGGND GDGKGTNGIL GSLVFADVAH SIGEHCPSKS PNRKALSISA AATSASTTEN
PASSIETATV VQLGGNSVTR GSVRQKPYDV SVRALQGGRM TMEDEYVVAN GGRFAGVFDG
HGGGGVSQRL RVNLYNKTCA ALARKQHELT DASSVLSHVA ALRDAFDEME QDVLEDDGLQ
YQGSTAVVVV VHESEEGKRT LLSANVGDSR AILSRNQNAV DLTRDHKPND DREKARILAM
GETIEWDLIS KVHRVRNLSL SRAIGDRYAK PIVSGQVEIQ HYPVQEQDDE FFLLASDGLW
DVMTSQDVIS YVHRQMEQEL DRESLHKDDR ENYKLVLRRN MAKFVAREAM RRGSADNVCV
LMVWLNDMGL R
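
The sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be stripped before they can be measured or passed to other tools. The following is a minimal sketch of that cleanup plus a simple GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs and newlines alike.
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# The final line of the protein listing holds one full block plus one residue.
print(len(clean_sequence("LMVWLNDMGL R")))  # 11
# Toy DNA input, not taken from the record.
print(gc_fraction("GGCC AATT"))  # 0.5
```

Applied to the full listings, `len(clean_sequence(...))` can be compared against the Gene Length and Protein Length fields as a quick consistency check.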