Gene PHATRDRAFT_48580 details

Gene Information

Locus tag: PHATRDRAFT_48580
Symbol:
ID: 7194740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 260238
End bp: 261668
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002183060
Protein GI: 219125592
COG category:
COG ID:
TIGRFAM ID:
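
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record does end in TAG). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(260238, 261668)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Both values agree with the Gene Length and Protein Length fields of the record.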


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000300421
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGCTG CGTCCGCCCT CAAATGCGCA TCCTATGAGG GAAGGAGTGG CGTAGCTGCC 
GCGGTCCGGC TGGTGTCGTC AATGCGTACT CGCAACTCCG AGACGATCGG TGCATTGTCG
CCGTACTGCT ACGATCACAC CCACTTGCCA TTGTCTACAA CAAGGGCTCA GGCTTTCTCG
GCTGCTTCCG CCGTGAGAGA CTTTGCAGAT CCGCAGGGGG CGGCCCGCAA ACACCGTCCG
ATTTACATCG CGGCGACGAG GCAGCACGTG GGCAAGACCT CGGTCAGTCT CGCACTGATG
AAAGGATTGC AGCGCCGTGT TCCCAAAGTA GGTTTTCTAA AACCCGTTGG ACAGCATTCC
GTACGAATTG CCGAGCTCGA CGGCAGTGTC GTGACGGTCG ACAAGGACAC CGCTTTGATC
GTTCAACACT TTGGACTTAC ACGGCATCAG ACATTGCAAG ATGCCAGTCC AGTCCTCATT
CCGCCGGGGT ACACAAAAGA TTACGTTGAC GGGAAAATTA CGCTCGATAC TCAACGTGCC
TCCATCGGAA AAAGTTTTCA ACGCGTCGCT TCCTTTGCCG ATATTGTCCT CTGCGAAGGA
ACCGGACATT GCGCCGTCGG TAGCATCGTA GACGCCAGTA ACGCCGCCGT CGCGTCTTGG
CTCGGCGCCC GGATGGTCCT TGTGGCCAAC GGTGGTCTGG GGAATTCTGT GGACGAACTT
GAACTCAATA AGGCCTTGTG CGACAAACAT GGGGTTGAGA TTGCCGGCGT TATCATCAAC
AAGGTCTTGC CCGAAAAGTA CGAACAGACA AAATACTATC TGGAGAAAGC ATTGCACGAT
CGGTGGGGTA TTCCCTTGCT GGGATGCGTT CCGGATCGGG CGTTTTTAGG ATGTCCCGCC
TTGGCCGATC TAGAGCGTCT CTTCCCCGGC GCGATGTTAG TTTCTGGGCT CGATCATCGA
CTGCGACATT ATACGGTGCA AGATTTGAAC CTCGTCGCCA CATCGCTCGA AGTCTTTTTG
CGCAATCTGC GAACCGATCC CTCCCGCACG CTTTATGTTT GTCACGCTTC CCGAAACGAT
ATCTTGCTCG GCTTCCTTAT GGAAAGTCAG CAACGGCCGG ACTGGGAAGC CGCGTTAGTT
GTCACAGGCT GTCACGATTA TCCCGTCAGT GACCAGGTTT TGCAAATCAT CACTTCCATG
CCTTCGGCCC CACCGGTGCT CTTGGCATCG CCACCGACGC GACAAGTCAT GCACGATATA
CACCACTTTA CCCCAAAATT GAATTTTGAG GATGGACACC GCGTCGAAGC CGCTGCCGCT
CACTACGAAC CCTACATTGA CTTCGATCTT CTTTTGTCGA GGGTCGGAAC GACGTCTACT
GGCTCCTCGA AATCTACGTC GAAAGCCGGC CTTGCAGTCG CAGTGCCGTA G
 
Protein sequence
MIAASALKCA SYEGRSGVAA AVRLVSSMRT RNSETIGALS PYCYDHTHLP LSTTRAQAFS 
AASAVRDFAD PQGAARKHRP IYIAATRQHV GKTSVSLALM KGLQRRVPKV GFLKPVGQHS
VRIAELDGSV VTVDKDTALI VQHFGLTRHQ TLQDASPVLI PPGYTKDYVD GKITLDTQRA
SIGKSFQRVA SFADIVLCEG TGHCAVGSIV DASNAAVASW LGARMVLVAN GGLGNSVDEL
ELNKALCDKH GVEIAGVIIN KVLPEKYEQT KYYLEKALHD RWGIPLLGCV PDRAFLGCPA
LADLERLFPG AMLVSGLDHR LRHYTVQDLN LVATSLEVFL RNLRTDPSRT LYVCHASRND
ILLGFLMESQ QRPDWEAALV VTGCHDYPVS DQVLQIITSM PSAPPVLLAS PPTRQVMHDI
HHFTPKLNFE DGHRVEAAAA HYEPYIDFDL LLSRVGTTST GSSKSTSKAG LAVAVP
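
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions: the Translation table field in this record is empty, so the standard genetic code is assumed here, and the helper names (`clean`, `translate`, `gc_content`) are hypothetical. Only the first 60 bases of the gene are included as sample input; the full sequence can be pasted in the same way, since whitespace is stripped.

```python
from itertools import product

# Standard genetic code (an assumption: the record lists no translation table).
# Codons are enumerated in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record.
first_60 = "ATGATTGCTG CGTCCGCCCT CAAATGCGCA TCCTATGAGG GAAGGAGTGG CGTAGCTGCC"
print(translate(first_60))  # MIAASALKCASYEGRSGVAA
```

The output matches the first 20 residues of the protein sequence listed above. Calling `gc_content` on the full gene sequence gives a value that can be compared against the GC content field.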