Gene OSTLU_49558 details

Gene Information

Locus tag: OSTLU_49558
Symbol:
ID: 5001925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 805624
End bp: 806816
Gene Length: 1193 bp
Protein Length: 376 aa
Translation table:
GC content: 60%
IMG OID: 640417346
Product: DMT family transporter: UDP-glucuronic acid/UDP-N-acetylgalactosamine
Protein accession: XP_001417874
Protein GI: 145346808
COG category:
  [G] Carbohydrate transport and metabolism
  [O] Posttranslational modification, protein turnover, chaperones
  [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5070] Nucleotide-sugar transporter
TIGRFAM ID:
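
The listed gene length is consistent with reading the start and end positions as inclusive 1-based coordinates. The Python sketch below reproduces that arithmetic. The helper names gene_length and coding_length are hypothetical and not part of the record. The second helper assumes one stop codon after the last amino acid, which gives 1131 bp for a 376 aa protein and would leave 62 bp of the 1193 bp gene sequence outside the coding region.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode protein_aa residues (assumes a single stop codon)."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values taken from the record above.
print(gene_length(805624, 806816))  # 1193
print(coding_length(376))           # 1131
```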


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGGGCGGC GACGGAGCGG TCTGCGCGAC GCCAGCGCGC GATCGAAGAG GCGCCCGCGA 
TGGGGCTTCC GACGCATCGC GACAGCGACG CGCGCAGGCG TCGTAAGGGC GCGTCCGCGT
CTTTGTTTTA CGGTTCGACG TCCATCGCGA CGGTGTTTCT CAACAAGTCC ATCTTCGCGA
CGTGGAAGTT TAAGTTCCCG GCGACGCTCG TCACGGCGCA GACGATATTC ACGGTGTTTG
CCATCGTCGC GCTGGAGCAC GTGGGCGCGA TCTCGCCGCG GGGGGGGAAA GGGTTTCGCG
GGAACTTTAA CGCGAAGGCG TTCAAGCGCG TCGGCGTGGT GAGCGCGGTG TTTCAAATGA
AACTCGTGCT CGACATGAAG GCGCTGTCGA TGATAAACAT CCCGATGTAC GGGGTGTTGA
AGTCGGCGAC GACGCCGTTC GTGATGGCGA TCGATTGGGT GATGATGGGG AAAGTGGCGC
CGGCGCGCGT GCAGGCGGCG GTGTGGTTGA CCACGCTCGG GGGAGTGTGC GCCGGTACGG
GAGATTTGGA GTTTAATTTC CTAGGGTACC TAGTTGCGCT GTGCAGTGCG CTGTGCACGG
CGATGTATGT TGTGTTGGTT GGTAAGATTG GGGACGAATT GCAGTTGGAT TCGTTCACAT
TGTTGTTGTA CAACTCGTTG TGGAGCGCGC CGTTGAGCTT GGCGATTTGT TTCGTGTTCG
GCGAGCACCG CGGGTTATTG GATTATCCCT ACCTCGGCCA CTTTGGGTTT TTGATTGCTT
TTTTGTGCTC GTGCTCGAGC GCGTTCATAT TGAACTACGC GACGTACCTG TGCACGCAGC
TGAACGAGGC GCTGACGACG TCGGTGGTGG GACGGACGAA AGGCATAGTT CAAGGCGTCT
TCGGTTTGTT TGCGTTTCAC GTGCGGGCGA GCGCGACAAA CGTGGCTGGT ATAATTTTGA
ACTCAGCGGG CGTGGCTTGG TATGCGTACG AAAAGTACAC CGGGGCGAAG CGCTCGAGTC
CGCGTGCTAT CGCGCCGGCG ACGCTCAACG CCTGCGTAAT TCACCGCGAA GATAGCCAGC
TCACGCTCGA GTCATCGGCA CGGTCGGAGG GTGCGGTCGC GAACGAAAGA CCGGTGGTTC
CGACGAACGG GCGCATGGCG CTGAAAAACT CGCACCAGCA CGCGCATTGA CGG
 
Protein sequence
MGLPTHRDSD ARRRRKGASA SLFYGSTSIA TVFLNKSIFA TWKFKFPATL VTAQTIFTVF 
AIVALEHVGA ISPRGGKGFR GNFNAKAFKR VGVVSAVFQM KLVLDMKALS MINIPMYGVL
KSATTPFVMA IDWVMMGKVA PARVQAAVWL TTLGGVCAGT GDLEFNFLGY LVALCSALCT
AMYVVLVGKI GDELQLDSFT LLLYNSLWSA PLSLAICFVF GEHRGLLDYP YLGHFGFLIA
FLCSCSSAFI LNYATYLCTQ LNEALTTSVV GRTKGIVQGV FGLFAFHVRA SATNVAGIIL
NSAGVAWYAY EKYTGAKRSS PRAIAPATLN ACVIHREDSQ LTLESSARSE GAVANERPVV
PTNGRMALKN SHQHAH
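
Both sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be measured or compared with the listed lengths. The Python sketch below shows one way to do that and to compute a GC fraction. The function names normalize and gc_fraction are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already normalized sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as shown in the record.
first_line = "CGCGGGCGGC GACGGAGCGG TCTGCGCGAC GCCAGCGCGC GATCGAAGAG GCGCCCGCGA"
seq = normalize(first_line)
print(len(seq), gc_fraction(seq))
```

Applied to the full gene sequence, the same two calls can be checked against the Gene Length and GC content fields listed under Gene Information.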