Gene OSTLU_33799 details


Gene Information

Locus tag: OSTLU_33799
Symbol:
ID: 5000886
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 76413
End bp: 78134
Gene Length: 1722 bp
Protein Length: 508 aa
Translation table:
GC content: 52%
IMG OID: 640416307
Product: BCCT family transporter: glycine betaine/carnitine/choline
Protein accession: XP_001416550
Protein GI: 145344046
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
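
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length); the helper name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based, inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = feature_length(76413, 78134)
print(gene_length)  # 1722, matching the listed Gene Length

# Bases needed to encode 508 amino acids (stop codon not counted).
coding_bases = 508 * 3
print(coding_bases)  # 1524
```

The gene span is longer than the bases needed to encode the protein, which is consistent with the record flagging the gene as spliced.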


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGA CGACGCTGTG GGGATTCACG ATCGGCGTGT TGTCTCGTGA AGAGGCGGCG 
GAGGAATTTT TCGGCAAGGC GACGCTTTGG ATTTCCAACA ACTTTACGTG GTTTTACACC
TTGACGCAGA ACGCGTGGAT TTTCGTGGTG ATTTTCGTGC TGTTTCGGAA AAGGTACGCA
AACATCAAGC TCGGACAGCC GGATGATAGG CCTGATTATT CCGATCTCGT GTGGTTCACG
TTGATTTTCA CCACGGGGCT CGGGACGGGG ATATTTTACT TTGGAGTGAG CGAGCCGATG
TACTACTATC GTGATGATTC CAAACTTTTA GGAACAGCAA CCAACTATAT AGCGAAGATC
CCGTTCATGA ACGATGATCA AAGGGCGAAC ATGGCGATGT TTGTCACTTT TTTGCATTGG
GGTTTGCACG GTTGGGCGAC GTACATCTTG GTCGCGCTGA CGCTCGGCGT CGTGCATTAC
CGACTGGGCA GGCCGCTGAC GCTTCGCAGT GCGTTTTATC CCATCGTAGG CGATTACGTC
AACGGGCTGT TCGGCGACTT GATTGACTCG ATGTCCATCG CTTGTACGAC GTTTGGGTTG
TGCACGTCGC TCGGTTTGGG GGCGAGCAGT ATCAATACCA TCTTGCACCG CATGAGCGCT
TCGATTCCTG ACGACGACGA CAACACGAAG TCGGCCATCA TTTGGGGCAT CACCGCGCTC
ACTACGGGTG CACTCGTAAG CGGGCTCCAC CGCGGGGTTA TCACTTTCGC CATTGTCGCC
TTCTCAATTC TACTTGCGCT GATTATGATT TTGTTCATGC TTGACAACAC GTGGTACCTT
GCAAATTCGT TTACGCAGCA AATCGGAACA TACTTGCAGT ACGTGATTCT CGGCGGATTC
GACAATGACG CCATAGCGCA GCTGAACATA GAGTTTACGC AGAACGACAC GCACTTGTGG
GGTGCGCAAG GGATAAAACA ATTAGTGGAA ACGGCCTTGA ACAAGACGCT TGCCGATCCG
ACGACGTATT ACTCGAGCTC GCCGACTTCG TTCATGGATA CGTGGACGGT ATTTTACTGG
GCTTGGTGGA TCACTTGGGC GCCGTTCGTA GGAATGTTTT ACGCTAAAAT TTCACGTGGT
CGCACCATAA GATCGCTTAT TCTGGTCGGT ATGTTTGCGC CGATGTTTTT GGGTTTCTTT
GCTATTTCCG TCCTCGGATC GCTTGGTATT CGAATGCAAC GCATCGCGGA ATTGGCGCTC
GGCGCTGCAC CAGATTGGCA GAAAGGAGTT GTGAACTGCG GTGCCCTCGG GTATTTAGAA
AACACACCCG TGAGTGCGGA CGCAAAAAAG TTGGCCTCGG AGCTCGGAGT CTACGCCTTG
GCGTGTCGAA AATCGACGGA TAGAATTTTG GACATTCTTG AGCCTTACAA AAACTTGACG
ACGCTTTTAC AAACCTTAGT TCTTTTGGGT GTGATTTTGT TCTTCGTCAC CTCTGCGGAT
GCGGGCGCGT TCGTGGACGA CAAAATCGCC GCGAACGGCA TGGAAAATCC TCCTGTTCTT
CAAAAGGTGT GGTGGAGCAT CACGCAAGGC GCCACCGCGC AGGCGTTGCT AAGCTCCACT
AAAAACGGTT TGTCCACGCT TCAATCGGTG AGCATTTGTG CCGCTCTGCC GTACACGTTC
GCGCTGAATT ACATGTGTGT AGCACTCTTT CGCGCTATGG AC
 
Protein sequence
MATTTLWGFT IGVLSREEAA EEFFGKATLW ISNNFTWFYT LTQNAWIFVV IFVLFRKRYA 
NIKLGQPDDR PDYSDLVWFT LIFTTGLGTG IFYFGVSEPM YYYRDDSKLL GTATNYIAKI
PFMNDDQRAN MAMFVTFLHW GLHGWATYIL VALTLGVVHY RLGRPLTLRS AFYPIVGDYV
NGLFGDLIDS MSIACTTFGL CTSLGLGASS INTILHRMSA SIPDDDDNTK SAIIWGITAL
TTGALVSGLH RGVITFAIVA FSILLALIMI LFMLDNTWYL ANSFTQQIGT YLQYVILGGF
DNDAIAQLNI DSPTSFMDTW TVFYWAWWIT WAPFVGMFYA KISRGRTIRS LILVGMFAPM
FLGFFAISVL GSLGIRMQRI AELALGAAPD WQKGVVNCGA LGYLENTPPY KNLTTLLQTL
VLLGVILFFV TSADAGAFVD DKIAANGMEN PPVLQKVWWS ITQGATAQAL LSSTKNGLST
LQSVSICAAL PYTFALNYMC VALFRAMD
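
Both sequences above are laid out in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, with a simple GC-fraction calculation; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and the sample string is only the first 40 bases of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 40 bases of the gene sequence, in the same block layout as above.
sample = "ATGGCGACGA CGACGCTGTG GGGATTCACG\nATCGGCGTGT"
seq = clean_sequence(sample)
print(len(seq))          # 40
print(gc_fraction(seq))  # GC fraction of the sample only
```

Applied to the complete gene sequence, the same two calls can be compared against the Gene Length and GC content fields listed in the record.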