Gene OSTLU_30899 details


Gene Information

Locus tag: OSTLU_30899
Symbol:
ID: 5000891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 948475
End bp: 950197
Gene Length: 1723 bp
Protein Length: 529 aa
Translation table:
GC content: 61%
IMG OID: 640416312
Product: AAAP family transporter: amino acid
Protein accession: XP_001417102
Protein GI: 145345187
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID:
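
The listed gene length can be checked against the start and end coordinates. The following is a minimal sketch that assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the 1723 bp figure). The helper name `feature_length` is hypothetical. The second value is the size of a coding region for a 529 aa protein plus one stop codon, which is shorter than the gene length, so the stored gene sequence appears to include some flanking untranslated bases.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = feature_length(948475, 950197)

# 529 codons for the protein plus one stop codon.
coding_length = 529 * 3 + 3

print(gene_length)                  # 1723
print(coding_length)                # 1590
print(gene_length - coding_length)  # 133
```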


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCTTCGGGT ACGCTTGTGA CGTTAACGTT GGCGCAAAGC GTGCGGGGTG AGCGAGTACA 
ACGCCAAAAC GCGCACGAAC GCAAGCTCGG AGGGTCGAGT TGAAGATCTT TGTCCCGAGC
GAACGCCAAT GTCGAGCGAC ACCACGGACG CGCACGACGC CGCGCACGCC GAGAGCGAGC
TCGAGCACAT GATTTCGAGC GCCATGAGCG CGCTCGCGAG CTCGGCGCTC GTCGGTGAAA
CTGACGGTCG GGCGAAGCGG CGAGGCAACG TCTCCGGGTC GACGGCGACG CTCGCGAACT
GCGCCATCGG CGCGGGAGTG CTGGCGACGC CGTTCGCGGT GAGTAAGTTC GGCACCGTCG
GTGGTGGAAT TGTCGTACTC ATCGCCGCGC TACTCGTCGC GTACACGCTC GTCGTGCTCG
TGCGAGCGGG ATCGGCGTTC GAGTCCACGT CGTATCAAGG CTTGGTGCGC GACGCGTTCG
GAACTCGCGC GTCTCGATTC GTGAGCGGGA CGTTGGTGGT GTACTTGTTC GGATCGTGCG
TGGCATATTT GATCATCATC GGTGATTCGT ACGCGAAAGT GATGAGCGCG GTCGCCAGCG
CGGGGTCGAG CGCGTGGTGG GGAAGTCGAC GATTCGCCAT CGCCGTCGGA GCGACGTTTT
TGGTGACGCC GCTGAGCTTA CTTCGAGAGA TGAGTCGGTT GGCACCGGCG AGTGCGGTAG
CCCTGGTTTC GCTCGCGTAC ACCGCCGCGA CGATCACGTG CAAAGGAATG ACCCGCACTT
CTGGCGGTGA TGACGCCAAA GCCGTGGCTT TCAAATTCAA CACCGATTCC ATCTCCGCCG
TGCCCATCGT CGTCTTCGCG TTTCAGTGCC ATATTCAAGT CTTGGCGATT TTCTCCGAGC
TATCGGCAGA TTCCGCCCCT GAACCGCATT TCGAAGACGA CATCGAACCT ATCGATGGCG
ACGCTAGGCA AGCCACCGAG GCGCGACGCC TCAGTCGAAT GTACACCGTC ATCGCGCTCG
CCGTCGGCGC GTGCTTTTGG GGCTACCTCC TCGTGGGCGA GTTCGCGTAC GTGTCGCATC
CAAACGTGAC GTCTAACGTC CTAGATAGCT ACGGCAAGGA TGACAAAGCC ATGATGGTGG
CGACAATCTT CATGGGCTTC AGCGCCGTCG CATCGTTTCC GGTGAATCAT CACGCCGCGC
GCGCGGCTTT GGACGACTTA CTTGCGGAGG CGTTTGGTTG GGAGGTGTGC GCGCCGGGAC
AAGCGCCGGT GACTCGTCAC GCGACGCAAA CGTTCGCGTT CGTCGTCTTC ACCACGCTCG
TAGCGTTCGC GGTGGAAGAT TTAGGAAAGG TATTCGAGTT CATCGGTGCC ACGTGCGGAA
GTCTCGTCAT GTTCGTGATC CCAGCCTTGC TCTTGCTGCA TCCGAAGATG CGCTCGTCGA
AGGCCGCGGC GGACGTCGAG GAGCCGGCGG ACGATTTGCT CGATGGTCTG GACGACGTCA
CGAGGGAACT TTTGAGTTCC GCTCGCGATC TTCTCGAACA GGATTTTGAC GAGGAAGGTA
ACATCCTTCC ATCAGGCGAC GATGCGAACG CGTCCTTCGC GGCAAAACCG GGAGTCGGAA
CCGTCGTCGT CGCCGGCGCG CTGATTTTAT TCGCGAGCTT TGTGGCGATT AGCAACGTGT
ATGTGCTGCT TTTCAGCGAA CAGAAGCGCG ATTCGTAGAC GCT
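
The GC content field can be recomputed from a sequence in this block layout. This is a minimal sketch, not the database's own method (the record does not say how its 61% figure was derived). The function name `gc_content` is hypothetical, and the sample is only the first 20 bases of the gene sequence above.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 20 bases of the gene sequence, with the record's block spacing kept.
sample = "CGCTTCGGGT ACGCTTGTGA"
print(f"{gc_content(sample):.0%}")
```

Passing the full gene sequence, with or without the spaces and line breaks, gives the value for the whole gene.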
 
Protein sequence
MSSDTTDAHD AAHAESELEH MISSAMSALA SSALVGETDG RAKRRGNVSG STATLANCAI 
GAGVLATPFA VSKFGTVGGG IVVLIAALLV AYTLVVLVRA GSAFESTSYQ GLVRDAFGTR
ASRFVSGTLV VYLFGSCVAY LIIIGDSYAK VMSAVASAGS SAWWGSRRFA IAVGATFLVT
PLSLLREMSR LAPASAVALV SLAYTAATIT CKGMTRTSGG DDAKAVAFKF NTDSISAVPI
VVFAFQCHIQ VLAIFSELSA DSAPEPHFED DIEPIDGDAR QATEARRLSR MYTVIALAVG
ACFWGYLLVG EFAYVSHPNV TSNVLDSYGK DDKAMMVATI FMGFSAVASF PVNHHAARAA
LDDLLAEAFG WEVCAPGQAP VTRHATQTFA FVVFTTLVAF AVEDLGKVFE FIGATCGSLV
MFVIPALLLL HPKMRSSKAA ADVEEPADDL LDGLDDVTRE LLSSARDLLE QDFDEEGNIL
PSGDDANASF AAKPGVGTVV VAGALILFAS FVAISNVYVL LFSEQKRDS
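
The protein sequence can be related back to the gene sequence by translating the coding region. The sketch below assumes the standard genetic code, since the Translation table field of this record is empty. The names `CODON_TABLE` and `translate` are hypothetical. The two fragments are copied from the gene sequence above: the first begins at the first ATG in the third row, and the second is the final 23 bases.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption; the record lists no table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


start_fragment = "ATGTCGAGCGACACCACGGACGCGCACGAC"
end_fragment = "CAGAAGCGCGATTCGTAGACGCT"

print(translate(start_fragment))  # MSSDTTDAHD
print(translate(end_fragment))    # QKRDS
```

Both outputs match the first ten and last five residues of the protein sequence, which is consistent with the gene being listed as unspliced.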