Gene OSTLU_43398 details


Gene Information

Locus tag: OSTLU_43398
Symbol:
ID: 5005373
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 24306
End bp: 26000
Gene Length: 1695 bp
Protein Length: 506 aa
Translation table:
GC content: 67%
IMG OID: 640420794
Product: APC family transporter: amino acid
Protein accession: XP_001421369
Protein GI: 145354178
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
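
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span covers any introns plus the stop codon; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_len = span_length(24306, 26000)   # Start bp / End bp from the record
cds_len = coding_length(506)           # Protein Length from the record
non_coding = gene_len - cds_len        # span not accounted for by the CDS

print(gene_len)    # 1695
print(cds_len)     # 1521
print(non_coding)  # 174
```

Under these assumptions, the 174 bp difference between the gene span and the coding length is consistent with the record marking the gene as spliced.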


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.797401
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTGA CGCGCGACCG CCTCGGGACG ACGACGACGG CGACGGCGAC GGCGACGGCG 
ACGGCGGACG ACGCGCGCGG CGCGAGCGGG GCGCTGCGAA AGGTGCTGAC GCGCGCGGAC
CTGACGACGC TCGGCGTGGG CGGGATCATC GGGGCGGGCG TGTTCGTGCT CACGGGGTCG
GTGGCGAGGG AACACGCGGG ACCGGCGGTG GCGGCGTCGT ACGCGCTGAG CGCGTTTACG
AGCGCGGTGA CGGGATTGGC GTACGCGGAG TTCGCGGTGG CGATGCCGGT GGCGGGGAGC
GCGTATAATT ACGTGTACGG CACGTTCGGG GAGTACGCGG CGTTTCTCAC GGGGTGTAAT
CTGGCGCTGG AGCTCACCAT AGCGAGCGCG GCGATCGCGC GCGGGTGGAC GTCGTACGCG
ACGGCGGCGT TCGGCGTCGA GGCGCGGCGC GCGCGAGTGC GCGTGATCGA TGGGTTGATG
GAGATAGATT TAATAGCTGG GATCGTGGTT TGCGGCATGA CGGCGCTGCT CGTGAGCGGG
GCGAAACAGA CGGCGCGGTT TAACGCGGCG GTGACGTACG CGAGTTTGGT CGTCGTCGCC
GTCGTTCTCT TGGCGGGCGC GCCGGAGATT CAACCGTCGA ATTGGACGCC GTTCGCGCCG
TACGGGATGC GTGGAATAAT ATCCGGTGCG TCGGTGGTGA TTTTTGCCTT CGTCGGTTTC
GACACCGTGG CGACGTGCGC GGAAGAAGTC GCCAATCCCG CGGCGGATTT GCCGTTTGGT
ATTTTGGGAT CGCTCGGGAT TTGCGCGGCG TTGTACTGCG CGATGTGCGT CGTGATCACG
GGCATGGTTT CGTACGACGA CATCGACGTC AACGCCCCGT TCGCGATGGC GTTCACGGCG
TACGGTATGC CCGCGATCGC GACCATCGTG TCCATCGGCG CCGTCGCCGC CATCACGACG
TCGCTGTTGC TTTCGATGAT GGGGCAGCCG CGCATCTTCA TGGTGATGGC GCGAGACGGT
TTGCTCCCGA AGTGGTTCTC GCGCGTCAGC GAAAAGCACG GCACGCCGGC AAACGCGTCG
ATATTCAGTG GCGCCGTCAC CGGCGCGCTG GCGGTGCTGT TGGACATCAA CATACTCGCC
CAGCTCGTGA GCATCGGCAC GTTGAGCATC TTTTGCGGCG TTAATTTGGG ACTCATCGTG
TCCAGATGCG CGCCTCGAGA CGACGACGAT TTCGCCCGTC GCGCGCCGGC TTTGAAGCGC
GCGGGCGCGC TCTTCGTCTC ATCGATGGCG TTCGGCGTGG ACTACCGCGC GCGCGCGCGA
ATCTCCTGGT TTGGCGCCGT CGCCCTCGCC GCCGTCGTCG CGAGCGCGTG CAGCTTTTTG
ACGCTTCCGA TGACGCACGC GCCGAAGACG TTTCGCGCCC CGTTCGTCCC TTTCCTCCCC
GCTTTAGGCG TCCTCCTGAC GTGCGTCTTA ATCGCCGGTC TCGGCGCTCT GGCGTGGATT
CGATACGCCG TCTACACCGT CCTGTGTTCC GTCGGCTATC TCTCCTTCGC CGTTCGCAGG
TCGCGCGAGT CCCCCGAAGG CGTCGACGCC GTCGAGCTCG CCGTCGTCGA CGCCGCCGAC
GCCGACGCCG ACGCCGACGC CGACGCCGGT GACGCCGTCA CGCTCCTCCC CGTCTCCGCC
CGACCCTCTT CCTAG
 
Protein sequence
MRVTRDRLGT TTTATATATA TADDARGASG ALRKVLTRAD LTTLGVGGII GAGVFVLTGS 
VAREHAGPAV AASYALSAFT SAVTGLAYAE FAVAMPVAGS AYNYVYGTFG EYAAFLTGCN
LALELTIASA AIARGWTSYA TAAFGVEARR ARVRVIDGLM EIDLIAGIVV CGMTALLVSG
AKQTARFNAA VTYASLVVVA VVLLAGAPEI QPSNWTPFAP YGMRGIISGA SVVIFAFVGF
DTVATCAEEV ANPAADLPFG ILGSLGICAA LYCAMCVVIT GMVSYDDIDV NAPFAMAFTA
YGMPAIATIV SIGAVAAITT SLLLSMMGQP RIFMVMARDG LLPKWFSRVS EKHGTPANAS
IFSGAVTGAL AVLLDINILA QLVSIGTLSI FCGVNLGLIV FLTLPMTHAP KTFRAPFVPF
LPALGVLLTC VLIAGLGALA WIRYAVYTVL CSVGYLSFAV RRSRESPEGV DAVELAVVDA
ADADADADAD AGDAVTLLPV SARPSS
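
The sequences above are printed in space-separated groups of ten across several lines, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC fraction could be computed from the result. The helper names are hypothetical, and whether the database derives its GC content field in exactly this way (over the full gene span) is an assumption.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups over several lines."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used as a small example input.
example = "ATGCGCGTGA CGCGCGACCG CCTCGGGACG ACGACGACGG CGACGGCGAC GGCGACGGCG"
seq = clean_sequence(example)

print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Passing the full gene sequence block through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field reported in the record.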