Gene OSTLU_46561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46561 
Symbol 
ID5003823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp483918 
End bp485659 
Gene Length1742 bp 
Protein Length500 aa 
Translation table 
GC content51% 
IMG OID640419244 
Productpredicted protein 
Protein accessionXP_001419612 
Protein GI145350438 
COG category 
COG ID 
TIGRFAM ID[TIGR02167] bacterial surface protein 26-residue repeat 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0924573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.560204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATTGGTGACT GGAACACGAG CCAAGTCATG GACATGCGTT TTATGTTCGC TACTGCTTAT 
GCTTTCGATA GTCCTATCCG CAACTGGGAC ATCAGCCAAG TGACGAACAC GAGTTACATG
TTCTATTCTA TTGCTGCATC TTTCAATGTT TTCAACCAAT CCGTCGAGAA CTGGAACACG
AGCCAAGTTA TGGACATGCG TGGGATGTTC GGGAACGCAC GTTACTTCAA CAAGCCTATC
GGCGGCTGGA ACACGAGCCA GGTTACGGAC ATGCGTAACA TGTTCGTTAC CGCCTCTTCC
TTCAACCAGC CCATCGGTGA CTGGGACACG AGCCAGGTTA CGGACATGCG TTACATGTTC
GCTACCGCCT CTTCCTTCAA CCAGCCCATC GGTGACTGGG ACACGAGCCA GGTTACAGAC
ATGCGTGGAA TGTTGGGGTC TGTGTCCTTC AATCAGCCCA TCGGGAACTG GAACACGAGC
CAGGTTACAG ACATGCGTAC GATGTTCGCT ACCGCCTCTT CCTTCAACGC GCCTATCGGT
GACTGGAACA CGAGCCAAGT TACAGACATG GGTAGAATGT TTTACGCAGC CTCTTCTTTT
AACCGGTCTA TTGGCGACTG GAACACGAGC CAAGTCACGG ACATGCAGTC GATGTTTTAC
GCCTCTTCAT TCGATCAACC CATCGGGAAC TGGGACACAA GCCTAGTCAC ATACATGAAT
TCGATGTTCA ACAGCGCAGT TTCCTTCAAC CAGTCTATCG GTGGCTGGAA CACGAGCCGA
GTGGTGAGCA TGCAGTCGAT GTTCGCGAAC GCATTTTCCT TCAACCAGCC CATCGGGAAC
TGGAACACGG GCCGAGTGGT GAGCATGCAT TCCATGTTTG ACAGCGCAGT TTATTTCAAC
CAGCCCATCG GGAACTGGAA CACGGGCCGA GTGGTGAGCA TGAATTCCAT GTTTGACAGC
GCAGGCTTTT TCAACCAGCC CATCGGGAAC TGGGACACGA GCCAGGTTAC AGACATGAAT
TCCATGTTTG ATAGCGCAGG CTCTTTCAAC CAGCCCATCG GGAACTGGAA CACGAGCCAG
GTTACAGACA TGAATTCCAT GTTTGATAGC GCAGGCTCTT TCAACCAGCC CATCGGTGAC
TGGGACACGA GCCAGGTTAC AGACATGCAT TGGATGTTCC AGGGTGCCTC TTCCTTCAAC
CAGCCTATTG GAGACTGGGA TACGAGCCAA GTGGTGACCT CGTACTCATT TAGCGAAATG
TTCAAGGACG CAACGGCGTG GCTATGCTCA CACACACACA ATACGTCGGA TCCGATGCAT
TTTCAAAGAT ACTACGATGG TCCACCGTCT TTTTGGTTCG AGTCAGCGGC TGGAATCGGT
GCAGTGTGCT TGTCTCCACC ACCGCCAACC ACACTACCGT GTGTATATGT CGCCGCCCCT
CCGTTGACCG CAAGGTCGAC TGGAACCGCG AGGTCGAGTG TCGCACTCTC GCTATATACG
ACAGTCATTA TCATATACTT CCACGCCTGA AACTGAGACA GAAACTGAGA CATCGCACAT
TCACAAATCC TACAAGGATA TGATACCGTA GTCATAGTGG ATATATACAG TGCTGCTCTC
GCATTTCCAG TTCCTACCTG ATGAATACTG GATTATGGGG TGATTTTCAA ACAAGTGTAG
AGTACTAGCC TCCATTTCTC GCTTGTTCTT GAAATACAAC TTGAATTACA ACTCGAGGCA
AA
 
Protein sequence
MDMRFMFATA YAFDSPIRNW DISQVTNTSY MFYSIAASFN VFNQSVENWN TSQVMDMRGM 
FGNARYFNKP IGGWNTSQVT DMRNMFVTAS SFNQPIGDWD TSQVTDMRYM FATASSFNQP
IGDWDTSQVT DMRGMLGSVS FNQPIGNWNT SQVTDMRTMF ATASSFNAPI GDWNTSQVTD
MGRMFYAASS FNRSIGDWNT SQVTDMQSMF YASSFDQPIG NWDTSLVTYM NSMFNSAVSF
NQSIGGWNTS RVVSMQSMFA NAFSFNQPIG NWNTGRVVSM HSMFDSAVYF NQPIGNWNTG
RVVSMNSMFD SAGFFNQPIG NWDTSQVTDM NSMFDSAGSF NQPIGNWNTS QVTDMNSMFD
SAGSFNQPIG DWDTSQVTDM HWMFQGASSF NQPIGDWDTS QVVTSYSFSE MFKDATAWLC
SHTHNTSDPM HFQRYYDGPP SFWFESAAGI GAVCLSPPPP TTLPCVYVAA PPLTARSTGT
ARSSVALSLY TTVIIIYFHA