Gene OSTLU_49512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49512 
Symbol 
ID5001872 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp471629 
End bp473956 
Gene Length2328 bp 
Protein Length743 aa 
Translation table 
GC content57% 
IMG OID640417293 
Productpredicted protein 
Protein accessionXP_001417782 
Protein GI145346616 
COG category[R] General function prediction only 
COG ID[COG1287] Uncharacterized membrane protein, required for N-linked glycosylation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0545641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCT TGATCGATCT CTCGCTCGGT GGGTTCGTGC GCGCGATCGC GCTCGCGATC 
GCCGTCTACG CCGCCGTCGA CATTCGTCTG TACGCGGTTC GAGATTACGG AAGGATCATC
CACGAGTTCG ACCCGTGGTT TAATTATCGC GCCACGGTGT ACTTGGCGGA CAACGGATGG
AGTAAATTCA AGACGTGGTT CGATTACATG TCGTGGTACC CGCTGGGACG GCCCGTGGGG
ACGACGATTT ATCCCGGGAT GCAGATCACG GCGGTGTTTC TCTGGCGCGT CTTCAACGCG
ATCGGGTTAA AGATGTCGTT GAATGACGTC TGTGTGTTCT TCCCGGCGTA CTTCGCGGCG
GTGGCGACGT TCTTAGTCGC GCTGTTGACG AAGGAGTGCA GTGGAAGCGG CACGGGAGCG
GTGGTGGCGG CGTTGGTGAT GGCGATCGTG CCGGCGCACA CGATGCGATC CGTAGCGGGC
GGTTACGATA ACGAATCTTT GGCCGTCACG GCGATGTGCC TGACGTTTTA CACGTGGTGC
CGCGCTTTGA GAACCCCGAA ATCTTGGTGG GTCGGTGGAT TGGCTGGTTT GGCGTACACA
TACATGGTCG CCGCTTGGGG CGGGTACACG TTCGTGCTCA ACATGGTTGG TTTGCACGCC
GCCGTGCTCG TGCTGTACAA GCGTTACAGC TCTTCGCTGT ATAAGGCGTA CTCGTTATTC
TTTGTGGTAG GTACGATCGG GGCGCTTCAG TTTCCCGTCG TCGGTACGCT TCCTTTCACG
TCCGCGGAAC AACTCGGACC CATGGGAGTG TTCCTCGGTT TCCAAGTGTT GGAGTTTGTC
GAGCGCACGA TGAAGCCGGG TATGGAGTCG CGCGAAAAGT GGCAGCGACG CATTCGGATG
TACACATTGG CTTTCGCCGC CGCCATCGGC GCGATAACAG TGCTCAGCGC CACTGGTGTC
TACAACATTA ACGGCTTGTC CGTGCGCGTG AAGAGCTTAT TCATCAAGCA CACGAAGACC
GGTAATCCGC TCGTTGATAG CGTTGCCGAG CATCAGCCGG GTAACGCCGA CTCGTATTAC
CGATTCTTAC ACTTCAATTA TTTCATCGCC CCGATCGGGT TCTGTTTCTC TGTCGTGCAC
TTTATGTTTG ACGGTTCCGC GGGCGCCTTG TTCTTGCCGT TATACGGCTT GGTGGCGTAC
TACTTCGCTA ACCGCATGGT CCGTCTCATC ATCTTCCTCG GCCCCGTTGC GGCGTGCTTG
GTTGGTGTGT GCATCGGATA CGCCATTGAA GACGCGCTTA GCATCATGCG GAACGACGAC
ATCGAGCAAG CTGAGACGCC GTCAGAAACC GAGACGCCAA AGTCGGCGAA GAAGAAGTCG
AAACAAGCCG CGTACAAGAA GCCCGAACCC TTGAGCGTTA CGGTCAAGAG AGACATGACA
AAGTGGTGGA TGTCGTCCAC GGTTCGCCAA ATTCGCTTGA TCGTCACTGT CCTTGCTTTG
GTGTTCACGT CGACCAAGAT TCCAGTCTTC TACAAGTACT CGCACCAAAT GGCGGAGGGC
ATGTCTCAGC CGAGCATCAT GTTCAAAGGT CAGCTCAATG ATGGCAGAAC TGTGATGGTC
AAGGATTACG TCGAGGCTTA CGACTGGTTG CGAACGCAAA CCCCCGAAGA CGCTCGCGTC
ATGGCGTGGT GGGACTACGG CTACCAAATC ACTGGAATTG GTAACCGAAC GTCCATCGCA
GACGGTAACA CGTGGAATCA TGAGCACATC GCTAACCTCG CTCGCATGTT GACCTCGGAC
GAAGCTAAAG CGCATAAGGT CATTCGACAC TTGGCCGATT ACGTGCTTGT TTGGGCGGGC
GGTGGAGGAG ACGACATGGC CAAGAGCCCT CACCTTTTCC GCATCGGCGC TTCCATCGGG
AGCGGCCGCG CCACGGCGGT GGAGATGGAG AAGATTACGA GTACTTTTGG CGTCGATCGA
TCTGGTCGTC CGACGCCCAA AATGGCGTCG TCTTTGCTCT TCAAGCTCGT TTCCCCTCAA
GGCTCCGTCA GTCCGGAGCA CTTCCGAGAA GTGTACACGT CCAAGTATCG CAAGGTTCGC
ATTTACGAAG TCGTCAACGT AGACGAAGGT TCCAAGGCTT GGGTCGCTGA TCCCGCCAAC
CGCATGTGCG ATAACGCCGA CGGAACTGGA TTCTGCCCGG GTGCTTACCC GCCGGCGTTG
AAAAAGTTCC CGAGCATCAT CAAGCCGGCG TACAAAGTTC CCGCCTGGAT CAAAGCCAAG
CGCGCAGCCG CCAAGTCCGC CAAATCCGCG GCAAAGGACG AGCTTTAA
 
Protein sequence
MQRLIDLSLG GFVRAIALAI AVYAAVDIRL YAVRDYGRII HEFDPWFNYR ATVYLADNGW 
SKFKTWFDYM SWYPLGRPVG TTIYPGMQIT AVFLWRVFNA IGLKMSLNDV CVFFPAYFAA
VATFLVALLT KECSGSGTGA VVAALVMAIV PAHTMRSVAG GYDNESLAVT AMCLTFYTWC
RALRTPKSWW VGGLAGLAYT YMVAAWGGYT FVLNMVGLHA AVLVLYKRYS SSLYKAYSLF
FVVGTIGALQ FPVVGTLPFT SAEQLGPMGV FLGFQVLEFV ERTMKPGMES REKWQRRIRM
YTLAFAAAIG AITVLSATGV YNINGLSVRV KSLFIKHTKT GNPLVDSVAE HQPGNADSYY
RFLHFNYFIA PIGFCFSVVH FMFDGSAGAL FLPLYGLVAY YFANRMVRLI IFLGPVAACL
VGVCIGYAIE DALSIMRNDD IEQAETPSET ETPKSAKKKS KQAAYKKPEP LSVTVKRDMT
KWWMSSTGMS QPSIMFKGQL NDGRTVMVKD YVEAYDWLRT QTPEDARVMA WWDYGYQITG
IGNRTSIADG NTWNHEHIAN LARMLTSDEA KAHKVIRHLA DYVLVWAGGG GDDMAKSPHL
FRIGASIGSG RATAVEMEKI TSTFGVDRSG RPTPKMASSL LFKLVSPQGS VSPEHFREVY
TSKYRKVRIY EVVNVDEGSK AWVADPANRM CDNADGTGFC PGAYPPALKK FPSIIKPAYK
VPAWIKAKRA AAKSAKSAAK DEL