Gene OSTLU_26649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26649 
Symbol 
ID5004501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp574046 
End bp578548 
Gene Length4503 bp 
Protein Length1484 aa 
Translation table 
GC content58% 
IMG OID640419922 
Productpredicted protein 
Protein accessionXP_001420554 
Protein GI145352435 
COG category[S] Function unknown 
COG ID[COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCG TGCGCGCGGA GGCGGCGATG AAGATAACGC GGGCGTTGGG TGACGCGGGG 
ACGGGAACGC CGGCGTTGGG GTTGTCGCAG TACGAACGAG ATCTGGCGCG CGCGAAGACG
GCGGAGAACA GGGCGCTGCG AAGGGCGCCG CGGCCGCCGA CGTCGGCGCG GACGTTCTCG
GCGTCGGGGA CGCCGCGGCT CGGACGCGAT GCGTCGAACG CGACGTCCGG CGCGCGAAAC
GCGACGACGA CGGGGTATCC GAGTGAAGAC GACGAAGACG CGTACGGGGA CGAGCAGGAC
GGGTATTCGG ACGAGACGGA GCCCGAGCGC GACGAGCGTC TCGTCATCAA GGGCGTTGAA
ACCGTGCTCG AGCGAACGCG CGAGCGACTG ATCGATGCGG ATAGCGAAGA TGAAAGTGTG
CTGGGTTACT ACAGATTACC GACGATGGCG AGCGGTAAGA TTGCCTTCGT GAGCGAGACA
AACATTTGGA TCGCCTCCGC GGACGGCGGC GTCGCCTCTC GATTGTCGTC GACGTACAGT
CGCGAAGGTG TGCCCAGACT TTCCCCCGAT GGCTCGCACA TCGCGTTTCT CGGCTTGACG
TACGACGGCT ACGACGTATT CGTCGCTCCG ACAGACGGAG GCGTCGCGAC GCAAGTCACG
TTTGGGGGCA AGACGCAAGA CATCGCCGAT TGGCCCGAGG ACAACGAGAT TCTCCTCATT
TCCACGGCGT TTTCCAAGTC GGCGTCGACA CAGCTGGTGA CTCTCAACCC AGACACTAGA
GCGATGTCGC CGTTGCCGTT TGCCAGAGCG ATCGAGGGCG TGAAAGAATC GGGCGGATGC
TACGTCTTTG TTCCGTTGCG ACAGACGAGC TCGACAAAGC GCTACGAAGG TGGCGAACAA
TCGCGCTTGT GGCGATGGTG TCGTGCCGAC TCCGAGGCGA AGCCGCTCAC GCCAGAAGAA
ACGTGGGGCG TGCGGGGAGC ATGGAGTCCG AGTGTTTCTG TCGCGAAGGG CCTCGAGAAC
TTCATCTTCT TCATCAGCGA CAAGTCGGGC GTCGCCAACG TGTGGTCGAT GACGACGGAT
GGCGAGTTCG CGATGCAGCT CACGAATGAG TGTTTGTTTG ATGTTCAAGA ATTTACCGTC
GATGGTGATG TGCTCGTGTA TCGACGCGGT GGTTCACTGT ACCGCCGTGC AATTTCGGCG
CACACGGCGC GCGCGCCCGC GCTCGGCGAG GAAGTCGAGC TGACATACTC CCTTGTGAGC
GAGTTCCGCA AAACGATGCC GACGGAGATT GAGGACCCTT ACTCGTCAAT CAGTGAACTT
GTGTTGACAA ACGATGGCGC TTTTGCCGCG TTTGTGATTC GTGGTCAGGT GTTTCACTCC
GCGCTCGTTC CATTCTTGGG AACACGCGTG GAAAAGGTGA CTCGCTACAG CGGGGCGGTG
CGATACAAGC ACATTCAGTT TGTGAACGAC GCCAGCTCAG ACGGACAACC CAAACTCTTG
GCGCTTTCTG ACGCATCGGG CGAATACGAG TACGTCTTGC TCGAGCGTCA AATCGAAGGT
GGACCGTGGA GAGAGACGAA GATCACGACC GGCGGCAAGA TCAAAGGTGC CATGGAGTTT
AGCTCCATCT CTCCGGACAA TACCGCCCTG TTGCTCGCCG ACACGGATGG GAATTTGAAG
CTCGTCAACT TGACGGAGAC GGCGGTGAAT ACGGCGTTGT ATCGAACCAC GGAACCTGAG
CCAATTTCAG CGAAGCTCGC CAAACGTCAA AGTAAACCGG GCAGGCCGGA TGCATCGCCC
GCGGGCCCTC AGCGCCCTAA GCCAATGTTG GGCAGAAGCC CCGCCGCGCG CCGCGCCGCG
CGCCGCGCCG CGCGCGTCGG AAGACGCGTA TCGGGGCGCC GTGCGAGTCT ACACTTGGCG
AGCATCGGCT TGGGGAAAGA AAAGTCCGAT GACAAAGCGT CGTCGGTTTA CGCCGATATC
GACGGCATAG CGAGTATTGA CACCATGTTG AATGGTTTCC GTCCGGAGGG CGTGTACGGC
TTGGCGTGGA GCCCCGATAG CTCGTACATC GCGTTCGCCA AGGATGAAGA AAACGATTTT
TCATCCATTC ACGTGTTGAA TGTGACGTCT GGGGAAAGTA CGCGAATCAC GACGCCGAGT
TACAACGCGC ACAGTCCAAA ATTTTCACCC GACGGCTTGT ACTTGTACTA CTTCAGCGAT
CAGCAAATCG CCTCGGGCGG GAGTTCGCCG TACGGCGCAC GCGGTGCTGA GCCGATGTTT
TATGAGACGT CTCGATTGAT GTGTGTTCCG TTGCGTATTG GCATGACGTG TCCGTTTTTC
ATCGGTGACG AGTTGAATCC TGAAGGCAGC GTGTTCGATG CAGAGTTTGG CAAGGCGTTA
CCGACGATGA TTTCGACGAA AAACATCGAG CAGCGAGCGC AAGTTGTCCC CGACTTGCCA
GAGGCCCAAT ACTTGGGCTT CAACATCATC GGCGACGGAA GCGGGATGCT CCTTACCTTG
GTTGAAGATG ACCAGGTCAT CGTCGCGGTG TACAGCATGT ACTCGCGTCA AATCTTGCCG
TTGTATCCCG ATCCTATGGC GGTAGTCGTG AGCGGGGACA TGCAAGTCTT CACCCTCGTC
ACCGAACAGG GTGTCGCGCT TTTGTCCGCC AAGGGCGTAT TACAGCCAGG ATTGGATCAA
GAATCCGCTC TTCGTGGAGC CGAAGTTTGG TACCCTCCGG ACACTTTCGG TGTCATCGTG
AATCCTCGCG AGGAATGGCT GCAAATGTAC GAGGAGGCGA TGCGCAACAT GAGAGATGCC
TTCTATGACC CAAAATTGCA CGGAGTCGAT TGGACATCCA TCACCGATCG CTATCGCCCA
TTGGTATATA AAATATCTGC CAAGGGCGAG TTGCGGGACA TTTTAGCCCA AGCCTTCGGT
GAGCTCTCCG CGTTGCACGT GTTCGTATCC GTTCATTCTG ACGATCCCGT GCTTCCGCTC
GGTTCGCCCT CGGCGTGCCT CGGCGCGAAT TTCGAAATGA CACCGTTGGG TTTGAAAATC
GCGTACATTC ACGAAAGTTC GGGGATCTTG GAGGCTCCGC AGTCATCATT GAGTCGTTCT
TCACTTTATT CGCACGTCAA ACTGCAGGCG GGAGACATCA TCGCCAAAAT TGATGGCGTT
CTCGTCAACT CCACCGAGTT GCCTCTGTCG AATTTGCTTC GCGGCAAAGC TGGGATGCAA
GTCTTACTCG AAGTCATCAA ATCGCCGCGG TCGGAGCAGG AGCGAAAAGA CGAGGAGGAT
TTGCAAAAGA TGAAAGAGTT GCGCATGATG CAGCAGATGA TGGGTGGTGG CATGGGTATG
GCGTCGTCTC AACTCGGCGG CGCGAGCACC GCGACGAAGA CGCTTCGGGA TCCGCACATG
GCGAGCCTGG TTCGTAAGCA CGCCACCCCA GGGGCGCCCA GACTTCCGCT GGGTCAGCCG
CGGCACAGTG TCGGACCGTT GAAAAACGGT GATGTGCACG CTGACGCGAA GTTGGCCGGG
AAGGAACATA AGGGCGATGA CGAAAATGAG CAGCCCGCAC CGACAACGGA GATGATCATC
GTCATGCCGC TCATGCTTGA GGAGTGCGAC GTTTTAGAAG CCGCGGATGC TGTCGTTCAA
AAGCGGGAGT ACGTGAAGAG CCAGACCAAC GACGACGTGG CATACGTTTA CTTGGAAGAC
ATGGAGCAAG AAGGGCGCGG TTCGAGTAAT TCCTTCGACG ACTTCGCCGC TCAGTTTTAC
CCGAACGTGC GCAAGAAGGG CTTAATCATC GACGTCCGTC GCAACGCCGG CGGAAACATT
GACACGTGGC TGCTCGAGCG TCTTCGTCGC GTGGCGTGGA TGTTCGACAC CGAACGAAGC
GGTGCCGGCG ACACGACGAT GCAATACACG TTTAGAGGCA AAGTTGTCGT GCTGATTGAC
GAGCAAACAA GCAGCGACGC CGAGATGTTC GCGCTCGGAA TTCAACAGCT CGGCATCGGT
CCGGTGATCG GCGAGCGCAC TTGGGGAGGC GCCATCGGGT ACAGCGGCCA CCCCGAACTT
CGTCTAGTGG ACGGTTCGGG GTTCACCATT CCATCCTTTG GCCCGTACAT CAACGGAGAC
TGGGCGATCG AGCAAAAGGG CGTCACCCCT GACGTTCTCG TGTCCAACTT ACCAGTGGCG
ACGTTTCACG GGAAGGACGC CCAACTCGAC GCTGCGATCG CCTCCATCCT GGACTTGATG
GGCACCGAAA GCGTCGAATT GCCGCCGAAA CCTACGTATC CGAACCGTGC GTTCGATTCC
ACGTCGTGCG CCGCTGGCGA GACTGGCGTG GACTCGGCGT ATCTCGTCTC CACCCAGGCC
GCGCAAGACG ACGCGTCGAC GCCAGCCGTC GAGTCGGCGA CGCAAGTGAT TCAAGCCGAG
ACCGCGGTGC CGTGAGCGCC ACACCATAGT ATGATATTAA CAGGCTTATT GCAGTAGTAG
AAT
 
Protein sequence
MASVRAEAAM KITRALGDAG TGTPALGLSQ YERDLARAKT AENRALRRAP RPPTSARTFS 
ASGTPRLGRD ASNATSGARN ATTTGYPSED DEDAYGDEQD GYSDETEPER DERLVIKGVE
TVLERTRERL IDADSEDESV LGYYRLPTMA SGKIAFVSET NIWIASADGG VASRLSSTYS
REGVPRLSPD GSHIAFLGLT YDGYDVFVAP TDGGVATQVT FGGKTQDIAD WPEDNEILLI
STAFSKSAST QLVTLNPDTR AMSPLPFARA IEGVKESGGC YVFVPLRQTS STKRYEGGEQ
SRLWRWCRAD SEAKPLTPEE TWGVRGAWSP SVSVAKGLEN FIFFISDKSG VANVWSMTTD
GEFAMQLTNE CLFDVQEFTV DGDVLVYRRG GSLYRRAISA HTARAPALGE EVELTYSLVS
EFRKTMPTEI EDPYSSISEL VLTNDGAFAA FVIRGQVFHS ALVPFLGTRV EKVTRYSGAV
RYKHIQFVND ASSDGQPKLL ALSDASGEYE YVLLERQIEG GPWRETKITT GGKIKGAMEF
SSISPDNTAL LLADTDGNLK LVNLTETAVN TALYRTTEPE PISAKLAKRQ SKPGRPDASP
AGPQRPKPML GRSPAARRAA RRAARVGRRV SGRRASLHLA SIGLGKEKSD DKASSVYADI
DGIASIDTML NGFRPEGVYG LAWSPDSSYI AFAKDEENDF SSIHVLNVTS GESTRITTPS
YNAHSPKFSP DGLYLYYFSD QQIASGGSSP YGARGAEPMF YETSRLMCVP LRIGMTCPFF
IGDELNPEGS VFDAEFGKAL PTMISTKNIE QRAQVVPDLP EAQYLGFNII GDGSGMLLTL
VEDDQVIVAV YSMYSRQILP LYPDPMAVVV SGDMQVFTLV TEQGVALLSA KGVLQPGLDQ
ESALRGAEVW YPPDTFGVIV NPREEWLQMY EEAMRNMRDA FYDPKLHGVD WTSITDRYRP
LVYKISAKGE LRDILAQAFG ELSALHVFVS VHSDDPVLPL GSPSACLGAN FEMTPLGLKI
AYIHESSGIL EAPQSSLSRS SLYSHVKLQA GDIIAKIDGV LVNSTELPLS NLLRGKAGMQ
VLLEVIKSPR SEQERKDEED LQKMKELRMM QQMMGGGMGM ASSQLGGAST ATKTLRDPHM
ASLVRKHATP GAPRLPLGQP RHSVGPLKNG DVHADAKLAG KEHKGDDENE QPAPTTEMII
VMPLMLEECD VLEAADAVVQ KREYVKSQTN DDVAYVYLED MEQEGRGSSN SFDDFAAQFY
PNVRKKGLII DVRRNAGGNI DTWLLERLRR VAWMFDTERS GAGDTTMQYT FRGKVVVLID
EQTSSDAEMF ALGIQQLGIG PVIGERTWGG AIGYSGHPEL RLVDGSGFTI PSFGPYINGD
WAIEQKGVTP DVLVSNLPVA TFHGKDAQLD AAIASILDLM GTESVELPPK PTYPNRAFDS
TSCAAGETGV DSAYLVSTQA AQDDASTPAV ESATQVIQAE TAVP