Gene OSTLU_30925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30925 
Symbol 
ID5001582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp250 
End bp2026 
Gene Length1777 bp 
Protein Length427 aa 
Translation table 
GC content59% 
IMG OID640417003 
Productpredicted protein 
Protein accessionXP_001417396 
Protein GI145345817 
COG category 
COG ID 
TIGRFAM ID[TIGR02167] bacterial surface protein 26-residue repeat 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGGG CGCGACGAGG CGATGGCCGT CAAGCCCTCG TCGCGGCGCT CGTGGGGGCG 
CTCGTCGCGA TCTCGATCGC GCCGCTGCGC GCTCGGGCGG CTCCTTTCCC GGCGGATGGT
GCTAATCTCG GGACCGCTAT CACTAACTGT CTCGCGGCTG ATGCGACGGG TGCGTGCGAC
TGCTCGTCGT CGTCCGTGGA CTGCAAGGAA GGGAACGGTA TCGCAATCGG GTCGTGGAAT
ACGTCGCAGG TGACGAGAAT GGACGGCATG TTCTATGGCG CCGCAGCCTT CAACCAAAGC
ATCTCTGCTT GGAACACGGC GGAGGTGACG ACGATGGCGT ACATGTTCGC GTTCGCCACA
GCCTTCAACC AAGACATCTC TGCTTGGAAC ACGGCGGCGG TGACGAATAT GGAACGCATG
TTCCATAACG CCGCAGCCTT CAACCAAGAC ATCTCTGCTT GGAACACGGC GGCGGTGACG
ACAATGTTCG CCATGTTCGG GAACGCCATA GCCTTCAACC AAGACATCTC TGCTTGGAAC
ACGGCGGCGG TGACGACAAT GCAAGCCATG TTCTATGACG CCGCAGCCTT CAACCAAGAC
ATCTCTGCTT GGAACACGGC GGCGGTGACG ACAATGAACT CCATGTTCTA CAAAGCCACA
GCCTTCAACC AAGACATCTC TGCTTGGAAC ACGGCGGAGG TGACGACAAT GGAACGCATG
TTCTATAACG CCACAGCCTT CAACCAGGAC GTCTCTGCTT GGAACACGGC GAAGGTGACG
ACCATGACAC GCATGTTCGA TAACGCCGTA GCCTTCAACC AAGACGTCTC TGCTTGGAAC
ACGGCGGCGG TGACAGACGC CGCGTGGATG TTTAATGGCG CAACCGCCTG GAATTCAAAG
TACGAACGCA ACGATGCCAG TACATCCACT GATGGTCCAC CCTCGGCATG GTCTATCCGC
ATGTGTCTCG CCGACCAGCG CGTCGTCGCC AACGCGTGCG TGTCGTGCCC CGCCGGGACG
ACCAACGCCG CGGGCGACCT CCGCTTGGGT ACGGACACGG CATGCGACGC AGCTCCGTTG
TCGAATGCAA GAACGTTTAC GCACGCTGAA ATTGCCGGTG TCGTCGTTGG TGTCGTCGCG
TCCATCATCG TCGCGGTCGC GCTCTGCGTT AGGCGTCGCC GCCTCGCCGA CGATAGGCTC
CGCGCGCGCC TCGGCATCCC CACCCACAGC GCGCCCGTCG GGTTTCCGCC ACAAGAGCCG
CAAATCATCA TCGTTCAGCG ATGAGCGCGC TCGCGACGAG CGACGTTCGA GACGCATCGC
GCTCGTCGAC GCGCGCACAC ATTTCGAGGG CGAGCTCACA ACAGTATCTC GATTAAGCGA
CAACAGAATA TATTCAATTC AATTCAATTC AATTTTTTCT CAAAGACGAC GATTCACGGC
GCGCGTCGTG CTGGGTTTCG AGATGGCAAC GAAGAAATTT GTGTGAGAAA GCTTGCGGCA
TAACGGATCC GCCGCAGGCT TTCCACTCAA ATCGACTGTG TGTCACGCAA GAGGAGTGCT
TGGGGTTCCG TTATTTTCAC AGAGGACAAA CTACGACTGG TCGTTTGTCG TCCAACACGA
ACACTGCAGA AGGAACTTTG ACGCAAGAGT CCGGAGACGT CCGGAGCGTC GAGGGGGAAG
TCTGCGGTGC CGTGCTGGTC GACGAGTGAT GGCCGTAGGG TGTCGTGAAG GTGCACAGGA
AGGATGCGAA TGCTAGTGAC TTTTTAGGGT TTAGTAC
 
Protein sequence
MKRARRGDGR QALVAALVGA LVAISIAPLR ARAAPFPADG ANLGTAITNC LAADATGACD 
CSSSSVDCKE GNGIAIGSWN TSQVTRMDGM FYGAAAFNQS ISAWNTAEVT TMAYMFAFAT
AFNQDISAWN TAAVTNMERM FHNAAAFNQD ISAWNTAAVT TMFAMFGNAI AFNQDISAWN
TAAVTTMQAM FYDAAAFNQD ISAWNTAAVT TMNSMFYKAT AFNQDISAWN TAEVTTMERM
FYNATAFNQD VSAWNTAKVT TMTRMFDNAV AFNQDVSAWN TAAVTDAAWM FNGATAWNSK
YERNDASTST DGPPSAWSIR MCLADQRVVA NACVSCPAGT TNAAGDLRLG TDTACDAAPL
SNARTFTHAE IAGVVVGVVA SIIVAVALCV RRRRLADDRL RARLGIPTHS APVGFPPQEP
QIIIVQR