Gene OSTLU_28422 details


Gene Information

Locus tag: OSTLU_28422
Symbol:
ID: 5006340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009372
Strand:
Start bp: 11327
End bp: 12537
Gene Length: 1211 bp
Protein Length: 306 aa
Translation table:
GC content: 48%
IMG OID: 640421761
Product: predicted protein
Protein accession: XP_001422282
Protein GI: 145356110
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
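
The Gene Length value is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption, and the helper name `span_length` below is hypothetical. A minimal Python sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
gene_length = span_length(11327, 12537)
print(gene_length)  # 1211
```

By the same kind of arithmetic, a 306 aa protein needs 306 * 3 + 3 = 921 nucleotides including a stop codon, which is less than the 1211 bp listed for the gene.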


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAAGT TTCACCAGCT ACCACTGAGG GCGTACGTGG ACGTAAAAGT CGACCCGGTG 
ATTTTCACGC ACGCTCTCAA TAAGAACCAC ACGTCGAATC TGTTCGTGAT CGCTCCAGAC
GACAAACGCG GCGCAAATCC CTTTCATTTC GCGCAATCTG CCATGTTCTT TTTTCACAGC
GCTCTCAGGA CGCTAGAGAA TTCGGATAGC TCTTCTGTCG TGATAATTTT TAGACAGCGG
CCGCCCGCTC AAAGTTGGAT TGATAGTTTG ACTCAGCAGA TCTTTGGCGA CGTCAGGGTC
GTTTATGGTG ACGAGTTGAC ATCGCCTATT TGTGCCAGAA GAGTTGTCGT CGCTGGAACT
ATGATAGGTT TACTTCAAGG CCCATATGAC GCTCAATTAT TTCGAGACCG AGTGTATGGC
AATCTCAAAA TCAACCCGAA ACGAATAAAC AGAGCGGATT TGCGAGTGAC ACTGATTGAT
AGAAAGAAGA GACGTGTCAC TAACGTAGGC GAATTACAAG AAATATTGGA TGAGCGTCGA
CTTTGGTACA AAACTGTTCG GCTCGATACG CTTTCTTTCA AGGAGCAAGT GTCCCTCATG
TCAGAAACAG ACTTGCTCAT TTCATCGCAT GGCGCCGATC TTACGAACGT TATATTCATG
CAACGAGAAA GCGCAGTCAT TGAGCTCTTT CCTTCGACGG TTTGGTACTA TGAGCTCTAC
GCAAAAATCG CACGGAACGC CGGATTGTTC CACACGTACG CTCTCGGCGA TCAAACGCAC
GCCGTTACGA AGACCATTGC GGAGTGCTTT GAAAGTGCCT GTCTGACCGA ACTGAAACGC
GACTTTATGA TACCGCCTGA ACGTTTTCGT ACTTCTCTCG ATCACGCGCT CAGTCTCCTT
GGAGTCGCCA ACGCAGTCTA GTAGATTGAT CCGCTTCGAG TTGCGGCATT AGACACGACG
TTCAGTGGCG GTCCAAAAGT CCCAACGCGC GTTCGTGCAA TCATCAAAGT ACGAGAAGCA
CGGGTGAGGA GGAAACGTGA TCGAGGAAGC TAAGCCGCTG TAAGTACTAT TCGATCCAAC
AATACACGAT GCTTGAGATA GAACATAAAG CTCAGCAAAA GTATCCATAT TCCCTTTTCG
CATACTCCGC TTGAAGACGC TCCAACTTCT GTCGATGTGA AAAACATTGG TACTGTGCAC
AGTGCGCACG C
 
Protein sequence
MSKFHQLPLR AYVDVKVDPV IFTHALNKNH TSNLFVIAPD DKRGANPFHF AQSAMFFFHS 
ALRTLENSDS SSVVIIFRQR PPAQSWIDSL TQQIFGDVRV VYGDELTSPI CARRVVVAGT
MIGLLQGPYD AQLFRDRVYG NLKINPKRIN RADLRVTLID RKKRRVTNVG ELQEILDERR
LWYKTVRLDT LSFKEQVSLM SETDLLISSH GADLTNVIFM QRESAVIELF PSTVWYYELY
AKIARNAGLF HTYALGDQTH AVTKTIAECF ESACLTELKR DFMIPPERFR TSLDHALSLL
GVANAV
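
The relationship between the two sequence blocks can be spot-checked by translating the start of the gene sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty, and it assumes the reading frame starts at the first base. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence block, copied as formatted in the record.
first_line = "ATGTCAAAGT TTCACCAGCT ACCACTGAGG GCGTACGTGG ACGTAAAAGT CGACCCGGTG"
print(translate(first_line))  # MSKFHQLPLRAYVDVKVDPV
```

The printed residues match the first 20 residues of the protein sequence block. Passing the full gene sequence to the same function would stop at the first in-frame stop codon.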