Gene OSTLU_88387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_88387 
Symbol 
ID5004049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp145670 
End bp147187 
Gene Length1518 bp 
Protein Length505 aa 
Translation table 
GC content56% 
IMG OID640419470 
ProductF-ATPase family transporter: protons (vacuolar) 
Protein accessionXP_001420095 
Protein GI145351461 
COG category[C] Energy production and conversion 
COG ID[COG1156] Archaeal/vacuolar-type H+-ATPase subunit B 
TIGRFAM ID[TIGR01040] V-type (H+)-ATPase V1, B subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.116254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.503853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCCG CGGACGCGCT CGCGAGCGAT TTCGCGCGCA CGCAGATCGC CGCCGCGAAG 
GCGACGATCG ACGCGATCGA TGCGATCGAC CCGAAGCTGA CGTATCGAAC CGTGCGCGCC
GTCGCGGGTC CGCTGGTGAT TTTAGATCAA GTTTCGAACG CAAAGTACGC CGAAATCGTC
AACGTGAAGC TCGGCGACGG CACGCTCAGG CGCGGACAAG TGCTCGAGGT CGATGGTGAA
CGCGCGGTGG TGCAGATATT CGAAGGCACG AGCGGGATCG ATGGGAAAAG GACCGAGTTG
GAGTTTACGG GGGAAGTGCT GAAGACGCCG GTGAGCGAGG ATATGTTGGG ACGGATCTTT
AACGGATCTG GGAATCCCAT CGACGGTGGA CCGCCGGTGA TGGCGGAGAA ATATTTGGAT
ATCCAAGGGG CGAGCATCAA TCCGAGCGAA AGGACGTACC CGGAGGAGAT GATACAGACG
GGAATATCGA CCATTGACGT GATGAATTCC ATCGCTCGAG GACAAAAGAT TCCATTGTTT
AGTGCGGCGG GTTTGCCGCA TAACGAGATT GCGGCGCAGA TTTGTCGACA AGCTGGATTG
GTGAAACGGC GGGACGATGA GGGGCGAGAG GTGGAGGGGG GAGACGCGGG CACGTCGGCG
GGCGAAGACG ACTTCGCCAT CGTCTTCGCG GCGATGGGGG TCAACCTGGA GACGGCAAAC
TTTTTCCGTC GTGATTTCGA ACGGACTGGA AGTTTGGAGA AAGTTGTTTT GTTCCTGAAC
CTCGCGAATG ATCCCACGAT TGAGCGAATC ATCACTCCGC GCATCGCTTT GACGACGGCG
GAATACCTGG CGTACGAGTG CGGGAAGCAC GTCTTGGTGA TTTTAACCGA CATGTCTTCC
TATGCCGACG CTTTGCGCGA GGTCAGCGCG GCGCGCGAGG AAGTCCCGGG GCGAAGAGGT
TATCCTGGTT ATATGTACAC CGATTTAGCT ACGATTTACG AGCGAGCGGG TCGAATCAAA
GGACGAAAAG GTTCAATCAC GCAGTTGCCG ATTTTAACCA TGCCGAACGA CGACATCACG
CACCCGATTC CAGATTTGAC GGGATACATC ACCGAAGGGC AGATTTACAT CGACCGCCAG
TTGCATAACC GACAAATCTA CCCACCCATC AACGTTTTGC CATCGCTCAG TCGTTTGATG
AAGTCTGCGA TTGGCGAAGG CATGACGCGT CGCGACCACG GCGAAGTCTC CAACCAGCTC
TACGCCAACT ACGCCATCGG CAAAGACACC TTGGCTATGA AAGCCGTCGT CGGCGAAGAA
GCGTTGAGCT CCGATGATTT GCTTTACCTC GAGTTCTTGG ACAAATTCGA GCGCAAGTTC
ATCAATCAGG GTAACGAAGG ACGGAACATT TACGACGCCT TGGACCTCGC GTGGTCGCTG
CTTAGAATTT TCCCGCGCGA GCTCTTGAAG CGCATTCCCG CCAAGACGCT CGATCGATAC
TACGACCGCG CGGCTTGA
 
Protein sequence
MSPADALASD FARTQIAAAK ATIDAIDAID PKLTYRTVRA VAGPLVILDQ VSNAKYAEIV 
NVKLGDGTLR RGQVLEVDGE RAVVQIFEGT SGIDGKRTEL EFTGEVLKTP VSEDMLGRIF
NGSGNPIDGG PPVMAEKYLD IQGASINPSE RTYPEEMIQT GISTIDVMNS IARGQKIPLF
SAAGLPHNEI AAQICRQAGL VKRRDDEGRE VEGGDAGTSA GEDDFAIVFA AMGVNLETAN
FFRRDFERTG SLEKVVLFLN LANDPTIERI ITPRIALTTA EYLAYECGKH VLVILTDMSS
YADALREVSA AREEVPGRRG YPGYMYTDLA TIYERAGRIK GRKGSITQLP ILTMPNDDIT
HPIPDLTGYI TEGQIYIDRQ LHNRQIYPPI NVLPSLSRLM KSAIGEGMTR RDHGEVSNQL
YANYAIGKDT LAMKAVVGEE ALSSDDLLYL EFLDKFERKF INQGNEGRNI YDALDLAWSL
LRIFPRELLK RIPAKTLDRY YDRAA