Gene P9303_16371 details

Gene Information

Locus tag: P9303_16371
Symbol:
ID: 4776412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1430359
End bp: 1431459
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 53%
IMG OID: 640087146
Product: tRNA 2-selenouridine synthase
Protein accession: YP_001017646
Protein GI: 124023339
COG category: [R] General function prediction only
COG ID: [COG2603] Predicted ATPase
TIGRFAM ID: [TIGR03167] tRNA 2-selenouridine synthase
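
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon contributing no residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1430359
end_bp = 1431459
gene_length_bp = 1101
protein_length_aa = 366

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
expected_aa = span // 3 - 1

print(span, expected_aa)  # 1101 366
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.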


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.218284
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGGAA TGGGTACCCA TACTCCTTAC TCGATCGAAA GATTCCGTCA GGCCAATGGT 
CCGGTTGTGG ACGTTCGCAG TCCAGCTGAA TTCAATAAGG GCCATTGGCC AGGTGCGATC
AACCTTGCTC TCTTCAACGA TGAGCAACGG GCGGCGGTCG GCATCACTTA CAAAAAAAAA
GGTCGTCAGC AAGCGATCAA GCTAGGTCTT GAGTTCACAG GTCCAAAGCT GTCAGGTCTT
GCTGAAGCCC TGGCGAAACT CAGCAGAGAT CCAACCACTG AATCAGAAGA CGTCTCTGCA
TCTGATCTAC GCATTTACTG CTGGCGAGGC GGCATGCGCT CCGCGAGTGT TGCTTGGCTA
GCGGGTCTTC TGGATCTCAA GCCTGTGCTT CTGGAGGGTG GATACAAGAG CTATCGGCGA
TGGGTACTGC AACAATTTGA GCAAACCTGG CCTTTGCGAT TACTGGGTGG ACGAACAGGC
ACGGGCAAGA CTGATCTGCT CATTGCCATG GCACAACGGG GGGTCGCTGT AGTCAATCTT
GAAGGTTTGG CGAACCATCG GGGCAGCAGC TTTGGTGGGT TGGGCTTACC GCCACAACCA
AGCACTGAAC ATTACGAAAA TCTCCTTGCC GAAGATTTAC AGCGCTGTCA AAACTGTTCT
GCTAATGAAA TCTGGCTGGA AGCGGAAAGT TCACAGGTGG GTCGCTGCAG AATCCCCCGA
GCACTTTTCC ATCAGATGCA AATGGCACCA GTGCTGGAGA TCAACCGCTC ACTTGATGAA
CGGGTTGCTC AACTGGTTGA TGTGTATGGC CAGCATGGCC GTGAGTCTTT GCAGGAAGCG
ACCCAACGCA TCAGCCGCCG TTTAGGCCCA CAGCGCACAC GACAGGCCTT GGATGCAATC
GCTCTTGAGA ACTGGGACCA AGCCTGTCGC GCCATGTTGG ATTACTACGA CCGTTGCTAT
GACTACGAAC TAAGCAGGAC CCCTCAAAGA CAAAGCGTGG ATCTTTGCGG ACTGAACACA
ACCAAAGCCG CTGAGATGTT GATTGAACGA GAGCTTGTCA GATCAAGCCC CAAGCCACAG
CTGGTTATGA GCTCAACGTA A
 
Protein sequence
MSGMGTHTPY SIERFRQANG PVVDVRSPAE FNKGHWPGAI NLALFNDEQR AAVGITYKKK 
GRQQAIKLGL EFTGPKLSGL AEALAKLSRD PTTESEDVSA SDLRIYCWRG GMRSASVAWL
AGLLDLKPVL LEGGYKSYRR WVLQQFEQTW PLRLLGGRTG TGKTDLLIAM AQRGVAVVNL
EGLANHRGSS FGGLGLPPQP STEHYENLLA EDLQRCQNCS ANEIWLEAES SQVGRCRIPR
ALFHQMQMAP VLEINRSLDE RVAQLVDVYG QHGRESLQEA TQRISRRLGP QRTRQALDAI
ALENWDQACR AMLDYYDRCY DYELSRTPQR QSVDLCGLNT TKAAEMLIER ELVRSSPKPQ
LVMSST
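
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be reproduced from the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is written in the NCBI base order TCAG, and the sketch assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts (this gene begins with ATG, so that difference does not matter here).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = "ATGTCAGGAA TGGGTACCCA TACTCCTTAC"
print(translate(fragment))  # MSGMGTHTPY
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.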