Gene P9211_00141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_00141 
Symbol 
ID5730613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp17159 
End bp18160 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content39% 
IMG OID641284356 
ProducttRNA-dihydrouridine synthase A 
Protein accessionYP_001549899 
Protein GI159902555 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00742] tRNA dihydrouridine synthase A 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00872968 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTGCCA CTTACACTCC AAGAATTACT AACGCCTATC GTTTTAGTAT TGCGCCAATG 
CTGGACTGCA CAGATAGGCA TTTCAGGGTA ATTTTTCGAC AAGTCAGCCG AAGGTCATTG
CTATATACGG AAATGATTGT AGCTAAAGCA TTGCAACATC AAAAAGGAAG GCGTCTTTTA
GATTTTGATG AAATTGAACA CCCTATTTCA CTCCAAGTTG GCGGAGATAA TCCAAAAGAG
CTAGCTGACG CTGCCAAGCT TGCAGAAGAT TGGGGATATG ACGAAATCAA TCTAAATCTT
GGTTGTCCTA GTCCAAGAGT GCAATCAGGC AACTTTGGTG CATGCCTTAT GGCAACTCCA
AATCAAGTCG CTAAATGCAT AGAAGCCATG AAAAATGCAA CCCAAATTCC TGTGACAATC
AAACATAGAA CTGGCATTGA TAACTTTGAC AGTGAGGAAT TTCTTTTTTC TTTTGTCGAT
CAAATAGCTA AAGCAGGAGC AGACAGATTT GCAATTCATG CTCGCAAAGC ATGGCTGGAA
GGATTAAATC CAAAACAAAA TCGAACTATT CCACCATTGG AATATCTAAA AGTTAAAAAA
TTAAAATTAA AGCGTCCCGA ATTAAAAATA GAGTTCAACG GCGGCTTACA TACACCAGAG
GAATGTGTAA AAACTTTAAA GATATTTGAT GGGGCAATGG TTGGAAGATC CGCATATTCA
AATCCTATGC TTTGGCAAGA AATGGATTCG TTAATTTATG GAGAGGATTA TTTTCCAGTA
AAAGCTTCGC AAGTTATTCA AAATTTAATT CCATATGCAC AAAAGCACCT TGATAATGAA
GGGAGGCTTT GGGATATAAG CAAACATCTT CTTCAATTAG TCCAAGCGGT ACCTGGAGCT
CGTGCATGGA GACACAACCT AAGCACTAAA GCTCAAAAAT CAAGCGCAAA ACTAATAATT
TTAGAAAAAG CTGCCCAACA ATTAGAAGAA GTTGGGCTTT AA
 
Protein sequence
MVATYTPRIT NAYRFSIAPM LDCTDRHFRV IFRQVSRRSL LYTEMIVAKA LQHQKGRRLL 
DFDEIEHPIS LQVGGDNPKE LADAAKLAED WGYDEINLNL GCPSPRVQSG NFGACLMATP
NQVAKCIEAM KNATQIPVTI KHRTGIDNFD SEEFLFSFVD QIAKAGADRF AIHARKAWLE
GLNPKQNRTI PPLEYLKVKK LKLKRPELKI EFNGGLHTPE ECVKTLKIFD GAMVGRSAYS
NPMLWQEMDS LIYGEDYFPV KASQVIQNLI PYAQKHLDNE GRLWDISKHL LQLVQAVPGA
RAWRHNLSTK AQKSSAKLII LEKAAQQLEE VGL