Gene P9303_00141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_00141 
Symbol 
ID4775971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp17882 
End bp18886 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content54% 
IMG OID640085513 
ProducttRNA-dihydrouridine synthase A 
Protein accessionYP_001016036 
Protein GI124021729 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00742] tRNA dihydrouridine synthase A 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGTTC CTTTCACGCC ACAGGTTGAT GGTGCTTATC GGTTCAGCGT GGCTCCAATG 
CTCGACTGCA CAGATCGACA CTTCAGGGTA CTAATGCGAC AAATCAGTCG CCGGGCGTTG
CTTTACACGG AAATGTTGGT TGCCCAAGCT CTGCATCACA GCAACCGTCT TGATCATCTG
CTCGATTTCG ACATCATCGA GCATCCCCTG TCTCTACAAG TAGGGGGCGA TGATCCAAAA
ATGCTTGCAG AAGCAGCGCG CCTGGCCGAT GCCTGGGGCT ACGACGAAAT CAACCTCAAC
GTGGGATGTC CCAGCTCAAG AGCAAAAGCA GGCAACTTCG GTGCCTGCCT AATGGCTAAA
CCTGATCAAG TCGCACGTTG TGTTGAAGCG ATGGCGATGG CGAGCCCTCT TCCAGTCACC
GTGAAACACC GTCTAGGAAT TGATGATTTC GATAGCGACG CTCTACTCAT GACCTTTGTC
GACCGAGTGT CCCTCGCAGG AGCCACTCGC TTTACTGTGC ATGCACGAAA AGCCTGGCTA
GAAGGGCTTG ACCCCAAACA AAACCGCACG ATTCCACCAC TTCAACATCA ACGAGTCACC
CATCTCAAGC AACAACGCCC GCAGCTCACT ATTGAAATCA ATGGAGGACT AGAACACCCT
GCCGACTGCC TAACAGCGCT GCAAACCTGT GATGGGGCAA TGGTGGGGCG AGCAGCGTAT
GCGCATCCGC TCCGCTGGAA GAGCATGGAT GAGCTGGTCT ATGGAGAAGA ACCACGCTCA
ATCAATGCTT CTCAAGTCAT AGGAGGATTA CTCCCTCATG CCGAAACCCA CCTGAGCCGA
GGTGGCCGGC TATGGGATCT TTGCCGACAT CTTTTACAAC TCGTTGAAGG GGTACCGGGC
GCCAAATCCT GGAGGCGAGA CCTTGGCATC AAGGCTCAAA AAGCCGATGC CGATCTAACA
GTGCTGCAAA AAGCAGCCCA GCAACTTGAA GATGCCGGGC TATAA
 
Protein sequence
MIVPFTPQVD GAYRFSVAPM LDCTDRHFRV LMRQISRRAL LYTEMLVAQA LHHSNRLDHL 
LDFDIIEHPL SLQVGGDDPK MLAEAARLAD AWGYDEINLN VGCPSSRAKA GNFGACLMAK
PDQVARCVEA MAMASPLPVT VKHRLGIDDF DSDALLMTFV DRVSLAGATR FTVHARKAWL
EGLDPKQNRT IPPLQHQRVT HLKQQRPQLT IEINGGLEHP ADCLTALQTC DGAMVGRAAY
AHPLRWKSMD ELVYGEEPRS INASQVIGGL LPHAETHLSR GGRLWDLCRH LLQLVEGVPG
AKSWRRDLGI KAQKADADLT VLQKAAQQLE DAGL