Gene A9601_12221 details


Gene Information

Locus tag: A9601_12221
Symbol: gid
ID: 4717936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1039754
End bp: 1041166
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 33%
IMG OID: 640078938
Product: tRNA (uracil-5-)-methyltransferase Gid
Protein accession: YP_001009613
Protein GI: 123968755
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1206] NAD(FAD)-utilizing enzyme possibly involved in translation
TIGRFAM ID: [TIGR00137] tRNA:m(5)U-54 methyltransferase
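
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1039754
END_BP = 1041166
GENE_LENGTH_BP = 1413
PROTEIN_LENGTH_AA = 470


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1413
print(expected_protein_length(GENE_LENGTH_BP))  # 470
```

Under these assumptions both derived values agree with the listed gene length (1413 bp) and protein length (470 aa).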


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCATATA AACAAGTAAT AATTATTGGA GCTGGTCTCG CAGGATCAGA AGCAGCTTGG 
CAAGTTGCAA GTTCTGGTGT TCCAGTTAAA TTAGTTGAGA TGAGGCCTAT TAAATCAACT
CCAGCTCATC ATACAAGTGA ATTTGGAGAA TTGGTTTGTA GTAATAGCTT TGGAGCTCTA
AGCCCTGATA GAGCTGCCGG TTTATTACAA AAAGAACTTA GAATTTTTAA TTCATTGATA
GTTCAAACAG CAGATAAATT TGCTGTTCCA GCTGGAGGCG CTTTGGCGGT TGATAGATCT
AAATTCAGTA TTGCTTTGAC TGAAGCTTTA TCTAATCATC CTTTAATTGA AATTAAGAGA
TTTGAACAAT TAGATTTGCC AAGCAAAGAA AATATAACTA TCCTTGCAAC TGGTCCATTA
ACTGCAGATG AATTATCCTA TAAAATTCAA GCTTTTACAG GTATAGATGC GTGTCATTTT
TTTGATGCCG CTAGTCCTAT TATTTATGGA GATACTATTG ATCAAGAGAT TGTATTTAAA
GCTAGTAGAT ATGACAAAGG AGATCCTGCA TATTTTAATT GCCCTATGGG AAAAAATGAT
TATATCAATT TCAGAAATGA ACTAATAAAA GGTGAACAAG TTAATTTAAA AGACTTTGAG
AAAGAATCAG CTAATTTCTT TGAAGCTTGT TTACCAATTG AAGAAATTGC TAGAAGAGGA
GTTGATACAA TGAGGTACGG ACCACTAAAA TCAATTGGTT TGTGGAATCC AAAATGGGGA
GATTTATTTG ATAGGGAAAA TAGATTAAAA AAGCGACCTC ATGCAGTTGT CCAATTAAGG
AAAGAAGATT TAGAAGGAAA ATTGCTAAAT ATGGTAGGTT TTCAAACTAA CCTCAAATGG
TCTGAGCAAA AAAGAATATT TAGGTTGATT CCTGGTTTAG AAAAGGCTGA GTTTGTACGT
TTTGGAGTAA TGCATAGAAA TACTTTTTTA GAATCTCCAA AATTACTTTT ACCGACATTA
CAATTTTTGA AAAGAGAAAA CCTTTTTGCT GCAGGCCAAA TAACGGGTAC CGAAGGTTAT
GCAGCAGCAG CAGCAGGGGG CTTGCTTGCA GGAATAAATG CATCCTTATT AGCTAAGGGT
AAAAAAACAG TAAGTTTCCC TGATCAATCA ATGATTGGTT CTCTAATGAA TTTTATCAGT
AACAAAAATC AAATATTATC TAATCAGCAA AAGAATAAAT TCCAACCAAT GCCCGCTTCA
TTTGGTTTAG TTCCAGAGCT AATTAAAAGA ATAAAAGATA AAAGATTAAG GTACAAAGCT
TATCAAGAAA GATCTACAGA AGCCTTGAAT GACTTTAAAA ATCAACTAGA TTCTTGTTTT
GATAAAGACC ACTTACTTAG CAAAATTTAC TAA
 
Protein sequence
MSYKQVIIIG AGLAGSEAAW QVASSGVPVK LVEMRPIKST PAHHTSEFGE LVCSNSFGAL 
SPDRAAGLLQ KELRIFNSLI VQTADKFAVP AGGALAVDRS KFSIALTEAL SNHPLIEIKR
FEQLDLPSKE NITILATGPL TADELSYKIQ AFTGIDACHF FDAASPIIYG DTIDQEIVFK
ASRYDKGDPA YFNCPMGKND YINFRNELIK GEQVNLKDFE KESANFFEAC LPIEEIARRG
VDTMRYGPLK SIGLWNPKWG DLFDRENRLK KRPHAVVQLR KEDLEGKLLN MVGFQTNLKW
SEQKRIFRLI PGLEKAEFVR FGVMHRNTFL ESPKLLLPTL QFLKRENLFA AGQITGTEGY
AAAAAGGLLA GINASLLAKG KKTVSFPDQS MIGSLMNFIS NKNQILSNQQ KNKFQPMPAS
FGLVPELIKR IKDKRLRYKA YQERSTEALN DFKNQLDSCF DKDHLLSKIY
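
The gene and protein sequences above are printed in space-separated blocks of ten characters, and the record lists translation table 11. The sketch below shows one way to strip that grouping, translate a CDS, and compute a GC fraction. It assumes the NCBI table 11 codon assignments and initiator codons (TTG among them, read as M at the first position); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first thirty bases of the gene sequence are used in the demonstration, not the full record.

```python
# Codon table built from the standard NCBI layout (base order T, C, A, G).
# Table 11 uses the same amino-acid assignments as the standard code and
# differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons assumed for table 11; any of them is read as M at position 0.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is translated as methionine
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
first_blocks = "TTGTCATATA AACAAGTAAT AATTATTGGA"
print(translate(first_blocks))  # MSYKQVIIIG
```

The printed peptide matches the first block of the protein sequence, which is consistent with TTG serving as the start codon here. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.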