Gene NATL1_00131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00131 
Symbol 
ID4779204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp17073 
End bp18080 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content34% 
IMG OID640083276 
ProducttRNA-dihydrouridine synthase A 
Protein accessionYP_001013842 
Protein GI124024726 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00742] tRNA dihydrouridine synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.385799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.966125 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCCA ATTCTTTAAA AACAAATGAA ATAGCTAACT ACAGATTAAG TATTGCTCCA 
ATGATGGATT GCACAGATAG GCATTTTCGT GTCCTTATGC GCCAGATCAC CAAAAAATCG
CTTCTATACA CTGAAATGAT TGTGGCCCAA GCGCTCCATT ACAGCAAAAA CAGGAATAAA
TTATTAGATT TTGATGAGAT TGAGCATCCT ATTTCCATAC AACTAGGAGG AGACAACCCA
AAACTTTTAG CTGAAGCAGC TCAAATGGCT GAAGATTGGG GCTATGACGA AATCAACTTA
AACATTGGAT GCCCAAGTCC GAGAGTAAAA TCTGGCAACT TTGGAGCTTG TCTTATGGGA
AAACCAAAGA TAGTGGCCAA TTGTATTGAG AAAATGAAGG AATCATGCAA TATACCAATA
ACCGTCAAAC ACAGGCTTGG TATTGATAAT CTTGACAGTG ATGATTATCT TCTGACATTT
GTTGATACTT GTTCACTTGC AGGAGCAGAC AGATTCATCA TTCATGCAAG AAAAGCTTGG
CTTAATGGAT TAAATCCAAA AGAAAATCGT ACAATTCCAC CTCTTCAATA TGAAAGAGTC
CAAAAATTAA AAAATCACAG GCCTGAATTA ATTATTGAGC TGAACGGTGG AATAAATACA
ATTAATGATT CTATTGAAGC TCTAAAAGTA TTTGATGGGG CAATGGTAGG AAGAGCTGCA
TACTCTCATC CATTTCTTTG GACAAAAATT GATTCATTAA TTTTTGGACA AAAAGAAAAA
TATTTATCAA GATCAAAAAT AATTAAAAGG CTTATTCCTT TTGCTGAAAA GCATTTAGAA
AATGATGGAC GTCTTTGGCA AATTTCTAGA CATATTTTAA ATCTGATAGA AAATATTCCT
AATGCAAAAA TATTGAGACA AGAATTAAGT GAAAAATGTC AAACTCAGAA AGCTGATATT
TCTATTTTAA AAAAAATTGC CCAACAACTC GAAGATGCTG GGCAATAA
 
Protein sequence
MISNSLKTNE IANYRLSIAP MMDCTDRHFR VLMRQITKKS LLYTEMIVAQ ALHYSKNRNK 
LLDFDEIEHP ISIQLGGDNP KLLAEAAQMA EDWGYDEINL NIGCPSPRVK SGNFGACLMG
KPKIVANCIE KMKESCNIPI TVKHRLGIDN LDSDDYLLTF VDTCSLAGAD RFIIHARKAW
LNGLNPKENR TIPPLQYERV QKLKNHRPEL IIELNGGINT INDSIEALKV FDGAMVGRAA
YSHPFLWTKI DSLIFGQKEK YLSRSKIIKR LIPFAEKHLE NDGRLWQISR HILNLIENIP
NAKILRQELS EKCQTQKADI SILKKIAQQL EDAGQ