Gene NATL1_04411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04411 
Symbol 
ID4780537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp404273 
End bp405265 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content40% 
IMG OID640083718 
ProducttRNA-dihydrouridine synthase 
Protein accessionYP_001014270 
Protein GI124025154 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family
[TIGR00742] tRNA dihydrouridine synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.324632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGT TTTCTCCGAT TTGTTTATCA GGCAATGGGA CGCCTAGATC TATAGAGTGT 
CTGGTTATTC AGTCCCCACT TGCAGGAGTG AGCGATCAAA TATTCAGGAA CTTTGTTCGT
AGATGGTCTC CAAAAGCTTT ACTTTTCACC GAAATGGTAA ATGCTAAAAG TCTTGAATTA
GGTCATGGTG AAGAGAAAGT AATTGAGCTT TCAGAAGAAA GTGGTCCAAT TGGTGTTCAA
CTTTTTGACC ATAGGCCAGA TTCAATGGTA GATGCTGCCA TCAAAGCCGA ATCATCTGGT
GCATTTCTTA TAGATATCAA TATGGGTTGC CCAGTAAAAA AAATTGCCAG GAAAGGAGGC
GGAAGTGCTC TATTGAAAGA ACCAGAACTT GCGCAATTAA TCGTCAAAAA GGTTTCAAAA
GCCATATCTA TTCCAGTAAC AGTCAAAATA AGATTGGGGT GGTGTGAAAC CACAAGTGAT
CCAGTATCTT TTGCTTTAGG TCTACAGGAG GCTGGGGCTC AACTCATAAC TGTTCATGGG
CGAACAAGAA GGCAAGGATT TTCTGGCCAT TCAAACTGGA AAGCTATTGC CCAAATCAAA
AAGTCATTAG ATATACCTGT CATCGCTAAT GGTGATATTA AAAACTCTCG AGATGCTATT
GAGTGCCTAA AGATCACGAA TGCCGATGGG GTGATGATAG GAAGGGCAAG TATGGGAGCT
CCATGGCTGG TTGGACAAAT TGATGAAGAA ATTAAAAACC AAACAACTTT TAAACCACCT
GACGCAAAGA TGAAAGTGAG CTTATCTTTA GAGCACCTAA AATTACTTGT TTCAAAGAAA
GGAAGTCATG GGCTTTTGAT TGCTAGGAAA CATATGAATT GGACTTGTAG AGGTTTTGAG
GGTGCGTCTA CTCTTCGCCA TAAATTAGTT AGAGCAAGCA CTCCAAACGA CGCAATTAAA
CTACTGGAAG ACGAACTGAT TAAATTCAAC TAA
 
Protein sequence
MKEFSPICLS GNGTPRSIEC LVIQSPLAGV SDQIFRNFVR RWSPKALLFT EMVNAKSLEL 
GHGEEKVIEL SEESGPIGVQ LFDHRPDSMV DAAIKAESSG AFLIDINMGC PVKKIARKGG
GSALLKEPEL AQLIVKKVSK AISIPVTVKI RLGWCETTSD PVSFALGLQE AGAQLITVHG
RTRRQGFSGH SNWKAIAQIK KSLDIPVIAN GDIKNSRDAI ECLKITNADG VMIGRASMGA
PWLVGQIDEE IKNQTTFKPP DAKMKVSLSL EHLKLLVSKK GSHGLLIARK HMNWTCRGFE
GASTLRHKLV RASTPNDAIK LLEDELIKFN