Gene NATL1_18751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18751 
Symbol 
ID4780197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1533834 
End bp1535060 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content37% 
IMG OID640085164 
Productaldo/keto reductase 
Protein accessionYP_001015695 
Protein GI124026580 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAATAA ATAAAGATAG TAAAAACAAT TCAATTCCTA GAAGAAGATT TGGGCGAACT 
GAGATTCAGA TACCAGTCTT ATCTTTGGGA GGGATGCGCT TTCAACAAAG CTGGAAGGAT
TTAGATCCTA AAGAGATCAA TAATCAACAG CAAGACATCT TGCAAAAAAT AATTAACCAT
GCTGCTAAAA ATGGTATGCA TCATATAGAA ACTGCTCGTC ATTATGGAAC ATCAGAACGC
CAGATAGGCT GGGCCTTCGA TCAAATCGAT GACCCAAAAA GAATATTACA AACGAAAGTA
CCACCAAATA ATGATCCTTC GATATTCGAG CAAGAACTTG AGTTAAGTAT GAGTAGATTA
AATTCCAAAA AAATTGATTT ATTAGCTATT CATGGTATCA ACCTTCCTGA GCACTTGGAT
ATGACTATTC GTCCTAATGG ATGTCTACAG ATTGTTCGTC GTTGGCAAAA AGATGGTCTT
ATTGGTCATG TTGGCTTCTC AACTCATGCA AACGTTGACC TGATAATTAA AACAATTGAA
ACAGGGCTTT TTGATTATGT TAATTTGCAT TGGTATTTTA TTCGTCAGGA AAACGAAAGA
GCTTTGCAAG CAGCGGATGC TAATGATATG GGGGTTTTTA TTATAAGTCC TACTGATAAG
GGAGGTCATT TACATACACC GTCATTGAAA CTTTTAGATT TATGTAGTCC TTTGCATCCA
ATAGAATTTA ATGATCTATT TTGTTTGAGG GATCAAAGAG TTCATACATT AAGTGTTGGA
GCCTCAATTC CTGAAGATCT GGATATTCAT TTAAGTGCGA TTTCTAAAAT TGACACCATG
CATGGGGTAA TTAATACCAT TGAAAAGCGA CTAATAAATG CCTCGTATAA ATCGCTTGGA
GAGTCATGGT TGACTACTTG GAATATAGGT CTACCTCGCT GGGATCAAAC TCCAGGAGAG
ATAAATATAC CTACTTTATT ATGGCTAAAC AATTTATTAG AGGCTTGGGA TATGGAAAGC
TTTGTTAAAG ATCGTTATGG TCTTTTAGGG AGAGGGGGAC ATTGGTTCCC AGGGGCAAAC
GCAGATTGCT TGGATTGTGA AGTGAGTGAG GATGACTTGA AAAAAGTTTT GATTAATAGC
CCTTGGAGCT CTGAAATACC TTCTGTACTT AGAAAATTGA AAGATAGATT GGGAGGGGAA
AGAAGAGATA GATTATGGGG GATTTAG
 
Protein sequence
MPINKDSKNN SIPRRRFGRT EIQIPVLSLG GMRFQQSWKD LDPKEINNQQ QDILQKIINH 
AAKNGMHHIE TARHYGTSER QIGWAFDQID DPKRILQTKV PPNNDPSIFE QELELSMSRL
NSKKIDLLAI HGINLPEHLD MTIRPNGCLQ IVRRWQKDGL IGHVGFSTHA NVDLIIKTIE
TGLFDYVNLH WYFIRQENER ALQAADANDM GVFIISPTDK GGHLHTPSLK LLDLCSPLHP
IEFNDLFCLR DQRVHTLSVG ASIPEDLDIH LSAISKIDTM HGVINTIEKR LINASYKSLG
ESWLTTWNIG LPRWDQTPGE INIPTLLWLN NLLEAWDMES FVKDRYGLLG RGGHWFPGAN
ADCLDCEVSE DDLKKVLINS PWSSEIPSVL RKLKDRLGGE RRDRLWGI