Gene NATL1_15981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15981 
Symbol 
ID4781209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1307657 
End bp1309033 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content38% 
IMG OID640084880 
Productputative thioredoxin reductase 
Protein accessionYP_001015420 
Protein GI124026304 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase
[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCCG AAAACAAAGA ATTAAAAACT GAAAATTTAG TAATTATAGG TTCAGGACCC 
GCTGGATATA CCGCTGCTAT CTATGCGGCA AGAGCGAATC TACAACCACT TATTATTACT
GGGTTTGAAA AAGGTGGAAT TCCAGGTGGT CAATTAATGA CGACAACATT TGTAGAGAAT
TTCCCAGGTT TCCCAAATGG AGTTCAAGGT CCAGAATTGA TGGATTTAAT AAAAGCACAA
GCTGTGAGAT GGGGGACAAA TCTAATTGAA GAAGATGCCA TATCAATTGA TCTAAGCAAA
AGACCCTTTT CTATTGTTAC TACTACCAAA AAAATCAAAA CAAACTCCTT AATTATTTCC
ACCGGAGCAA GCGCCAATCG TCTAGGCCTT AAAAATGAGA AATTATTTTG GAGTAAGGGT
ATTAGTGCCT GTGCAATTTG CGATGGAGCA ACACCTCAAT TTAGAGATGA AGAACTTGCC
GTAATTGGAG GAGGGGACTC GGCTTGTGAA GAAGCCGAAT ATCTAACAAA ATATGGTAGC
CATGTACATT TGTTAGTCAG ATCAAGGAAA CTAAGGGCTT CAGCTGCAAT GGCCGATCGA
GTAGAAGCAA ATCCCAATAT TACGATTCAT TGGGAGACAG AGCTTTTAGA TGTTTTGGGT
AATGATTGGC TAGAGAAATT AAAAGTTAAA AGAAAAGACA CTAACCAAGA ATATGAAATT
TTAGCTAAGG GACTTTTTTA TGCTATTGGA CATACTCCCA ACACGTCTTT ATTTACCAAT
CAATTAAATA AAGACTCAAA AGGTTACTTA ATAACACAAC CTGGAAGACC TGAAACTTCA
TTAGAAGGCG TTTATGCTGC AGGAGATGTT GCTGATTCGG AATGGCGTCA AGGTGTTACT
GCGGCAGGCA GCGGTTGCAA AGCAGCTTTA GCTGCTGAAA GATGGCTAAC AAAAAACAAC
TTAGCAACAC TTATAAAAAG GATTGAATTA GAGCCCTCAA AGGCTGAAAA AGCAAAAACT
TTAGAAATTA GTAACGAGGC AAATTTCGAC CCCGACAAGA CATGGCAAAA GGGAAGTTAT
GCACTAAGAA AGTTGTATCA TGAGACAGAA AAACCTCTCT TCGTAGTTTA CACATCCAGT
AGCTGCGGGC CTTGTCACAT TCTCAAGCCC CAACTTCTCA GAGTTCTTAA CGAGTCAAAA
GGGAAAGCTA TAGGTGTTGA AATTGATATT GAAAACGACC AAGATATTGC CAAGCAAGCT
GAAGTAAGCG GGACTCCAAC AGTACATCTT TTTAAGAATA AGGAATTAAA AAAACAATGG
AAAGGTGTTA AAACAAGAAG TGCATATAAA GCCGCATTAG ATGAACTAAT TAATTAA
 
Protein sequence
MPAENKELKT ENLVIIGSGP AGYTAAIYAA RANLQPLIIT GFEKGGIPGG QLMTTTFVEN 
FPGFPNGVQG PELMDLIKAQ AVRWGTNLIE EDAISIDLSK RPFSIVTTTK KIKTNSLIIS
TGASANRLGL KNEKLFWSKG ISACAICDGA TPQFRDEELA VIGGGDSACE EAEYLTKYGS
HVHLLVRSRK LRASAAMADR VEANPNITIH WETELLDVLG NDWLEKLKVK RKDTNQEYEI
LAKGLFYAIG HTPNTSLFTN QLNKDSKGYL ITQPGRPETS LEGVYAAGDV ADSEWRQGVT
AAGSGCKAAL AAERWLTKNN LATLIKRIEL EPSKAEKAKT LEISNEANFD PDKTWQKGSY
ALRKLYHETE KPLFVVYTSS SCGPCHILKP QLLRVLNESK GKAIGVEIDI ENDQDIAKQA
EVSGTPTVHL FKNKELKKQW KGVKTRSAYK AALDELIN