Gene NATL1_01371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01371 
Symbol 
ID4780013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp134017 
End bp135459 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content36% 
IMG OID640083401 
Productcysteine desulfurase activator complex subunit SufB 
Protein accessionYP_001013966 
Protein GI124024850 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component 
TIGRFAM ID[TIGR01980] FeS assembly protein SufB 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAAT CTAATACTGT TGAAGAAATT GTTTCCCAAC CTTATAAATA CGGTTTTATA 
ACTGAAATTG AAACTGAAAA GATCCCAAAA GGATTGAATG AAGATGTCAT TAGGTTAATT
TCCTCCAAAA AAAATGAGCC AGAATTTTTA TTAAAATTTC GTTTAAGAGC TTATGAGCAA
TGGTTGAAGA TGAAAGAACC TGATTGGTCA GGCTTAACTT ATTCCAAAGT AGATTATCAG
GATCTTGTTT ATTATGCAGC CCCAAAACAA GACATTAAGA AGAAGAGTTT AGATGAAGTC
GATCCTAAAT TACTTGAAAC TTTTGAAAAG CTTGGGATAC CACTTAGTGA ACAAAAAAGA
TTATCTAATG TAGCTGTAGA TGCTGTCTTT GACAGTGTTT CTATAGCAAC TACATACAAG
GAAAAGCTAG CTGAACATGG AGTAATTTTC TGCTCTATTA GCGAGGCAAT TAGCGAATAC
CCTGATCTAA TTGAAAAATA TATGGGTACA GTCGTTCCTC TAAATGATAA CTTTTTTGCA
GCTCTAAACT CTGCAGTTTT TAGTGATGGT TCGTTTGTTT ACATACCCAA AGGTGTCGAA
TGTCCAATGG AACTATCATC TTATTTTAGA ATAAATTCTG GCGACACTGG TCAATTCGAA
CGAACTTTAA TTATTGCCGA AGAATCCTCC TCCGTTAGTT ATCTAGAGGG CTGTACAGCG
CCTATGTTTG ATACCAATAC ACTCCATGCT GCTGTTGTTG AGCTTGTCGC GCTCGATAAT
GCATCTATCA AATATTCAAC TGTGCAAAAT TGGTATGCAG GTAATGAAGA AGGAGTTGGA
GGAATATATA ACTTCGTCAC AAAAAGAGGG GAATGTAGAG GAAAGAAAAG CAAAATAAGC
TGGTCTCAAG TCGAAACAGG TTCTGCAATT ACATGGAAAT ACCCAAGTTG CGTATTACAA
GGAGACAATT CCATTGGTGA GTTTTATTCA ATTGCACTTA CTAATAATTG TCAGAAAGCG
GATACTGGGA CAAAAATGAT TCACATAGGA AAAAATACAA AATCAAAAAT CGTAAGTAAA
GGTATCAGTG CTGGAAAGTC TAAGAATAGC TATCGAGGTC TTGTATCCAT AAGTCCAAAT
GCTGAGGGCG CTCGAAATTA CAGTCAATGT GACTCAATGT TGATCGGTGA CAAAGCCAGT
GCAAATACAT ATCCATATAT TCAATCCAAA CAACCTCAGT CAAATATTGA ACATGAAGCT
AGTACTTGTA GGATTTCAGA AGATCAACTT TTTTACTTAC AAAGTAGAGG AATTGATTTC
GAGGAATCAG TCTCAATGTT AGTCAGTGGA TTTTGTAGTG ATGTTTTCAA TGAACTACCT
ATGGAGTTTG CATCTGAGGC AGACAAACTT TTAGCTTTAA AATTGGAGGG TTCGGTTGGA
TAA
 
Protein sequence
MTQSNTVEEI VSQPYKYGFI TEIETEKIPK GLNEDVIRLI SSKKNEPEFL LKFRLRAYEQ 
WLKMKEPDWS GLTYSKVDYQ DLVYYAAPKQ DIKKKSLDEV DPKLLETFEK LGIPLSEQKR
LSNVAVDAVF DSVSIATTYK EKLAEHGVIF CSISEAISEY PDLIEKYMGT VVPLNDNFFA
ALNSAVFSDG SFVYIPKGVE CPMELSSYFR INSGDTGQFE RTLIIAEESS SVSYLEGCTA
PMFDTNTLHA AVVELVALDN ASIKYSTVQN WYAGNEEGVG GIYNFVTKRG ECRGKKSKIS
WSQVETGSAI TWKYPSCVLQ GDNSIGEFYS IALTNNCQKA DTGTKMIHIG KNTKSKIVSK
GISAGKSKNS YRGLVSISPN AEGARNYSQC DSMLIGDKAS ANTYPYIQSK QPQSNIEHEA
STCRISEDQL FYLQSRGIDF EESVSMLVSG FCSDVFNELP MEFASEADKL LALKLEGSVG