Gene NATL1_02831 details

Gene Information

Locus tag: NATL1_02831
Symbol: clpB2
ID: 4779077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 260411
End bp: 263206
Gene Length: 2796 bp
Protein Length: 931 aa
Translation table: 11
GC content: 35%
IMG OID: 640083548
Product: putative ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB
Protein accession: YP_001014112
Protein GI: 124024996
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID:
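
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS is complete and ends in a stop codon; the record does not state either assumption explicitly, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 260411
end_bp = 263206


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count, assuming a complete unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under those assumptions the computed values agree with the Gene Length (2796 bp) and Protein Length (931 aa) fields.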


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACTT TGAATCAAAA GTCATCAAAA ATGAATGGAA GTCTCACTAC AGAACCAGAT 
TCGTTTAGCG ATGAAGCTTG GAGTCTTTTA TTAATAGCTG AACAATCAGC CAGAAGATGG
AGACATAAGA ACTTAGATGT TGAGCATCTT ATTGAAGTGC TTTTTAGAAA TAAAAAATAT
CAAAAATATA CAAATTCCTT ACCCATAAAC CATAAAGAAT TAAATGAAAT CTTAGAAAAC
TTTATCGCTG AACTACCAAT AAACAACCAA CCAGATTTAT TTATTGGAGA AGACTTAGAA
ATTCTTCTTG AGGTCGCTGA TGATTTTCGC TCTCGATGGG GATCTAATCA AATAGAAATA
TCTCATATCC TCATCGCAAT TGGAAGAGAT AATCGCTTAG GAGAAGATCT TTTTTATCAA
GCAGGTCTAC CGAGTGAAAT TCTCGAAGCG GAATTGAGAC GACTACCAGC ACCGAAATCA
TTTAAACAAT CAAAAAGAAA TCAAAACAAA CCAATAACAA ATCGACCACA GAAAGATTCA
CAGTCTTTTA TGCCTACCGA AACCACTGCA AAAGATCCAA AACCCGAGCC ACTTCCTCCT
CTCTCTAAAG AAGAAATCAC ATCAAAGCAA GAACCTTTAA GCCTTAATGA GGCACCAAGT
GCCTTAGATT TATACTGCAA AGATCTTACA ACTGAAGCTG AAAATGGAAC ATTAGACCCT
GTGATTGGCA GAGAGTCTGA AATAAAAGCA ATTACAAAAG TTTTATCTAG AAGAGGTAAA
AACAATCCAG TACTAATTGG TGCTCCTGGT GTTGGAAAAA CAGCAATTGC AGAATTATTA
GCTCAAAAAA TTGTAGATAA CGAACTTCCT GAATCTCTTC AAGGTCTAAG GCTAATTTCA
CTTGATATCG GTGCATTAAT TGCTGGAGCT AAATTCCGAG GACAATTTGA AGAACGCTTT
AGATCATTAT TAAGTGAAAT CAACAATAGC GAAAAAGGGG TAATCCTATT CATAGATGAA
TTACACACAA TTGTAAGCAA AGACAGATCA AATACTGATG CTGGTAGTCT ATTAAAACCA
TTATTAGCAA GCGGAGACTT AAGATGCATT GGTGCAACTA CTCCAGACAA TTATAGACGT
ACGATCGAAA AAGACCTCGC TCTAAATAGA CGATTTCAGC AAGTATCAAT CAAAGAACCA
AGCTTAGATT TAAGCTTGGA AATCTTAAAA GGACTTAAAG AAAATTACGA GGTTCATCAT
GGCGTAATTA TTACCGATGA AGCACTAATT ACAGCAAATC GTTTAGCCTA TAGATATATA
AGTGATAGAT GCCTACCAGA TAAAGCTATT GACTTAATCG ACGAGGCTTC AGCTCAAGTA
AGAATAGAAT CTGCATCAAA ACCAAAAATC ATAGAAGAGA AAGAATCTCA GGTTAATCAT
TTAGAGTCAT CAATAATAAA TGCAGATAAA GATACAACTT TAGAGACTAT AAATAATCTT
CAAGAAAAGA AAGAATTGCT ACTCTTTGAG TTGGCTGAGA TTAAACAAAA ATGGCAAGAT
CAAATTGATA AATCAGCTGA ATTACAAGAA CTAAAAATAA GCTTGAAAGA ATTAAAAAAT
TTAATAAGAG AAGTGGAAAT CTCTGGTGAT ATGGAAGAAG TAGAAAAACT TAAATACGAC
CAACTCTACC AATTACAAGA AAGAATAGAA GAAATAGAAG TTTCTATTCG AGAAGATAAT
GAGTATGGTA ATTCCTTACT AAAAGATAAA GTCAATCCAG AAGACATCGC TGATGTTGTC
TCAAGATGGA CAGGAATTCC TGTTAGAAAG GTTGTATCGG GTGAAAGACA GAAACTTTTA
AAGTTAGAAC AAGACTTAGG GAAAAAAGTT ATTGGCCAAT TAAATGCTGT TCAAGCAGTC
TCAGCAGCAA TTCGCAGAGC AAGAGCTGGG ATGCAGGATA TAAAAAGGCC CATTGGATCC
TTTCTTTTTC TAGGCCCTAC AGGTGTTGGA AAAACTGAAC TTGCTAAATC ACTAGCAAGT
TCTTTGTTTG ATGAAGAGGA CGCTTTGTTG AGACTTGATA TGAGTGAATA TATGGAGAGA
AATGCTGTTT CAAGACTCTT AGGTGCACCG CCTGGATACG TGGGTTACGA AGAAGGTGGT
CAATTAACAG AGGCAATTAG AAAAAGACCT TATGCAGTTT TGCTTCTTGA TGAGATTGAA
AAAGCTCATC AAGAAGTTTT CAACATCCTA TTACAGGTAT TAGATGATGG AAGACTCACC
GATTCTCAAG GTCGAACAGT AGATTTTAGA AATACAGTTA TTGTTATGAC AAGCAATCTT
GCTAGCAAAG CAATCTTAAA TAATTCACTT CAACTTCAAA GCGAAAATTC AAATAAAAAT
ATTCTTTTAC AAGAATTAGA TCAAAAAATC AACGAAGCTC TAACAAAACA TTTTCGACCT
GAATTTTTGA ACCGCATTGA TGAAGTAATA AAATTCAACC CACTTAAACC TGACAGCTTA
GAGCAAATAG TTCGACTTCA ACTTGATGAA TTAAAGAAGC TTCTAAAGCA CCAAGGTTTA
GACCTTTATG TTGACGAAAA TACTATTAAA ATTCTTGCTG AAGAAGGCTA TGAGCCTGAA
TACGGGGCTA GACCGCTCAG AAGAGTGATT AGAAGAAGAT TAGAAAACCC ACTGGCCACA
CAAATTCTAG AAGAGGCTTT TCAAGGTGCA AAATCAATAA GGGTTGAGAC TAAAGAGGAT
GATTCAGAAA AACTTCTTTT TTTAATAGAT AACTAA
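
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. A minimal sketch of that step, together with one plausible way of computing a GC percentage like the one reported in the record, is shown here; the helper names are hypothetical, and how the source database actually computes or rounds its GC content value is not stated. Only the first line of the sequence is used as sample input.

```python
def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGACCACTT TGAATCAAAA GTCATCAAAA ATGAATGGAA GTCTCACTAC AGAACCAGAT"
seq = join_blocks(first_line)
print(len(seq), "bases, GC =", round(gc_percent(seq), 1), "%")
```

Passing the full sequence block through `join_blocks` and then `gc_percent` gives a figure that can be compared with the GC content field.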
 
Protein sequence
MTTLNQKSSK MNGSLTTEPD SFSDEAWSLL LIAEQSARRW RHKNLDVEHL IEVLFRNKKY 
QKYTNSLPIN HKELNEILEN FIAELPINNQ PDLFIGEDLE ILLEVADDFR SRWGSNQIEI
SHILIAIGRD NRLGEDLFYQ AGLPSEILEA ELRRLPAPKS FKQSKRNQNK PITNRPQKDS
QSFMPTETTA KDPKPEPLPP LSKEEITSKQ EPLSLNEAPS ALDLYCKDLT TEAENGTLDP
VIGRESEIKA ITKVLSRRGK NNPVLIGAPG VGKTAIAELL AQKIVDNELP ESLQGLRLIS
LDIGALIAGA KFRGQFEERF RSLLSEINNS EKGVILFIDE LHTIVSKDRS NTDAGSLLKP
LLASGDLRCI GATTPDNYRR TIEKDLALNR RFQQVSIKEP SLDLSLEILK GLKENYEVHH
GVIITDEALI TANRLAYRYI SDRCLPDKAI DLIDEASAQV RIESASKPKI IEEKESQVNH
LESSIINADK DTTLETINNL QEKKELLLFE LAEIKQKWQD QIDKSAELQE LKISLKELKN
LIREVEISGD MEEVEKLKYD QLYQLQERIE EIEVSIREDN EYGNSLLKDK VNPEDIADVV
SRWTGIPVRK VVSGERQKLL KLEQDLGKKV IGQLNAVQAV SAAIRRARAG MQDIKRPIGS
FLFLGPTGVG KTELAKSLAS SLFDEEDALL RLDMSEYMER NAVSRLLGAP PGYVGYEEGG
QLTEAIRKRP YAVLLLDEIE KAHQEVFNIL LQVLDDGRLT DSQGRTVDFR NTVIVMTSNL
ASKAILNNSL QLQSENSNKN ILLQELDQKI NEALTKHFRP EFLNRIDEVI KFNPLKPDSL
EQIVRLQLDE LKKLLKHQGL DLYVDENTIK ILAEEGYEPE YGARPLRRVI RRRLENPLAT
QILEEAFQGA KSIRVETKED DSEKLLFLID N
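
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a hand-rolled translator, not the tool used to produce this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it uses the first line of the gene sequence as sample input. The function and variable names are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate the 10-base block layout
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGACCACTT TGAATCAAAA GTCATCAAAA ATGAATGGAA GTCTCACTAC AGAACCAGAT"
print(translate(first_line))
```

The 60 bases on that line translate to the first 20 residues of the protein sequence, MTTLNQKSSK MNGSLTTEPD.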